How To Extract Common Cds Among Huge Data Set?
1
0
Entering edit mode
10.3 years ago
HG ★ 1.2k

Hi everyone, I have 50 Ecoli whole genome sequence data set. I already assemble all the genome. Next step i want to extract all the common CDs among all the 50 contig file in a batch mode. Can anyone please suggest me any approach?

cds • 2.1k views
ADD COMMENT
0
Entering edit mode

as we discussed before, you can do prodigal and follow by cd-hit and counting in clusters

ADD REPLY
0
Entering edit mode

Yes..its one good idea. In fact have started also. Let me compare others approach also. So that i can compare better result.

ADD REPLY
0
Entering edit mode

Hi Pavel, After finding the gene and the traslated file how it is posiible to find funtion of each gene ...for example i need in a tabular format ....

ADD REPLY
0
Entering edit mode
10.3 years ago
aawitney ▴ 20

You could try annotating one of the genomes (I would use prokka http://www.vicbioinformatics.com/software.prokka.shtml). Then BLAST each of the annotated genes against all the other sets of contigs and list only the genes which have a significant hit in each genome,

ADD COMMENT

Login before adding your answer.

Traffic: 3338 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6