Entering edit mode
3.1 years ago
shail.nair05
▴
20
I have metatranscriptomic assembled contig. I used Prodigal to predict CDS (which gave me 391250 CDS), and further mapped them to initial reads to find raw count. Then i used the translated protein file from Prodigal (with same number of proteins as that of CDS) to annotate kegg orthologs via KEGG GhostKoala server. The ghostkoala output file shows that there are 391443 proteins ( aprox 190 more than the initial file ). Is this possible? If so, how do i find the raw reads of these 190 CDS/proteins (whose mapping data i do not have)?