database for better classification of illumina contigs
1
0
Entering edit mode
14 months ago
sapuizait ▴ 10

hi everyone

I have sequenced some filth flies guts using Novoseq and assembled the contigs using megahit. I have assigned taxonomy to the contigs using the NR (which is part of another pipeline, that compares predicted aa to the protein NR) but I have A LOT of contigs that are unclasssifed (ca. 78% average from 30 samples). On one hand this is to be expected as there are no published studies with bacterial (or other) genomes from filth flies guts but on the other hand i was wondering if you have any recommendation for another database that may be able to reduce the nr of unclassified contigs.

Thanks

classification illumina contigs NR • 731 views
ADD COMMENT
0
Entering edit mode
14 months ago
shelkmike ★ 1.2k

You can align the contigs by Blastn to NCBI nt.

ADD COMMENT
0
Entering edit mode

thanks - you think that will improve it? i can give it a go

ADD REPLY
1
Entering edit mode

Yes. When I perform taxonomic classification of contigs of a genome assembly, I use NCBI nt instead of NCBI nr, because not all contigs contain protein-coding genes. I usually do this to remove contamination.

ADD REPLY

Login before adding your answer.

Traffic: 1655 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6