Question: Bowtie database of all bacterial and fungal complete genomes
0
gravatar for c.e.chong
5 months ago by
c.e.chong20
c.e.chong20 wrote:

Hi all,

I would like to create a database of all the bacterial and fungal complete genomes in the RefSeq database to map all of my Metagenome samples to it.

Does anyone know if this is possible? The manual states that you have to input a comma separated list of fasta files to build the database, is there another way to do this if I have so many fasta files?

Thank you in advance!

ADD COMMENTlink written 5 months ago by c.e.chong20
1

It should certainly be possible provided you have enough compute resources available locally. Instead of using every genome have you considered getting representatives for broad classes? That should reduce the search space to some extent.

There are tools like kraken2 and centrifuge that are meant for taxonomic assignment of reads that may also be more appropriate in this case.

ADD REPLYlink modified 5 months ago • written 5 months ago by genomax75k

Thank you for your reply!

I have considered getting representatives for broad classes. My plan was to use CD-HIT to remove redundancy once I had downloaded all of the bacterial and fungal complete genomes, do you think this is the best method?

I have used Kraken2 but I want to access the bam files so that I can then use this for statistics and I am unsure how to do this from the kraken output?

Thanks!

ADD REPLYlink written 5 months ago by c.e.chong20
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1817 users visited in the last hour