small representative database
17 months ago
lagartija ▴ 90

I wonder if there is a representative database somewhere that is smaller than RefSeq. I would need genomes from different phyla of Bacteria, Archaea, Viruses and Eukaryotes that I can download directly to my computer. This would probably be very useful for anyone who wants to run some pre-tests before using RefSeq (too big) or metagenomic datasets (uncertain taxonomy).

taxonomy data genomes proteomes • 277 views
If you need representative metagenomic databases, take a look at those provided by Kaiju (see the box on the left of the page).

17 months ago
Mensur Dlakic ★ 12k

UniProt has all kinds of reference databases, including those clustered at 50% and 90% identity. Files related to taxonomic groups are here.

This is great, thank you! I think UniProt is biased toward model organisms, but for pre-tests it is more than enough.