Finding the closest genome in GenBank
7 months ago
arshad1292 ▴ 70

Hello,

I have a few draft bacterial genomes that I would like to BLAST in their entirety to find the closest genomes in GenBank. Can anyone suggest a tool or a way to do that?

Many thanks,

bacterial genome blast genbank • 605 views

If you know the taxid, you can find the closest whole genome using gaas_ncbi_get_genome_tree.pl from GAAS.


I don't know the taxid. All I have are contig sequences.


There is no tool that aligns a draft genome against all genomes in GenBank. First you must identify which taxonomic lineage (taxid) your draft genomes belong to. To figure out the taxid, get the 16S sequence from the annotated genome, or use the Type Strain Genome Server to find the closest type strain.

7 months ago
Mensur Dlakic ★ 20k

You may want to look into tools that determine the similarity based on large-scale matching of k-mers, sketches, etc.
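To illustrate what k-mer sketching means in practice, here is a minimal sketch of a bottom-s MinHash comparison in Python. It is not how any particular tool is implemented; the function names, the choice of hash (BLAKE2b), and the defaults `k=21` and `size=1000` are assumptions made for illustration only.

```python
import hashlib


def canonical_kmers(seq, k=21):
    """Yield canonical k-mers: the smaller of each k-mer and its reverse complement."""
    complement = str.maketrans("ACGT", "TGCA")
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip k-mers containing ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            yield min(kmer, kmer.translate(complement)[::-1])


def sketch(seq, k=21, size=1000):
    """Bottom-s MinHash sketch: the `size` smallest distinct 64-bit k-mer hashes."""
    hashes = {
        int.from_bytes(hashlib.blake2b(km.encode(), digest_size=8).digest(), "big")
        for km in canonical_kmers(seq, k)
    }
    return sorted(hashes)[:size]


def jaccard_estimate(sketch_a, sketch_b, size=1000):
    """Estimate the Jaccard index of two k-mer sets from their bottom-s sketches."""
    merged = sorted(set(sketch_a) | set(sketch_b))[:size]
    if not merged:
        return 0.0
    shared = set(sketch_a) & set(sketch_b)
    return sum(1 for h in merged if h in shared) / len(merged)
```

For a real draft genome you would concatenate the hashes from all contigs into one sketch; the estimate approaches the true k-mer Jaccard index as `size` grows.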


Yes, KMCP provides some prebuilt databases for genome searching. There is none for GenBank, but you can use GTDB. Here's the tutorial.

Sourmash provides databases for GTDB too, and old GenBank databases (2018) are also available.

Both tools work great; the biggest challenge is downloading the whole GenBank database. Building the database using a sketching algorithm is also fast.
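Once a reference collection has been reduced to k-mer sets or sketches, finding the closest genome amounts to ranking the references by distance to the query. The following is a rough illustration, not the method used by KMCP or sourmash: it uses exact k-mer sets instead of sketches for brevity, converts the Jaccard index j to the Mash distance D = -(1/k) ln(2j / (1 + j)), and all names (`rank_references`, the reference labels) are hypothetical.

```python
import math


def kmer_set(seq, k=5):
    """Return the set of all k-mers in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard index of two k-mer sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mash_distance(j, k):
    """Mash distance from a Jaccard index; 1.0 when no k-mers are shared."""
    if j <= 0:
        return 1.0
    return max(0.0, -math.log(2 * j / (1 + j)) / k)


def rank_references(query, references, k=5):
    """Return (name, distance) pairs sorted from closest to most distant."""
    q = kmer_set(query, k)
    scored = [
        (name, mash_distance(jaccard(q, kmer_set(seq, k)), k))
        for name, seq in references.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])
```

With real genomes you would use a larger k (21 or 31 are common choices) and sketches rather than full k-mer sets, which is what keeps database building fast.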

7 months ago

You can use BlobTools to identify the taxonomy of your sequences, and then use other tools to find the closest relative.

6 months ago
arshad1292 ▴ 70

I have found an easy way to download all the reference genomes available in GenBank and include your own genomes among them.

Use PhyloPhlAn 3.0; the details are here:

https://github.com/biobakery/biobakery/wiki/PhyloPhlAn-3.0:-Example-02:-Tree-of-life