How do I download all genomes with specific ST type from ncbi
1
2
Entering edit mode
18 months ago
blur ▴ 280

Hi,

I want to download all the genomes available in NCBI that are a certain ST (sequence type) for Klebsiella. Does anyone know how to do that? I am using NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra) I wrote the ST type I wanted in the search engine, but I don't know how to verify that the sequences that I got are what I was aiming at.

I used: (klebsiella ST258) AND "Klebsiella pneumoniae"[orgn:__txid573]

Thank you in advance,

ncbi • 635 views
ADD COMMENT
1
Entering edit mode

I'm not sure that ST is a field that NCBI specifically tracks unless it happens to be in the name of the deposit, but I could be wrong.

I think you might be better off going via an ST database like PubMLST or BIGSDB to at least obtain a list of accessions and then feed this to NCBI.

ADD REPLY
1
Entering edit mode
18 months ago
vkkodali_ncbi ★ 3.7k

You can use NCBI Datasets for this. Specifically, go to the Genomes page and search for "Klebsiella pneumoniae". Out of the >43K genomes, you can filter ST258 ones by clicking on the "Filters" tab and searching for "ST258" in the "Search Within Results" to get 80 genomes. This is a permissive search and can be noisy. Separately, I see that there are 3 genomes for the organism "Klebsiella pneumoniae subsp. pneumoniae ST258". May be that's what you want?

ADD COMMENT
0
Entering edit mode

Thank you, I want as many of the deposited isolates as possible (the 80).

ADD REPLY

Login before adding your answer.

Traffic: 1502 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6