Question: where can I get environmental bacteria genome in fasta format (as many as possible)?
gravatar for janghj12000
4.4 years ago by
janghj120000 wrote:


I am trying to obtain environmental bacteria genome data in fasta format. (as many as possible).

That fasta file will be used as input file for biobloomtools' biobloommaker.

Can you tell me where to get it?

I already visited jgi ( and try to use genome portal and img; I have no clue how I can get all fasta files at once..

Thank you.

soil bacteria fasta • 2.1k views
ADD COMMENTlink modified 4.4 years ago by natasha.sernova3.7k • written 4.4 years ago by janghj120000
gravatar for natasha.sernova
4.4 years ago by
natasha.sernova3.7k wrote:

Dear janghj,

You can try NCBI new location. Old NCBI-site has been changed. Now it is situated here: Be careful - some of the genomes here are not finished.

Inside the file mentioned below you will find a particular url for any bacterium you are looking for: Inside each reference you will find fna.gz - genomic gz-files and some other useful files.

For example, you will find the page below for Acidithiobacillales bacterium SM1_46

Old NCBI site has been moved here:

See README files for more information. A lot has been changed there. Good luck!

ADD COMMENTlink modified 4.4 years ago • written 4.4 years ago by natasha.sernova3.7k

Thank you for you respond.

I visited the NCBI new location as you provided.

It seems like it show me all kinds of bacteria. I am only looking to environmental ones...

Is there any way I can grep environmental ones?

Or should I check and download files one by one?

Thank you

ADD REPLYlink written 4.4 years ago by janghj120000

Hello! The tags you used initially imply you are looking for only soil bacteria. Is your question implied how to search NCBI just for soil bacteria? I would go to NCBI general site, and type your question: You will receive several thousand papers. Probably you will have to specify what particular species you are looking for. “Environmental bacteria” – it’s too general question in my opinion. The question has to be as narrow and special as possible. I am not sure the sequence authors mentioned where and only where a particlar bacteria may be found. Probably, you have to make your environment definition as specific as possible. And I am not sure NCBI has a special tag for this kind of search. Actually you can easily check it. Go inside the known "environmental bacteria" genome file and see what information you can find there. I didn't find a general method. I would try to do it one after another just to start. Good luck! Natasha

ADD REPLYlink modified 4.4 years ago • written 4.4 years ago by natasha.sernova3.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 966 users visited in the last hour