How to download via biopython (proteom or genome) individual or random organisms?
1
0
Entering edit mode
3.0 years ago
ja4123 ▴ 20

Hey! Question as in the title. Can someone give some advice or suggest some biopython functions? Kindly help.

genom proteom • 1.4k views
ADD COMMENT
1
Entering edit mode
3.0 years ago
GenoMax 141k

Why does it have to be biopython? When you have specific requirements like that you should make a note of why that is so.

Otherwise see the solutions in: How to download all Pseudomonas aeruginosa Genomes from NCBI Genomes database?

ADD COMMENT
0
Entering edit mode

Thanks for the answer! For example I need 100 randomly selected Bacteria from UniProt, it would be hard to do it by hand.

ADD REPLY
1
Entering edit mode

Proteomes listed on UniProt site for bacteria appear to be in a random order. You could simply take the first 100. See "non-redundant" and "non-excluded" filters at top of page to further refine the selections.

ADD REPLY
0
Entering edit mode

It is possible to do the same using PDB?

ADD REPLY
0
Entering edit mode

PDB has individual structures of proteins. Are you referring to random individual proteins?

ADD REPLY
0
Entering edit mode

Less by PDB. Going back to the example (100 randomly selected bacteria) in the link you put in there is a download option (you can select 100) but there is no sequence information in what you can download.

ADD REPLY
1
Entering edit mode

You will need to click through on an entry until you get to page (example LINK) where you can select all proteins and download them as fasta file.

Better option: You should be able to download the proteomes from UniProt's FTP site. Here is the page for bacteria. A random example of protein fasta.

ADD REPLY

Login before adding your answer.

Traffic: 1972 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6