Question: Cross-reference with PDB database
15 months ago by
adirsommer10 wrote:


I have a list of several thousand proteins with their UniProt IDs. I'm looking for an efficient way to cross-reference it against the PDB tertiary structure database and get a list of the proteins that have a tertiary structure in PDB.

I've tried running BLASTP with the list of UniProt IDs against the PDB database through the NCBI BLAST portal, but too many queries failed with "Error: Failed to read the Blast query: Sequence ID not found", which makes manual filtering inconvenient and inefficient.

Any ideas?

Thank you!

sequencing uniprot protein pdb • 545 views

Use the UniProt ID converter to map them to PDB IDs. That will give you an idea of how many of your proteins are present in PDB. From there you can start looking at the ones that have a tertiary structure.
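If you would rather script the mapping than paste IDs into the web form, here is a minimal sketch. It assumes the current UniProt REST ID mapping service (the `rest.uniprot.org/idmapping` endpoints, the `UniProtKB_AC-ID` and `PDB` database names, and the `from`/`to` fields in the JSON results); none of that is stated in this thread, so check it against the UniProt documentation first. The function names and the batch size of 500 are my own choices.

```python
import json
import time
import urllib.parse
import urllib.request
from collections import defaultdict

# Assumption: base URL of the UniProt ID mapping REST service.
API = "https://rest.uniprot.org/idmapping"


def submit_job(accessions):
    """Submit one batch of UniProtKB accessions for mapping to PDB IDs."""
    data = urllib.parse.urlencode({
        "from": "UniProtKB_AC-ID",  # assumed source database name
        "to": "PDB",                # assumed target database name
        "ids": ",".join(accessions),
    }).encode()
    with urllib.request.urlopen(f"{API}/run", data=data) as resp:
        return json.load(resp)["jobId"]


def wait_for_job(job_id, poll_seconds=3):
    """Poll the job status until it is no longer queued or running."""
    while True:
        with urllib.request.urlopen(f"{API}/status/{job_id}") as resp:
            status = json.load(resp).get("jobStatus")
        if status in ("NEW", "RUNNING"):
            time.sleep(poll_seconds)
            continue
        if status not in (None, "FINISHED"):
            raise RuntimeError(f"ID mapping job ended with status {status}")
        return


def fetch_pairs(job_id):
    """Download all (from, to) pairs of a finished job."""
    with urllib.request.urlopen(f"{API}/stream/{job_id}") as resp:
        return json.load(resp).get("results", [])


def group_by_accession(pairs):
    """Turn [{'from': acc, 'to': pdb_id}, ...] into {acc: [pdb_id, ...]}."""
    grouped = defaultdict(list)
    for pair in pairs:
        grouped[pair["from"]].append(pair["to"])
    return dict(grouped)


def proteins_with_structures(accessions, batch_size=500):
    """Map a long accession list in batches; return only the mapped ones."""
    found = {}
    for start in range(0, len(accessions), batch_size):
        batch = accessions[start:start + batch_size]
        job_id = submit_job(batch)
        wait_for_job(job_id)
        found.update(group_by_accession(fetch_pairs(job_id)))
    return found
```

Calling `proteins_with_structures(my_accessions)` on your list should return a dictionary keyed by the accessions that have at least one PDB cross-reference. Anything missing from the dictionary had no mapping.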

written 14 months ago by genomax64k

There are several similar posts on Biostars; see the one below and the right panel:

Protein PDB ID

written 14 months ago by natasha.sernova3.4k
14 months ago by
Elisabeth Gasteiger1.6k wrote:

Here is the list of UniProtKB entries cross-referenced to PDB, with the columns entry name, reviewed, and database(PDB).
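Once such a list is downloaded as a tab-separated file, intersecting it with your own IDs takes only a few lines. This is a sketch under assumptions: the header names `Entry` and `PDB` and the semicolon-separated PDB column are guesses about the export format, so adjust `acc_col` and `pdb_col` to whatever your file actually contains. The function names are hypothetical.

```python
import csv


def read_pdb_crossrefs(handle, acc_col="Entry", pdb_col="PDB"):
    """Read a tab-separated export and return {accession: [pdb_id, ...]}.

    Assumptions: the file has a header row, and the PDB column holds
    semicolon-separated IDs. Rows with an empty PDB column are skipped.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    mapping = {}
    for row in reader:
        raw = row.get(pdb_col) or ""
        pdb_ids = [p.strip() for p in raw.split(";") if p.strip()]
        if pdb_ids:
            mapping[row[acc_col]] = pdb_ids
    return mapping


def split_by_structure(my_accessions, mapping):
    """Split your own accession list into (with structure, without)."""
    with_structure = [a for a in my_accessions if a in mapping]
    without_structure = [a for a in my_accessions if a not in mapping]
    return with_structure, without_structure
```

For example, open the downloaded file with `open(path, newline="")`, pass the handle to `read_pdb_crossrefs`, then call `split_by_structure(my_accessions, mapping)`. The second list it returns holds the proteins with no PDB cross-reference in the file.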

For reviewed entries only, you can also have a look at this precomputed file:

The reason why some of your NCBI BLAST queries by UniProtKB identifiers fail is that NCBI_nr does not include UniProtKB/TrEMBL. I presume that it is failing on these identifiers?

