Question: Cross-reference with PDB database
gravatar for adirsommer
22 months ago by
adirsommer10 wrote:


I have a list of several thousand proteins and their UNIPROT IDs. I'm looking for an efficient method of cross-referencing it against the PDB tertiary structure database, and get a list of those proteins with a tertiary structure in the PDB database.

I've tried to BLASTP the list of UNIPROT IDs against the PDB database, using the NCBI BLAST portal but encountered too many errors of "Error: Failed to read the Blast query: Sequence ID not found", making the process of manual filtering not convenient and not efficient.

Any ideas?

Thank you!

sequencing uniprot protein pdb • 778 views
ADD COMMENTlink modified 22 months ago by Elisabeth Gasteiger1.7k • written 22 months ago by adirsommer10

Use UniProt ID converter to map them to PDB ID's. That can give you an idea of how many are present in PDB. From there you can start looking for things with a tertiary structure.

ADD REPLYlink modified 22 months ago • written 22 months ago by genomax74k

there are several similar posts in Biostars,

see this one below and the right panel:

Protein PDB ID

ADD REPLYlink written 22 months ago by natasha.sernova3.6k
gravatar for Elisabeth Gasteiger
22 months ago by
Elisabeth Gasteiger1.7k wrote:

Here is the list of UniProtKB entries cross-referenced to PDB,entry%20name,reviewed,database(PDB)

For reviewed entries only, you can also have a look at this precomputed file:

The reason why some of your NCBI BLAST queries by UniProtKB identifiers fail is that NCBI_nr does not include UniProtKB/TrEMBL. I presume that it is failing on these identifiers?

ADD COMMENTlink written 22 months ago by Elisabeth Gasteiger1.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1727 users visited in the last hour