Which atoms are not found in protein PDB files?
1
0
Entering edit mode
22 months ago
user366312 ▴ 20

I am developing an educational Bioinformatics framework, I need to know

  • Which atoms are absolutely not found in any PDB files?

in the following list -

pdb protein • 391 views
ADD COMMENT
2
Entering edit mode
22 months ago
Wayne ★ 2.0k

So far from my limited searching I haven't found a list that already compiles this information However, the data where you could mine the elements that are found in PDB files is available and updated weekly.

Proteopedia's 'Ligand' page is a good jumping off point for what I found. It has a lot of links about the non-standard resides and heteroatoms found in PDB file entries. Importantly, it links to the Chemical Component Dictionary that is updated weekly.

"This dictionary contains detailed chemical descriptions for standard and modified amino acids/nucleotides, small molecule ligands, and solvent molecules"

The full dictionary is available there and it each entry has a line _chem_comp.formula that could be parsed to collect all the represented elements. From there you could use Python's sets mat to find those elements not represented. Of course, you could do similar with your favorite computational language.

The dictionary and history are further described the Chemical Component Dictionary page of the Worldwide Protein Data Bank. There's even an associated publication by Westbrook et al., 2014.

ADD COMMENT

Login before adding your answer.

Traffic: 2712 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6