Question

Is there any easy way to get ASA/RSA for the whole PDB?

0

Entering edit mode

9.2 years ago

ajingnk ▴ 130

I want to get the accessible surface area for each residue of a protein structure. I need to do that for biological assembly and remove ligands. I hope this could cover all proteins in UniProt. However, I don't have an efficient way to remove ligands. I have data from SIFTS, which maps PDB residue to UniProt residue. What I am thinking is:

I calculate ASA/RSA for all PDBs(Biological Assembly)
I take the maximum ASA for all corresponding residues, and assign the maximum ASA to the residue on UniProt

Is this a right way to do it? Because, here I assume, I will always find a PDB without any ligand for each protein in UniProt.

I am thinking to use DSSP or NACCESS. I hope these two tools can deal with water and other molecule correctly.

Thanks

pdb asa protein • 3.3k views

ADD COMMENT • link updated 24 months ago by Jiyao Wang ▴ 370 • written 9.2 years ago by ajingnk ▴ 130

score 0 · Answer 1 · 2015-02-21

0

Entering edit mode

9.2 years ago

vavrusa ▴ 80

I'm not sure if I understand the question right, but the ligand residues are stored as HETATM, just grep them out.

$ grep -vi HETATM 1avx.pdb > 1avx_nohet.pdb

Note that this also removes non-standard residues, substrate, solvents etc. which you might not want in the PDB anyway for the SA calculation.

ADD COMMENT • link 9.2 years ago by vavrusa ▴ 80

0

Entering edit mode

The problem is that HETATM can be part of protein. Because, modified residues will also be assigned as HETATM. ION is HETATM which may be part of protein also. And, atoms which are not HETATM may not be part of the protein, like binding peptides.

ADD REPLY • link 9.2 years ago by ajingnk ▴ 130

0

Entering edit mode

Makes sense. If you don't know how the composition of the ligands or have PDBs with modified residues or the ligands are part of protein chains, then I'm not sure if I can think of an easy way. What you propose, reconstructing the protein ASA sans the ligand, may work. You can do it the other way round, filter out the ligands based on the ligand database search, but I suppose there's a risk of filtering legitimate parts of the protein. Neither is "easy" computation-wise.

ADD REPLY • link 9.2 years ago by vavrusa ▴ 80

score 0 · Answer 2 · 2022-04-28

0

Entering edit mode

24 months ago

Jiyao Wang ▴ 370

You can use Node.js scripts based on iCn3D to get ASA for a list of structures. The example is at https://github.com/ncbi/icn3d/blob/master/icn3dnode/surfacearea.js. The Instruction is at https://github.com/ncbi/icn3d/blob/master/icn3dnode. You can open issues in GitHub if you run into problems.

ADD COMMENT • link 24 months ago by Jiyao Wang ▴ 370