Question

Approach To Get All Ligands Structure From Pdb That H-Bond To Gln Without Downloading Coordinates One By One And Analyzing

4

Entering edit mode

12.9 years ago

Harijay ▴ 60

I have a csv list of all ligands in the pdb and their associated ids.This list has 23000 entries and possibly 10,000 unique pdb ids

I need to compile a list of all ligands from this list which are within h-bonding distance of Gln.

My current approach is to download-pdb coordinate then query and report using pymol and python scripting , since I know the pymol API and python.

Instead , can I simply get the list of interactions already compiled for each pdb id using some HTTP/REST/Web service interface and then get at the presence absence of Gln in that list?

I am looking for suggestions on how to implement this without downloading and querying.

python structural pdb • 7.7k views

ADD COMMENT • link updated 10.8 years ago by Biostar 20 • written 12.9 years ago by Harijay ▴ 60

0

Entering edit mode

Can you please add few PDB IDs to the question ?

ADD REPLY • link 12.9 years ago by Khader Shameer 18k

Ram · Answer 1 · 2011-05-25

You can download the software from http://www.ebi.ac.uk/pdbe-site/pdbemotif/

under the link Download the software and source code there. It should generate a database of all interactions <16A and possibly divide into interaction types like H-bonds, van der Waals (I am not sure exactly, you must try yourself).

The only problem that according to README generation of the database will take a lot of time and disk space!

score 3 · Answer 2 · 2011-05-28

3

Entering edit mode

12.9 years ago

Joerg Kurt Wegner ▴ 230

The easiest might be using the CREDO database, which comes with a CC-SA license, a MySQL dump, and a well developed Python interface.

ADD COMMENT • link 12.9 years ago by Joerg Kurt Wegner ▴ 230

Ram · Answer 3 · 2011-05-24

2

Entering edit mode

12.9 years ago

Michael Schubert ★ 7.1k

I'm afraid the tool you're looking for does not exist.

As you already know Python, there's a relatively easy way to do it using NeighbourSearch from the PDB module of BioPython on the PDB files directly (i.e, not loading them into PyMOL).

You're welcome to refine your question if you get stuck somewhere along the way.

ADD COMMENT • link updated 4.6 years ago by Ram 43k • written 12.9 years ago by Michael Schubert ★ 7.1k

0

Entering edit mode

I always wanted to familiarize myself with the biopython modules. But its just that I am so familiar with the pymol API- i.e python scripting of "headless" pymol that I will probably use it for this project

ADD REPLY • link 12.9 years ago by Harijay ▴ 60

0

Entering edit mode

If you're working with pdb structures a lot the module will definitely benefit you, if for no other than the reason that it runs at (almost) C speed without the PyMOL-specific overhead.

ADD REPLY • link 12.9 years ago by Michael Schubert ★ 7.1k

0

Entering edit mode

Does the module handle alternate locations for a residue?. I may end up going this route since I am having problems running "headless" pymol

ADD REPLY • link 12.9 years ago by Harijay ▴ 60

0

Entering edit mode

Thanks for this Pointer . Currently reading : PDB file parser and structure class implemented in Python T Hamelryck (2003). PMID :14630660 The Biopython PDB library seems to fit the bill. Plus having problems running headless pymol

ADD REPLY • link 12.9 years ago by Harijay ▴ 60

Ram · Answer 4 · 2011-05-25

2

Entering edit mode

12.9 years ago

Khader Shameer 18k

AFAIK, this data is not readily available, but I know this data is hidden somewhere in PDBSum.

For example See PDB ID 1ASH in PDBSum, here you can see the plot, this plot includes the information about the residues that interacts with the ligands (red squares indicates residues which interacts with the ligand). This data must be available from PDBSum. You may request Dr. Roman Laskowski for the raw data. In 2008, I requested similar type of data from PDBSum and they provided scriptable (via Mechanize) access to the data that generate similar type of figures.

alt text

They also have LIGPLOTs of the PDB Structures, for example:

alt text

To produce LIGPLOTs they should have run HBPLUS, you can use those HBPLUS output to get the information you are looking.

PS. NON-REST/WS option: Download PDBs, install HBPLUS / hbond(part of JOY package). Run the program, parse file using the chain IDs. Filter for Glycines.

ADD COMMENT • link 12.9 years ago by Khader Shameer 18k

1

Entering edit mode

Hi Khader , Thanks to your suggestion I wrote to Dr Laskowski . In his very detailed reply he told me how I could accomplish this. Basically the grow.out info is what I need

"The address of each grow.out file is:

http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/XXXX/grow.out

where XXXX is the PDB code."

ADD REPLY • link updated 4.6 years ago by Ram 43k • written 12.9 years ago by Harijay ▴ 60

1

Entering edit mode

Hari Glad that it worked !!

ADD REPLY • link 12.9 years ago by Khader Shameer 18k

0

Entering edit mode

I was really hoping that the raw data that went into ligplot was queryable. Regardless thanks for the pointer to HBPLUS.

ADD REPLY • link 12.9 years ago by Harijay ▴ 60

0

Entering edit mode

I cannot use HBPLUS because of licence restrictions

ADD REPLY • link 12.9 years ago by Harijay ▴ 60

0

Entering edit mode

How about JOY See: http://www.ncbi.nlm.nih.gov/pubmed/9730927 http://tardis.nibio.go.jp/joy/download.htm ?

ADD REPLY • link 12.9 years ago by Khader Shameer 18k

score 1 · Answer 5 · 2011-05-24

1

Entering edit mode

12.9 years ago

Woa ★ 2.9k

You may check the (commercial) RELIBASE databse. Knowledge based scoring functions typically use the same kind of information you've mentioned. Check if ISOSTAR/SUPERSTAR can serve your purpose. =

ADD COMMENT • link 12.9 years ago by Woa ★ 2.9k