Question: Common target genes
Nicolas Rosewick (Belgium, Brussels) wrote, 5.7 years ago:


I have a list of ~50 genes and I want to find their common downstream or upstream target genes (within all known pathways). My idea was to download, for every gene, the list of genes that interact with it, and then intersect the results. But I don't know where to start (KEGG, REACTOME, ...). I know it's possible to do this with IPA, but I'm looking for a free solution.


target genes pathway • 2.0k views
B. Arman Aksoy (New York, NY) wrote, 5.7 years ago:

You can accomplish this via the Pathway Commons 2 web service. The query type of interest to you is commonstream, in either the upstream or the downstream direction. For example, the following gives you the common downstream of EGFR and ERBB2:,ERBB2&kind=commonstream&direction=downstream&format=BIOPAX

Or, if you would like this data in a simple interaction format, use this query instead:,ERBB2&kind=commonstream&direction=downstream&format=BINARY_SIF

Or, for output with more identifiers attached to the nodes:,ERBB2&kind=commonstream&direction=downstream&format=EXTENDED_BINARY_SIF

You can script this in the programming language of your choice, or use the Paxtools library (Java) or PaxtoolsR. If you would rather work with a graphical user interface, I suggest either ChiBE 2 or CyPath2; both will let you run these queries easily.
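For the scripting route, here is a minimal Python sketch of how such a query string could be assembled. The `kind`, `direction` and `format` parameters and their values come from the examples above. The endpoint and the name of the gene-list parameter are cut off in those examples, so `base_url` and `gene_param` are hypothetical arguments you would have to fill in from the Pathway Commons documentation. Joining the genes with commas is inferred from the `,ERBB2` fragment and is an assumption.

```python
from urllib.parse import urlencode


def build_commonstream_query(base_url, gene_param, genes, direction, fmt):
    """Assemble a commonstream query URL.

    base_url and gene_param are placeholders (not taken from the thread):
    supply the real endpoint and parameter name from the service docs.
    """
    if direction not in ("upstream", "downstream"):
        raise ValueError("direction must be 'upstream' or 'downstream'")
    params = {
        gene_param: ",".join(genes),  # assumption: comma-separated gene list
        "kind": "commonstream",
        "direction": direction,
        "format": fmt,  # e.g. BIOPAX, BINARY_SIF, EXTENDED_BINARY_SIF
    }
    # safe="," keeps the commas between gene names unescaped
    return base_url + "?" + urlencode(params, safe=",")


if __name__ == "__main__":
    # example.org and "genes" are stand-ins, not the real service values
    print(build_commonstream_query(
        "https://example.org/graph", "genes",
        ["EGFR", "ERBB2"], "downstream", "BINARY_SIF"))
```

The resulting URL can then be fetched with any HTTP client; only the string construction is shown here.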

ADD COMMENTlink modified 5.7 years ago • written 5.7 years ago by B. Arman Aksoy1.2k

Thank you, that's great. Just a quick question about Pathway Commons 2: how can I ask for gene symbols in place of UniProt ID URLs (for example: gives me HBEGF)?


(reply written 5.7 years ago by Nicolas Rosewick)

Right now it is not possible to request another identifier through the web service, but the last example, which produces EXTENDED_BINARY_SIF, will help you get this. In that file, in addition to the standard identifiers, you also have access to other identifiers associated with each node (such as HGNC and Entrez Gene ID). It requires a little more parsing, but it carries much more information than simple BINARY_SIF.
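As a rough illustration of the parsing involved, here is a minimal Python sketch for the simpler three-column SIF layout (source, interaction type, target, tab-separated). It also shows the "intersect the results" idea from the question. The function names and the toy input are made up for illustration and are not real interactions. The extra node columns of the extended format are not shown in this thread, so they are not handled here. The sketch also assumes the first column is the upstream participant, which only makes sense for directed interaction types.

```python
from collections import defaultdict


def parse_sif(text):
    """Map each source node to the set of its targets.

    Assumes tab-separated lines of the form: source, interaction type, target.
    Lines with any other number of fields are skipped.
    """
    targets = defaultdict(set)
    for line in text.splitlines():
        fields = line.strip().split("\t")
        if len(fields) != 3:
            continue
        source, _interaction, target = fields
        targets[source].add(target)
    return targets


def common_targets(targets, genes):
    """Return the targets shared by every gene in `genes`."""
    sets = [targets.get(gene, set()) for gene in genes]
    return set.intersection(*sets) if sets else set()


if __name__ == "__main__":
    # Toy data with placeholder names, not real pathway content
    toy = (
        "GENE_A\tinteracts\tTARGET_1\n"
        "GENE_A\tinteracts\tTARGET_2\n"
        "GENE_B\tinteracts\tTARGET_1\n"
    )
    print(common_targets(parse_sif(toy), ["GENE_A", "GENE_B"]))
```

Swapping the roles of the first and third columns in `parse_sif` would give the upstream variant of the same intersection.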

(reply written 5.7 years ago by B. Arman Aksoy)
Larry_Parnell (Boston, MA USA) wrote, 5.7 years ago:

KEGG and REACTOME are indeed good places to begin identifying the genes encoding proteins that sit in pathways just upstream or downstream of your set of 50 genes, with regard to information flow. The word "target" could instead imply transcription factor (TF) binding, or that your gene encodes a TF. If that is what your analysis entails, you'd need a different approach, but I'm not certain this is your intention.

