How To Extract Intron Positions And Domains From Entrez Gene XML?
9.7 years ago
Dror ▴ 280

Entrez Gene XML is very complicated. Is there a way to automatically extract all the genomic information (exon-intron positions) and the positions of the assigned protein domains from the XML?

In other words: where in the Entrez Gene XML can I find the intron annotations, and can I find the domain annotations there as well?

If anyone has Python or Perl scripts that can do this, please share.

entrez gene intron xml parsing • 2.2k views
9.7 years ago
Joachim ★ 2.9k

Hi!

There is a paper describing a Perl implementation that can efficiently process Entrez Gene XML: http://bioinformatics.oxfordjournals.org/content/21/14/3189.full

Their software is still available at http://sourceforge.net/projects/egparser/, although it has not been updated in a couple of years.

Joachim
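
For the Python side of the question, here is a minimal sketch rather than a tested pipeline. It assumes that exon coordinates appear as `Seq-interval` elements (with `Seq-interval_from` and `Seq-interval_to` children) nested under `Gene-commentary_genomic-coords`, and that introns are not annotated explicitly, so they are derived as the gaps between consecutive exons. The element names should be checked against a real record; the `SAMPLE` document and the function names are made up for illustration, and the sketch does not cover domain annotations.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified stand-in for one transcript's coordinates.
# A real Entrez Gene record nests this much deeper and carries more fields.
SAMPLE = """
<Entrezgene>
  <Gene-commentary_genomic-coords>
    <Seq-loc><Seq-loc_mix><Seq-loc-mix>
      <Seq-loc><Seq-loc_int><Seq-interval>
        <Seq-interval_from>100</Seq-interval_from>
        <Seq-interval_to>199</Seq-interval_to>
      </Seq-interval></Seq-loc_int></Seq-loc>
      <Seq-loc><Seq-loc_int><Seq-interval>
        <Seq-interval_from>300</Seq-interval_from>
        <Seq-interval_to>399</Seq-interval_to>
      </Seq-interval></Seq-loc_int></Seq-loc>
      <Seq-loc><Seq-loc_int><Seq-interval>
        <Seq-interval_from>500</Seq-interval_from>
        <Seq-interval_to>599</Seq-interval_to>
      </Seq-interval></Seq-loc_int></Seq-loc>
    </Seq-loc-mix></Seq-loc_mix></Seq-loc>
  </Gene-commentary_genomic-coords>
</Entrezgene>
"""


def exon_intervals(genomic_coords):
    """Collect (start, end) pairs from every Seq-interval below the element."""
    exons = []
    for interval in genomic_coords.iter("Seq-interval"):
        start = int(interval.findtext("Seq-interval_from"))
        end = int(interval.findtext("Seq-interval_to"))
        exons.append((start, end))
    # Sort so that minus-strand transcripts (listed in descending order)
    # are handled the same way as plus-strand ones.
    return sorted(exons)


def introns_from_exons(exons):
    """Return the gaps between consecutive exons as inclusive intervals."""
    introns = []
    for (_, prev_end), (next_start, _) in zip(exons, exons[1:]):
        if next_start > prev_end + 1:
            introns.append((prev_end + 1, next_start - 1))
    return introns


def transcript_introns(xml_text):
    """Return one intron list per multi-interval genomic-coords block."""
    root = ET.fromstring(xml_text)
    result = []
    for coords in root.iter("Gene-commentary_genomic-coords"):
        exons = exon_intervals(coords)
        if len(exons) > 1:
            result.append(introns_from_exons(exons))
    return result


print(transcript_introns(SAMPLE))
```

In a real record the same element may also occur for the coding region of the protein product, so it is probably worth filtering on the enclosing `Gene-commentary` (type and accession) before trusting the result. The coordinates are returned exactly as stored in the XML; check whether they are zero-based before comparing them with other annotation sources. For full-size files, `ET.iterparse` would avoid loading everything into memory at once.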
