Question: Parsing Non-Coding Region Around Protein Of Interest From Embl File
0
gravatar for Pappu
6.7 years ago by
Pappu1.9k
Pappu1.9k wrote:

I want to parse noncoding DNA sequences around a protein (P08707) from an embl file: http://www.ebi.ac.uk/ena/data/view/U32222&display=txt&expanded=true

I could grep '^ CDS' and then figure out the non coding regions and compare to the location of the target protein in python. I am wondering if there are any smarter way of doing it. Thanks.

python • 1.6k views
ADD COMMENTlink modified 6.7 years ago • written 6.7 years ago by Pappu1.9k

If you are just pulling one sequence why not just copy and paste from the link? Just look for 22433..23011 in the genomic sequence. Or you could pull the fasta and use a subsequence program: http://code.google.com/p/biopieces/wiki/extract_seq

ADD REPLYlink modified 6.7 years ago • written 6.7 years ago by Zev.Kronenberg11k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1179 users visited in the last hour