Question: Parsing Non-Coding Region Around Protein Of Interest From EMBL File
6.2 years ago, Pappu (1.9k) wrote:

I want to parse the noncoding DNA sequences flanking a protein (P08707) from an EMBL file:

I could grep '^ CDS', work out the noncoding regions from the CDS coordinates, and then compare them to the location of the target protein in Python. I am wondering if there is a smarter way of doing it. Thanks.


If you are just pulling one sequence, why not copy and paste it from the link? Just look for 22433..23011 in the genomic sequence. Or you could pull the FASTA and use a subsequence program:
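If no subsequence tool is at hand, the same extraction can be sketched in a few lines of Python. This is a minimal illustration, not a specific program: it assumes the genomic sequence has been saved as a single-record FASTA file, and the function names and the `flank` parameter are invented for the example.

```python
def read_fasta_sequence(lines):
    """Concatenate the sequence lines of the first record in a FASTA file."""
    chunks = []
    seen_header = False
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if seen_header:
                break  # stop at the second record
            seen_header = True
            continue
        if line:
            chunks.append(line)
    return "".join(chunks)


def subsequence(seq, start, end, flank=0):
    """Slice 1-based inclusive start..end, widened by flank bases per side."""
    lo = max(start - flank, 1)
    hi = min(end + flank, len(seq))
    return seq[lo - 1:hi]
```

For the coordinates mentioned above, `subsequence(seq, 22433, 23011)` would return the gene itself, and a nonzero `flank` would pull in the surrounding bases as well (which may include neighbouring coding sequence, so it is worth checking against the CDS annotations).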

modified 6.2 years ago • written 6.2 years ago by Zev.Kronenberg (11k)
