Entering edit mode
5.3 years ago
Evergreen
▴
10
Hello,
I would like to extrac intronic sequences from Genbank files.
I need the sequences of the introns in a FASTA format. So far I have a script which extracts the CDS sequences from a Genbank file. I think the complement of the CDS sequences (introns) should be given somehow to complete the task. Could somebody take a look at it?
My script:
from Bio import SeqIO
from Bio import GenBank
for rec in SeqIO.parse("nidulans.gb", "genbank"):
if rec.features:
for feature in rec.features:
if feature.type == "CDS":
print (feature.location)
print (feature.location.extract(rec).seq)
Please help me!
Thank you in advance!