Grasp the 200bp gene sequence before all coding sequence in a bacteria genome
2
0
Entering edit mode
6.7 years ago
gaoyanwang • 0

Hi all,

I am a rookie in computational biology and am learning on using python to do some analysis. Could any one give me some advise in the following questions? Thanks a lot!!

In bacteria genome, the 200-300bp before coding sequence (CDS) are usually the regulatory region. There are about 5000 annotated CDS in a certain bacteria strain and I would like to output all the 5000 regulatory regions for the CDS. Is there a package or developed scripts to do so?

Thanks a lot!

Best,

Grace

genome • 1.1k views
ADD COMMENT
0
Entering edit mode
6.7 years ago
 bedtools flank
 bedtools getfasta

Those are the basic steps, assuming you don't have splicing (if you do then you'll need to do some filtering).

ADD COMMENT
0
Entering edit mode

Thank you very much Devon. I would not do any splicing. Would there be a tool to convert my .gb file to bed format? Thanks!

ADD REPLY
0
Entering edit mode
6.6 years ago
gaoyanwang • 0

Thank you very much Devon. I would not do any splicing. Would there be a tool to convert my .gb file to bed format? Thanks!

ADD COMMENT

Login before adding your answer.

Traffic: 2635 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6