Web resources for downloading intergenic regions from many plant species?
9.1 years ago
biolab ★ 1.4k

Hi, everyone,

I am starting to work on plant genomes and would like to download intergenic region sequences for many plant species. Ensembl BioMart works well for downloading CDS sequences, but it does not seem to provide intergenic sequences. Does anyone know how to download plant intergenic sequences? Other web resources are fine too. Thanks a lot in advance.

intergenic plant • 2.3k views
9.1 years ago
mark.ziemann ★ 1.9k

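Download the genome GTF file from the Ensembl FTP site. It contains positional information for genes, exons, CDS, etc. On Linux, extract the gene coordinates by keeping the rows whose third column (the feature type) is gene, for example with awk -F'\t' '$3 == "gene"' file.gtf (grep -w "gene" file.gtf usually works too, but it matches the word anywhere on the line). Then use bedtools complement to get the coordinates of the intergenic regions; it expects sorted input and a genome file of chromosome sizes passed with -g. Finally, use bedtools getfasta with the genome FASTA to retrieve the sequences.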
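If it helps to see what the complement step is doing, here is a minimal Python sketch of the same idea, not a replacement for bedtools. It assumes a standard tab-separated GTF with the feature type in column 3 and pools genes from both strands. The function names and the toy data are made up for illustration; the chromosome sizes would normally come from a .fai index or a similar genome file.

```python
from collections import defaultdict


def gene_intervals(gtf_lines):
    """Collect 0-based, half-open gene intervals per chromosome from GTF lines."""
    genes = defaultdict(list)
    for line in gtf_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header comments and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "gene":
            continue  # keep only rows whose feature type is exactly "gene"
        # GTF coordinates are 1-based inclusive; BED is 0-based half-open.
        genes[fields[0]].append((int(fields[3]) - 1, int(fields[4])))
    return genes


def intergenic_intervals(genes, chrom_sizes):
    """Return the gaps between genes as (chrom, start, end) BED-style tuples."""
    gaps = []
    for chrom, size in chrom_sizes.items():
        cursor = 0  # end of the region covered by genes so far
        for start, end in sorted(genes.get(chrom, [])):
            if start > cursor:
                gaps.append((chrom, cursor, start))
            cursor = max(cursor, end)  # handles overlapping or nested genes
        if cursor < size:
            gaps.append((chrom, cursor, size))
    return gaps


# Toy input (hypothetical): three genes on chr1, two of them overlapping.
toy_gtf = [
    "#!genome-build toy\n",
    'chr1\tsrc\tgene\t11\t20\t.\t+\t.\tgene_id "g1";\n',
    'chr1\tsrc\texon\t11\t20\t.\t+\t.\tgene_id "g1";\n',
    'chr1\tsrc\tgene\t16\t30\t.\t-\t.\tgene_id "g2";\n',
    'chr1\tsrc\tgene\t41\t50\t.\t+\t.\tgene_id "g3";\n',
]
toy_sizes = {"chr1": 60, "chr2": 25}

toy_gaps = intergenic_intervals(gene_intervals(toy_gtf), toy_sizes)
for chrom, start, end in toy_gaps:
    print(f"{chrom}\t{start}\t{end}")  # BED lines
```

The printed lines are in BED format, so written to a file they could be passed to bedtools getfasta together with the genome FASTA. Note that a chromosome with no annotated genes comes out as one whole-chromosome interval, which you may or may not want to count as intergenic.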


Hi mark.ziemann,

Thank you very much for your answer!
