Question: Web resources for downloading intergenic regions from many plant species?
gravatar for biolab
4.9 years ago by
biolab1.2k wrote:

Hi, everyone,

I am starting to work on plant genomes.  I would like to download intergenic region sequences from many plant species.  Ensembl Biomart is a good site for downloading CDS sequences, however, it seems to be unable to provide intergenic sequences.  Does anyone know how to download plant intergenic sequences?  Any other web resources are also OK.   I much appreciate your comments.  THANKS a lot in advance.

intergenic plant • 1.4k views
ADD COMMENTlink modified 4.9 years ago by mark.ziemann1.2k • written 4.9 years ago by biolab1.2k
gravatar for mark.ziemann
4.9 years ago by
mark.ziemann1.2k wrote:

Download the genome GTF file from the Ensembl FTP site. It contains positional information for genes, exons, CDS, etc. In linux, use grep -w "gene" file.gtf to extract coordinates of genes, then bedtools complement to extract the coordinates of the intergenic regions and bedtools getfasta to retrieve the sequences.

ADD COMMENTlink written 4.9 years ago by mark.ziemann1.2k

Hi mark.ziemann,

Thank you every much for your answer!

ADD REPLYlink written 4.9 years ago by biolab1.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1676 users visited in the last hour