Question

Annotaion Based On The Genomic Range

0

Entering edit mode

11.5 years ago

ancient_learner ▴ 680

Hello all I have some data like this related to mouse genome

chr1 3000000 3000090
chr2 4339993 4389898
chr5 3000330 3003339
chr7 3323233 3390393

I know that by using UCSC genome browser we can get the information related to the presence of genes, proteins at those regions. however i am more interested in identifying all functional elements (may be promoters, enhancers tfs etc) with in that region. is there any way to do that. With in UCSC is there any option like that?

genome annotation r ucsc • 5.6k views

ADD COMMENT • link updated 11.2 years ago by Emily 23k • written 11.5 years ago by ancient_learner ▴ 680

Ram · Answer 1 · 2012-11-09

3

Entering edit mode

11.5 years ago

Alex Reynolds 35k

ADD COMMENT • link updated 4.4 years ago by Ram 43k • written 11.5 years ago by Alex Reynolds 35k

0

Entering edit mode

@Alex Can you please help me with this question How to assigning Active Enhancer to every genes within 10 kb region?. Thanks

ADD REPLY • link 7.2 years ago by Bioinformatist Newbie ▴ 270

score 2 · Answer 2 · 2012-11-09

You have to define promoters and enhancers by yourself, there is no proper definition. Get a list of all genes, refer this Fetching Transcription Start And End For A Custom Gene List From Ucsc (Hg18/Ncbi36) for that, change the organism and build. If you know R or any other language, add and subtract the number of bases or a region of some KB (eg +/-1KB) from the TSS (labelled as txStart in the table) strand specifically. This number depends on how you define promters and then use the intersectBed tool from Bedtools. Check this How To Determine Overlaps From Coordinates or manual for usage.

For Enhancers, some people say they are 5-10KB far, but a way to do it would be overlay the ChIP-Seq data(peaks) of p300 (marker for enhancers) on the genome to get the list of enhancers and then intersect with you own file If you know Galaxy, then this might be helpful, From BED Coordinates to Genes

Cheers

score 2 · Answer 3 · 2012-11-09

If you want information on annotating genomic intervals in general see some similar Biostars-posts:

score 1 · Answer 4 · 2013-03-01

1

Entering edit mode

11.2 years ago

Emily 23k

No idea about UCSC, but you can do that using the Ensembl Region Report tool.http://www.ensembl.org/tools.html

This allows you to inout genomic coordinates, then see everything that's within them. There's a tick box list where you can choose what to see. The options are:

Genes, Transcripts and Proteins

Genomic Sequence

Constrained Elements (Conserved Regions)

Variations (SNPs and InDels)

Structural Variations (CNVs etc)

Regulatory Features

ADD COMMENT • link 11.2 years ago by Emily 23k

0

Entering edit mode

Thank you for the reply. i am more interested in Constrained Elements (Conserved Regions) feature. does this tool supports graphical view? I know ECR browser does but I cannot give each coordinate manually.

ADD REPLY • link 11.1 years ago by ancient_learner ▴ 680

0

Entering edit mode

This will just give you a list.

ADD REPLY • link 11.1 years ago by Emily 23k

score 0 · Answer 5 · 2012-11-09

If you are comfortable with little programming and unix or you can use snpEff software and set up databases for different genomic elements like genes, transcription factor binding sites, enhancers etc then it is pretty simple thing to do. You can get most of the files you need from ENSEMBL

http://useast.ensembl.org/info/data/ftp/index.html

The cis-regulatory elements information could be derived from Regulations gff file and Regulation data files.