Question: Extracting Features That Appear In TSS Regions
0
gravatar for armonazizi
21 months ago by
armonazizi0
armonazizi0 wrote:

Hi,

I'm working with ATAC-Seq data and I need to extract the features in my bed files that are located in transcription start site regions of the mm10 genome. Can anyone recommend a way to do this?

I was thinking of generating a bed file of only tss regions from the mm10 genome and finding the intersect between the TSS file and the sample file. However, I'm not sure how to generate a bed file that only contains TSS sites.

Any help would be appreciated.

Thanks

atac-seq chip-seq • 923 views
ADD COMMENTlink modified 21 months ago by christacaggiano20 • written 21 months ago by armonazizi0
1
gravatar for novice
21 months ago by
novice890
United States
novice890 wrote:

Hi armonazizi,

From my experience, TSS are not explicitly annotated. In fact, they are usually impossible to identify, but I don't know about mm10. Take a look at the annotation file, and decide what feature you want to select. Let's say the features you want have the 3rd column saying 'transcript' (in .gff format). You can extract them into a sorted BED file like so: $ cat mm10.gff | grep -v '^#' | awk '$3=="transcript"' | cut -f1,4,5 | sort -k1,1 -k2,2n -k3,3n > transcripts.bed

From there, I would recommend using BEDtools intersect for your purpose.

ADD COMMENTlink written 21 months ago by novice890

Thanks, I'll give this a try.

ADD REPLYlink written 21 months ago by armonazizi0
1
gravatar for christacaggiano
21 months ago by
UCSF
christacaggiano20 wrote:

I typically used homer annotatePeaks.pl to find my TSS when I was working with Atac-seq data

http://homer.ucsd.edu/homer/ngs/annotation.html

this gives you a good estimate of the distance of the peak to a TSS and the output is an excel table that can be easily made into a bedfile.

Hope this help! ATAC-seq was a tricky point in my bioinformatics career!

ADD COMMENTlink written 21 months ago by christacaggiano20
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1879 users visited in the last hour