I have a RNA-seq data and want to get the proportion of reads that map to a window of 100pb
between 5'UTR
and CDS
(50 bp
at 5'UTR
and 50 bp
at the CDS). since I am using Tophat
to align and bowtie
to build indexes, I guess I have to get the sequence of that part to build indexes. do you guys know how to find the sequence of that part?
You have BAM files and a GTF or something like that already, yes?
yes I have bam file. I can also download GTF FILE.