how to combine junction.bed files with different rows into one table
0
0
Entering edit mode
9.0 years ago
shirley0818 ▴ 110

Hi All,

I am working on ~100 samples for detecting alternative splicing. TopHat generates a junction.bed file for each sample. However, each of these bed files has different number of rows, and the coordinates of each junction is not same across samples. I think this junction.bed file includes known and novel junctions.

Since I am only interested in known junctions in Ensembl annotation database, how can I map these 100 junction.bed files to Ensembl gtf file and obtain a table matrix with the row as known-junction and column as sampleID?

Or do I need to create a exon-exon junction annotation bed file from Ensembl, then apply RSeQC to obtain reads for each junction against mapped .bam files?

Many thanks,

Shirley

RNA-Seq • 2.4k views
ADD COMMENT

Login before adding your answer.

Traffic: 2080 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6