To determine the number of correct splice junctions from the alignment output from HISAT2
0
0
Entering edit mode
4.0 years ago
Chyguc • 0

Hi,

I am new to Bioinformatics and programming but can follow a clear program in order to arrive at my result.I have a deadline task in alignment of reads.

I played around with Rsubread, HISAT2 and STAR to align 15 million paired simulated data and I got results in BAM file for STAR and Rsubread but SAM format for HISAT2. I will like to know the if the splice junction files have alignment that is the same as in the annotated file and I used hg38 build 2. In other words, I want to know the true or correct splice junctions. As I said earlier, I am new to this. I already have the Rsubread bed file, HISAT2 text file and the SJ.out.tab file from STAR.

What I want to find out is the number of correct splice junctions in each result? How can I go about this? Does anyone have an idea of script I can use or a software that can detect this value or reference to any material that could be useful?

Another question, is that I want to know the splice junctions that are common to the aligners AND the ones that are unique to each. Maybe a script that can produce this result in Venn diagram.

Thanks for your response.

alignment sequencing rna-seq assembly next-gen • 727 views
ADD COMMENT

Login before adding your answer.

Traffic: 2307 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6