How to filter nanopore transcriptome alignments to trust 3' ends?
0
0
Entering edit mode
3.9 years ago

I have direct RNA data mapped to the gencode transcriptome with minimap2. Finding the 'true' transcript of origin for a read is nontrivial as there are many secondary alignments with very close alignment scores to the primary. After visualising I can see some alignments are to transcripts which start further 3 prime than my alignment. However, due to the mechanism of direct RNA sequencing, the three prime ends of reads are the true end site.

I want to discard alignments to transcripts that have a 3' start site over 100nt prior to my read start site.

I've thought about simply extracting TES from the gencode gtf but these are genomic coordinates and I need to use the transcriptome mapping. Another way I've been thinking is if the query end site is over 100nt of my read end site, to discard the alignment. But I am not sure how to do this, any ideas? Thanks.

nanopore direct RNA minimap2 • 1.2k views
ADD COMMENT
0
Entering edit mode

Did you end up solving this issue? I am facing it now... direct RNA sequencing is tough!

ADD REPLY
0
Entering edit mode

the problem looks simple but I would need a example bam with a few reads to test.

ADD REPLY
0
Entering edit mode

are u the dude from jvarkit? cuz if so, nice work bro

ADD REPLY

Login before adding your answer.

Traffic: 2359 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6