Question: How can I filter my RNA-seq transcript data to display ONLY novel isoforms and variants?
5 weeks ago, wolfgang.rumpf wrote:

I have some RNA-seq data from (human) cancer cells and am trying to find non-canonical transcripts (e.g. splice variants, fusion products, etc.). I have already created a SAM/BAM file of the transcripts and retrieved accession information (using MagicBLAST). What I'd like to do now is filter out the "known" (canonical) transcripts and leave only the novel transcript variants. It seems like there should be an easy way to do this, but I'm at a loss as to where to find a single collection of canonical transcripts to filter against. Help?

7 days ago, geek_y (Barcelona/CRG/London/Imperial) wrote:

Take all the exon-exon junction coordinates from your transcripts (i.e. the coordinates of the introns spliced out between consecutive exons) and check whether they are present in the latest GENCODE and FANTOM CAT GTF files. If a junction is not, pull out the transcripts that contain it and write them to a new GTF file.

This will give you every transcript with at least one unannotated junction, which covers novel exon skipping/inclusion events as well as alternative 3' and 5' splice sites.
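
The junction comparison described above could be sketched roughly as follows. This is only a minimal illustration, not a tested pipeline: it assumes your reads have already been assembled into transcript models in GTF format, that both files use the same chromosome naming, and that every exon line carries a `transcript_id` attribute. The function names (`read_exons`, `introns_of`, `known_introns`, `novel_transcripts`) and the toy coordinates are made up for the example.

```python
import re

_TRANSCRIPT_ID = re.compile(r'transcript_id "([^"]+)"')


def read_exons(gtf_lines):
    """Group exon intervals by transcript: {tid: (chrom, strand, [(start, end), ...])}."""
    exons = {}
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        match = _TRANSCRIPT_ID.search(fields[8])
        if match is None:
            continue
        entry = exons.setdefault(match.group(1), (fields[0], fields[6], []))
        entry[2].append((int(fields[3]), int(fields[4])))
    return exons


def introns_of(chrom, strand, exon_list):
    """Introns (1-based, inclusive) implied by consecutive exons of one transcript."""
    ordered = sorted(exon_list)
    return {
        (chrom, strand, left_end + 1, right_start - 1)
        for (_, left_end), (right_start, _) in zip(ordered, ordered[1:])
        if right_start - left_end > 1  # skip abutting or overlapping exons
    }


def known_introns(gtf_lines):
    """Union of all introns found in a reference annotation."""
    known = set()
    for chrom, strand, exon_list in read_exons(gtf_lines).values():
        known |= introns_of(chrom, strand, exon_list)
    return known


def novel_transcripts(assembled_lines, known):
    """Map each transcript with at least one unannotated intron to those introns."""
    novel = {}
    for tid, (chrom, strand, exon_list) in read_exons(assembled_lines).items():
        unseen = introns_of(chrom, strand, exon_list) - known
        if unseen:
            novel[tid] = sorted(unseen)
    return novel


def gtf_exon(chrom, start, end, strand, tid):
    """Build one toy GTF exon line (hypothetical helper for the demo only)."""
    attrs = f'gene_id "G1"; transcript_id "{tid}";'
    return "\t".join([chrom, "toy", "exon", str(start), str(end), ".", strand, ".", attrs])


# Toy reference annotation: one transcript with three exons.
annotation = [gtf_exon("chr1", s, e, "+", "T1") for s, e in [(100, 200), (300, 400), (500, 600)]]

# Toy assembled transcripts: A1 matches the reference, A2 skips the middle exon,
# A3 uses a different 3' splice site for its second exon.
assembled = (
    [gtf_exon("chr1", s, e, "+", "A1") for s, e in [(100, 200), (300, 400), (500, 600)]]
    + [gtf_exon("chr1", s, e, "+", "A2") for s, e in [(100, 200), (500, 600)]]
    + [gtf_exon("chr1", s, e, "+", "A3") for s, e in [(100, 200), (320, 400)]]
)

known = known_introns(annotation)
novel = novel_transcripts(assembled, known)
for tid, junctions in novel.items():
    print(tid, junctions)
```

With real data you would pass open file handles instead of the toy lists, e.g. `known_introns(open("annotation.gtf"))` (file name hypothetical), and take the union of the intron sets from each reference annotation you want to filter against. The transcripts reported in `novel` are the ones whose lines you would then copy into the new GTF file.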
