Question: Novel TSS in Cufflinks - false positives adjacent to reference
0
gravatar for Chris Gene
24 months ago by
Chris Gene60
United Kingdom
Chris Gene60 wrote:

For an RNA-Seq data set, I performed transcription start site usage analysis using cufflinks. The workflow was as follows: multiple replicates of each genotype (2) sequenced on an Illumina platform; the reads were aligned using STAR; performed assembly using cufflinks; assembled gtf files were merged using cuffmerge (with reference annotation included). Isoform quantification was performed using cuffquant.

I wanted to analyse differential TSS usage. However when I actually look at the differentially expressed TSS, I see that this includes several "novel" TSS, but many of these have a start which differs only 1 nucleotide from the reference. Is this just a mmapping issue, and what can I do to systematcally indentify and remove these that seem to be false positive novel TSS?

Thanks

tss rna-seq alignment cufflinks • 850 views
ADD COMMENTlink modified 24 months ago by Devon Ryan73k • written 24 months ago by Chris Gene60
0
gravatar for Devon Ryan
24 months ago by
Devon Ryan73k
Freiburg, Germany
Devon Ryan73k wrote:

You'll just have to remove any TSS within some small distance of an annotated TSS of the same gene. This isn't a mapping issue, it's an issue with how cufflinks works. Perhaps you'll get better results with stringTie, but I wouldn't hold my breath that the situation is any better there.

ADD COMMENTlink written 24 months ago by Devon Ryan73k

Thanks Devon. Can you think of any automated way to do that? For the novel TSS groups that are adjacent to the gene start I could use a cut-off; but for other TSS more downstream that are also next to annotated TSS, I'm not sure how that would be feasible, even with the annotated files.

I'm also wondering with what confidence one can trust the differential expression analysis then - if reads are redistributed to these TSS that are not actually novel, then the value of the fpkm assigned to this TSS and the adjacent canonical TSS may be miscalculated. Would you agree with that rationalle?

ADD REPLYlink written 24 months ago by Chris Gene60

bedtools closest could be used to filter the novel TSSs. Regarding how reliable the results are for differential TSS usage I can't really say. To be honest, I've never personally used the differential TSS testing.

ADD REPLYlink written 24 months ago by Devon Ryan73k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1454 users visited in the last hour