Question

Different mappings, different PCAs using TEtranscripts

0

Entering edit mode

5.0 years ago

compuTE ▴ 140

Hello,

I made two PCA plots from two types of mappings towards the same genome. In one of them, I allow the reads to just map to one loci, and in the other one I let them map 100 times. After that I do transcript assembly with StringTie in the uniquely mapped reads, and i use the software TEtranscripts for the multi mapped reads (this plot is made just with the gene read counts).

The PCA plots look pretty different: The "unique" mapping separates the two conditions pretty well, and the other one not (not at all). I was wondering if I can still trust the results from the second mapping, or what does this really mean?

Thanks!!

Uniquely mapped:

Multi mapped:

pca TEtranscripts • 1.5k views

ADD COMMENT • link 5.0 years ago by compuTE ▴ 140

0

Entering edit mode

Based on which features is the PCA plot, and on how many of them, or on all genes?

ADD REPLY • link 5.0 years ago by ATpoint 81k

0

Entering edit mode

The unique mapping is made with transcripts (79,000), TEtranscripts outputs gene read counts so it is less (24,333 genes).

ADD REPLY • link 5.0 years ago by compuTE ▴ 140

0

Entering edit mode

TETranscripts is a very odd choice for this situation and I'm not sure how well it will perform given that it's meant to be used with genomic alignments in the context of a well-annotated genome (with repeats). Please use something like salmon instead.

What organism is this?

ADD REPLY • link 5.0 years ago by Devon Ryan 104k

0

Entering edit mode

Thank you for your answer. This is human data, and I am using the repeat information that TEtranscripts offer for other analysis.

I was expecting similar results from the two approaches and was wondering if there was a reason behind it. Even if the software was not specifically designed to only work with gene counts, I think it is relevant to make sure the conditions are being somehow differentiated if the software will perform a DEA afterwards.. Don't you think?

ADD REPLY • link 5.0 years ago by compuTE ▴ 140

1

Entering edit mode

There's no need to perform transcript assembly with humans (don't bother with stringTie). What this suggests is that repeat expression is dominating things and not different by condition.