Question: Gene vs Transcript Level DEG Analysis
3
gravatar for asumani
28 days ago by
asumani60
Istanbul/Turkey
asumani60 wrote:

Hi all,

I use Ballgown v2.20.0 for DEG analysis after HISAT and StringTie. I conducted both gene and transcript level analysis in stattest function. In gene level, I got only 50 genes with qval < 0.05. In transcript level, I got 1127 transcripts with qval < 0.05. I want to perform gene level analysis for a comparison between another expression results which was analyzed in gene level.

Since I want to capture as many genes as possible, I pulled only the highest expressed isoforms to represent genes among 1127 transcripts. That gave me 1042 'gene names'. I know this sounds like a brute simplification.

Do you have any idea why I get such a few genes in gene level analysis? What is the main biological concern in pulling only the highest expressed isoforms in transcript level analysis?

Thanks in advance,

rna-seq ballgown • 144 views
ADD COMMENTlink modified 28 days ago • written 28 days ago by asumani60
8
gravatar for ATpoint
28 days ago by
ATpoint40k
Germany
ATpoint40k wrote:

Aggregate transcripts to gene level with tximport, then use DEseq2 or edgeR, all tools with extensive documentation. It is not the highest isoform but the sum of all isoforms that should be used to build gene level counts. tximport takes care of that.

ADD COMMENTlink modified 28 days ago • written 28 days ago by ATpoint40k

Thanks a lot! I used tximport followed by limma/voom since my other data set was already analyzed with limma. Limma/voom option is also in tximport manual, so I think that is alright.

ADD REPLYlink written 28 days ago by asumani60

Yes, that is perfectly valid to use limma/voom.

ADD REPLYlink written 28 days ago by ATpoint40k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 776 users visited in the last hour