Question: Why to remove lowly expressed genes in RNAseq
1
gravatar for Ankit
8 months ago by
Ankit60
Ankit60 wrote:

Hi all,

I have a question regarding the filtering of lowly expressed genes in analysis of RNAseq data. What I understood from literature is that these low counts genes are basically a noise and not a true picture of differentially expressed genes, so they need to be removed if very low counts are observed for all the samples (not only in one sample).

I wonder if there is any other basis of it, mainly in terms of :

1). Differential expression

2). Statistical

3). any Molecular / Biological

I appreciate any suggestion.

Thanks

Ankit

rna-seq • 577 views
ADD COMMENTlink modified 8 months ago by Carlo Yague4.4k • written 8 months ago by Ankit60

StatQuest gives some nice details on this.

ADD REPLYlink written 8 months ago by ATpoint13k
2
gravatar for Carlo Yague
8 months ago by
Carlo Yague4.4k
Belgium
Carlo Yague4.4k wrote:

The main reason behind the idea of discarding low count genes is to NOT test genes for which we presume a difference in expression would not be relevant. If you do less tests, then the correction for multiple testing becomes less stringent, and the overal power of your experiment increases. This concept, called 'independent filtering' is well explained in this publication: Independent filtering increases detection power for high-throughput experiments

ADD COMMENTlink modified 8 months ago • written 8 months ago by Carlo Yague4.4k

Thanks Carlo for the suggestion.

I have one more question after reading this.

How the low expressed gene (or zero counts) is related with type 1 error and false discovery rate. Is there a mathematical correlation?

Please suggest.

Thanks

ADD REPLYlink written 8 months ago by Ankit60
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 796 users visited in the last hour