sample-specific filtering of counts matrix
19 months ago
Assa Yeroslaviz ★ 1.6k

I know I can filter my counts matrix using this command:

filtered.counts <- counts[rowSums(counts==0)<3, ]

when I want to keep only genes that have zero counts in fewer than three samples.

But is there a way to do the same and remove rows from the matrix only when these three zeros all fall within one condition? I have two conditions with four replicates each. I would like to filter for genes with counts in at least two of them.

Would this kind of filtering make sense? Or would it introduce a bias into the expression matrix?

Thanks, Assa

counts RNA-Seq deseq2 condition • 540 views

You could simply use something like filterByExpr from edgeR.
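As a rough illustration of that suggestion, here is a minimal sketch of how filterByExpr could be applied to a plain count matrix. The simulated counts and the `group` factor (two conditions, four replicates each) are assumptions made up for the example, not data from the question, and the function's default thresholds are left unchanged.

```r
library(edgeR)

# Hypothetical example data: 1000 genes x 8 samples of simulated counts
set.seed(1)
counts <- matrix(
  rnbinom(8000, mu = 20, size = 2),
  ncol = 8,
  dimnames = list(paste0("gene", 1:1000),
                  paste0(rep(c("A", "B"), each = 4), 1:4))
)

# Assumed experimental layout: two conditions with four replicates each
group <- factor(rep(c("A", "B"), each = 4))

# filterByExpr returns one logical value per gene (TRUE = keep)
keep <- filterByExpr(counts, group = group)

filtered.counts <- counts[keep, , drop = FALSE]
```

Because the group sizes are passed in, the number of samples a gene must be expressed in is derived from the smallest group rather than from a fixed count across all samples.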


I would keep the rows (genes) if one condition has all zeros while the rest have non-zero values. Depending on the sequencing depth across samples and conditions, such a gene might simply be under- or over-represented in one condition relative to the others. And yes, sample-specific filtering might introduce biases in the downstream steps.
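One way to express a condition-aware filter in base R, as a sketch only: count the non-zero samples per gene within each condition, and keep a gene if at least one condition reaches a minimum number of non-zero replicates. The toy matrix, the `condition` factor, and the `min.replicates` threshold of 2 are all assumptions for illustration. With this rule, a gene that is all zeros in one condition but detected in the other is kept.

```r
# Hypothetical toy data: 5 genes x 8 samples (4 replicates per condition)
counts <- matrix(
  c(0, 0, 0, 0,  5, 8, 0, 3,   # gene1: silent in A, detected in B
    0, 0, 0, 1,  0, 0, 0, 2,   # gene2: a single non-zero count per condition
    4, 6, 2, 9,  7, 3, 5, 8,   # gene3: detected everywhere
    0, 0, 0, 0,  0, 0, 0, 0,   # gene4: never detected
    3, 0, 0, 2,  0, 0, 0, 0),  # gene5: detected in A only
  nrow = 5, byrow = TRUE,
  dimnames = list(paste0("gene", 1:5),
                  paste0(rep(c("A", "B"), each = 4), 1:4))
)
condition <- factor(rep(c("A", "B"), each = 4))

# Genes x conditions matrix: number of samples with a non-zero count
nonzero.per.condition <- sapply(levels(condition), function(cond) {
  rowSums(counts[, condition == cond, drop = FALSE] > 0)
})

# Assumed threshold: at least 2 non-zero replicates in at least one condition
min.replicates <- 2
keep <- rowSums(nonzero.per.condition >= min.replicates) >= 1

filtered.counts <- counts[keep, , drop = FALSE]
rownames(filtered.counts)
```

In this toy example gene2 and gene4 are removed, while gene1 and gene5, which are detected in only one condition, are retained.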
