Question: Dealing with NAs in microbiome transcriptome count data for differential expression
gravatar for MAPK
5 months ago by
United States
MAPK1.4k wrote:

I am analyzing microbiome data from human gut samples and wanted to do DESeq2 analysis. I have lots of NAs in my count matrix. The reason I have NA's is because one sample may have a particular group of microorganisms that is completely or partially present in other samples. Normally, I don't get NAs while analyzing RNAseq data from a species, but for this microbiome I am getting lots of NAs. How should I deal with these NAs? Should I remove the rows with one or more NAs from the count matrix or replace NAs with zero(as they may not be part of that sample's microbiome). If I remove rows with NAs, I will be left with only 5 rows (very few loci are shared across all samples). Any suggestion would be appreciated. Thanks

rna-seq deseq2 microbiome • 261 views
ADD COMMENTlink modified 5 months ago • written 5 months ago by MAPK1.4k

What do you mean by NAs? Isn't this count data? Shouldn't be zeros from the beginning? If something is not present in a sample, when you map and count you should get zero counts, not NAs.

ADD REPLYlink written 5 months ago by h.mon24k

I mapped them to Trinity assembled contigs from all metatranscriptome data, so each sample is mapped differently to the assembled contigs. Meaning one sample has genes that are not present in other samples.

ADD REPLYlink modified 5 months ago • written 5 months ago by MAPK1.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1096 users visited in the last hour