Question: What is the current state-of-the-art for differential abundance in microbiome pathway analysis?
gravatar for ariel
9 months ago by
ariel110 wrote:

I've seen other folks use DESeq2 for performing differential abundance analysis after pathway annotation with picrust2, piphillin, tax4fun, etc.

However, when I look at my KO (KEGG Orthologs) abundances or mapped pathway abundances, they are definitely NOT ZINB (zero-inflated negative binomial) distribution, which is the model DESeq2 is based upon.

So, I'm curious what others feel about DESeq2 being the right approach, and/or if there are other normalization/analysis algorithms that may be superior.

As a note, I found that using both PICRUSt2 and PiPHILLIN that my abundance distributions tend to be bimodal.

ADD COMMENTlink written 9 months ago by ariel110

You won't find a consensus. DESeq2 assumes negative binomial distribution which should be correct for metagenomics reads, I don't know how picrust2 etc. compute pathway abundance. The problem with DESeq2 (and all other RNAseq analysis software) is that it assumes that most of the rows are the same between the samples which is usually incorrect for metagenomics.

Your best shot is using a simple non-parametric test, although its power will be low.

ADD REPLYlink written 9 months ago by Asaf8.3k

Thing is, we have a lot of important covariates, so Mann-Whitney or Kruskal-Wallace won't work :(

ADD REPLYlink written 9 months ago by ariel110

Maybe a logistic regression for the presence of a pathway?

ADD REPLYlink written 9 months ago by Asaf8.3k

I like that idea, but it would sacrifice the abundance information. I'm going to try linear regression on the log aundances and see what happens. The log distributions don't look "too" bad.

ADD REPLYlink written 9 months ago by ariel110
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1485 users visited in the last hour