Hi, I have a methodological question about creating summary composition plots in phyloseq.
When creating a stacked bar plot of mean relative abundance by group, which approach is more statistically sound, especially for sparse microbiome datasets? Method A: Calculate relative abundance for each individual sample, and then group_by() and take the mean() of those abundances. Method B: Use merge_samples() to pool the raw counts for each group first, and then calculate the relative abundance on the pooled 'meta-sample.' With my sparse data, Method A produces a misleading plot where the bars do not sum to 1.0, while Method B works consistently. I'd like to confirm if Method B is the standard and recommended approach for this type of visualization.