How to deal with differences in sample size for metagenomic diversity based on abundance?
Entering edit mode
21 months ago

I have a large metagenomic sample set across 15 sampling locations. some locations have as many as four replicates while others have only a single replicate, meaning the number of fastq reads I have for each site can vary by large numbers.

I can calculate abundance with Bracken, and pull the data into something like phyloseq for diversity analysis, but do I need to normalize the number of reads? That is, I don't want my diversity estimates to be bias because of unequal sample size.

What is the best way to deal with this problem? any help is appreciated.

metagenomics sample-size • 306 views

Login before adding your answer.

Traffic: 2149 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6