Hi everyone!

I have taxonomic abundance count data obtained from shotgun metagenome analysis. My data are zero-inflated and not normally distributed, and the samples in my dataset have unequal library sizes. In this case, how can I normalize my dataset for downstream analysis? And once I normalize the data, can I use it for all downstream analyses? What I mean is, does each analysis need a different normalization method?

I am open to any tips, suggestions, and information.

Thank you all!

You don't need normalization. You need to accurately model your data. That's it.
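
To make "model your data" a bit more concrete, here is a minimal sketch of one way to do it, not the definitive one. It assumes a single taxon, two groups of samples, and a Poisson log-linear model in which the library size enters as an offset (exposure) instead of being divided out beforehand. The function name `poisson_offset_fit` and the toy numbers are made up for illustration. Real metagenomic counts are usually overdispersed and zero-inflated, so a negative binomial or zero-inflated model would normally be a more realistic choice; the offset idea stays the same.

```python
import math


def poisson_offset_fit(counts, lib_sizes, groups):
    """Closed-form MLE for count_i ~ Poisson(lib_size_i * exp(b0 + b1 * group_i)).

    groups must contain only 0 and 1. With a single binary covariate and a
    log(lib_size) offset, the fitted rate of each group is simply
    (sum of counts) / (sum of library sizes) within that group.
    Both groups need a nonzero total count, otherwise the log is undefined.
    """
    total_count = {0: 0, 1: 0}
    total_depth = {0: 0, 1: 0}
    for y, n, g in zip(counts, lib_sizes, groups):
        total_count[g] += y
        total_depth[g] += n

    rate0 = total_count[0] / total_depth[0]
    rate1 = total_count[1] / total_depth[1]

    # b1: log fold change in relative abundance between the two groups
    log_fc = math.log(rate1 / rate0)
    # Wald standard error of b1 under the Poisson assumption
    std_error = math.sqrt(1 / total_count[0] + 1 / total_count[1])

    return {
        "rate0": rate0,
        "rate1": rate1,
        "log_fold_change": log_fc,
        "std_error": std_error,
    }


# Hypothetical toy data: raw counts of one taxon in six samples,
# with zeros and unequal sequencing depth.
counts = [10, 0, 30, 40, 0, 120]
lib_sizes = [10000, 5000, 25000, 10000, 8000, 22000]
groups = [0, 0, 0, 1, 1, 1]

fit = poisson_offset_fit(counts, lib_sizes, groups)
print(fit)
```

The point of the sketch is that the raw counts go into the model untouched, and the unequal library sizes are handled inside the model. Because the Poisson assumption ignores overdispersion, the standard error here will typically be too optimistic for real data.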

Could you explain more about how I can model my data?

Sadly, only a semester course in regression modelling can solve that :( Better yet, several years of stats, of course.

time to open the stats notes :D thanks anyway!