I was wondering how everyone deals with unmatched reads when calculating Shannon Diversity.
Since they may be valid sequences from organisms whose 16S sequence is not in the reference database, should they be included?
Or does including them skew the analysis, since they most likely don't all come from the same organism and would be lumped into one category?
In that case, is it safe to just discard them?
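
To make the concern concrete, here is a rough sketch with made-up counts. The numbers, the `shannon` helper, and the assumption that the unmatched reads come from five equally abundant organisms are all hypothetical, not from a real dataset. It compares discarding the unmatched reads, keeping them as one bin, and splitting them across several unknown taxa:

```python
import math

def shannon(counts):
    """Shannon index H = -sum(p_i * ln(p_i)) over the nonzero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

# Hypothetical read counts for three matched taxa (illustrative only).
matched = [500, 300, 200]
# Hypothetical number of reads with no hit in the reference database.
unmatched = 250

# Option 1: discard the unmatched reads entirely.
h_discard = shannon(matched)

# Option 2: keep the unmatched reads as a single extra category.
h_one_bin = shannon(matched + [unmatched])

# Option 3: assume (hypothetically) the unmatched reads come from
# five distinct organisms with 50 reads each.
h_split = shannon(matched + [50] * 5)

print(f"discarded: {h_discard:.3f}")
print(f"one bin:   {h_one_bin:.3f}")
print(f"split:     {h_split:.3f}")
```

With these toy numbers the index goes up from discarding, to one bin, to the split case, so both discarding and lumping would underestimate diversity if the unmatched reads really are several different organisms.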
Thanks for any thoughts!