I have a question on analyzing amplicon Miseq sequencing data and hope to have your suggestion.
Specifically, I have 100 samples but are Miseq sequenced in two separate runs. In the first run including the first 50 samples, I have sufficient paired sequences from both forward and reverse reads. In the second run including other 50 samples (five samples are also included in the first run and are repeatedly sequenced), I only have sufficeint forward reads, but the sequencing quality of reverse reads is too bad to use. My question is what's the best practice between 1) using paired sequences from the first run and forward reads from the second run, and 2) using forward reads from both the first run and the second run? I have tried the first practice and found the same five samples that are repeatedly sequenced in the two separate runs did not group by samples but rather by runs in PCoA analysis. I also detected signifiant beta-community difference between the two separate sequencing of the five samples.
Please let me know if you have any comments or suggestions. Thanks in advance.