Question

Pooling fastq reads from different samples in metabarcoding


7.1 years ago

lvogel ▴ 30

Hi, I've read that in metabarcoding it is possible to pool the FASTQ files of Illumina reads from different samples into one file and then continue with the analysis. I've even heard that you can combine reads from other runs, if they are from the same environment. I would think that if you are going to combine samples from different runs, they should also have been sequenced to the same depth, or the relative abundance estimates of biological sequences will be all wrong. Could anyone comment? Thanks.
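To make the depth concern concrete, here is a minimal sketch with made-up counts (the taxon names, run names, and all numbers are hypothetical). It compares the relative abundance computed from a naive pool of two runs with the average of the per-run relative abundances.

```python
from collections import Counter

# Hypothetical read counts per taxon; the numbers are invented for illustration.
run_a = Counter({"taxon_1": 900, "taxon_2": 100})        # shallow run: 1,000 reads
run_b = Counter({"taxon_1": 10_000, "taxon_2": 90_000})  # deep run: 100,000 reads


def relative_abundance(counts):
    """Return each taxon's share of the total read count."""
    total = sum(counts.values())
    return {taxon: n / total for taxon, n in counts.items()}


# Naive pooling: add the raw counts, then normalise once.
pooled = relative_abundance(run_a + run_b)

# Alternative: normalise each run first, then average the proportions.
per_run = [relative_abundance(run_a), relative_abundance(run_b)]
mean_of_runs = {t: sum(r[t] for r in per_run) / len(per_run) for t in run_a}

print(pooled["taxon_1"], mean_of_runs["taxon_1"])
```

With these invented numbers the pooled estimate for taxon_1 is about 0.11, while the mean of the per-run estimates is 0.5, because the deeper run contributes 100 times more reads to the pool.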

Illumina metabarcoding • 1.9k views

updated 7.1 years ago by Pierre Lindenbaum 161k • written 7.1 years ago by lvogel ▴ 30

score 3 · Accepted Answer · 2017-03-12


7.1 years ago

Pierre Lindenbaum 161k

If you plan to mark duplicates, reads from different flowcells/lanes/libraries cannot be considered optical/PCR duplicates of each other, so you'll have to assign a distinct read group to each of those conditions. Furthermore, tools like BWA use a subset of reads to calculate the average segment length, so pooling libraries with different insert sizes may skew that estimate.
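As a rough sketch of the read-group idea, the snippet below derives one `@RG` header line per flowcell/lane/library combination from an Illumina read name. It assumes the CASAVA 1.8+ read-name layout (`@instrument:run:flowcell:lane:tile:x:y`); the function name, the ID scheme, and the sample and library labels are hypothetical choices for illustration, not a prescribed convention.

```python
def read_group_from_header(header, sample, library):
    """Build a SAM @RG line from an Illumina (CASAVA 1.8+) FASTQ read name.

    Assumed layout: @instrument:run:flowcell:lane:tile:x:y [read:filtered:control:index]
    """
    # Drop the leading '@' and the optional description after the first space.
    fields = header.lstrip("@").split()[0].split(":")
    if len(fields) < 7:
        raise ValueError(f"unexpected read name layout: {header!r}")
    flowcell, lane = fields[2], fields[3]

    # Hypothetical ID scheme: one read group per flowcell + lane + library.
    rg_id = f"{flowcell}.{lane}.{library}"
    return "\t".join([
        "@RG",
        f"ID:{rg_id}",
        f"SM:{sample}",
        f"LB:{library}",
        f"PU:{flowcell}.{lane}",
        "PL:ILLUMINA",
    ])


# Example read name and labels are made up.
example = "@M01234:55:000000000-ABCDE:1:1101:15589:1332 1:N:0:1"
print(read_group_from_header(example, sample="s1", library="lib1"))
```

A line built this way could then be supplied to the aligner for the matching FASTQ file (for example through the `-R` option of `bwa mem`), so that each flowcell/lane/library keeps its own read group after the files are merged.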
