Question: Merging illumina paired-end data Lxxx and R1/R2
gravatar for euduca
4.2 years ago by
Brasil, Ilhéus, UESC
euduca0 wrote:


I received paired-end illumina data for some DNA sequencing (Hiseq). There are four fastq files and are named like this:

Fungal_L001_R1_001.fastq and Fungal_L001_R2_001.fastq
Fungal_L002_R1_001.fastq and Fungal_L002_R2_001.fastq

They are from the same library (TruSeq LT DNA kit, insert size of 800-900bp).

Can I merge the R1 and R2 files from different lanes?  And so, work normally with the post processing (trimming, assembly, etc.)

Like this:

$ cat Fungal_L001_R1_001.fastq Fungal_L002_R1_001.fastq > Fungal_R1.fastq
$ cat Fungal_L001_R2_001.fastq Fungal_L002_R2_001.fastq > Fungal_R2.fastq


Thanks =)



ADD COMMENTlink modified 4.2 years ago by DG7.1k • written 4.2 years ago by euduca0

Yes you can as long as it is the same sample (which you say it is).

ADD REPLYlink written 4.2 years ago by genomax80k
gravatar for DG
4.2 years ago by
DG7.1k wrote:

Absolutely. Depending on your workflow and exact organism some people chose to align their lanes independently and add read groups to the reads that came from different lanes and then merge the BAM files after having added read groups. This makes sure that you retain some meta-data associated with the sequencing lane. Mostly that is just for some quality control downstream in case you notice anything weird and want to see if there were issues with a particular sequencing lane.

ADD COMMENTlink written 4.2 years ago by DG7.1k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1878 users visited in the last hour