Question

Joining Paired-End Illumina Raw Reads

2

Entering edit mode

10.1 years ago

Yongjie Zhang ▴ 110

I recently have amplicons sequenced (Illumina PE250) to investigate microbial community. I want to know what quality detection, and what kind of trims of raw reads by what programs should be performed before joining paired-end raw reads? I think low-quality parts, potential barcode or primer sequences existed in raw reads should be excluded in advance. Is this right?

illumina paired-end • 8.5k views

ADD COMMENT • link updated 10.1 years ago by Joseph Hughes ★ 3.0k • written 10.1 years ago by Yongjie Zhang ▴ 110

1

Entering edit mode

I finally merged reads pairs using usearch -fastq_mergepairs with -fastq_allowmergestagger enabled, but before merging, I cut primer/adapters using CutAdapt and trimed low-quality regions from ends of reads using Trimmomatric, and after merging, I cut potential primer/adapter again by CutAdapt and filtered reads with >0.5 expected errors by usearch -fastq_filter. Then I can do downstream analyses.

ADD REPLY • link 10.0 years ago by Yongjie Zhang ▴ 110

score 1 · Answer 1 · 2014-03-26

1

Entering edit mode

10.1 years ago

Janake ▴ 170

Here is a good example to start: http://www.mothur.org/wiki/MiSeq_SOP

ADD COMMENT • link 10.1 years ago by Janake ▴ 170

0

Entering edit mode

Very useful link. Thanks. I found the join_paired_ends.py in Qiime is not very ideal to my data. Mothur might be better.

ADD REPLY • link 10.1 years ago by Yongjie Zhang ▴ 110

score 0 · Answer 2 · 2014-03-28

0

Entering edit mode

10.1 years ago

Danielk ▴ 640

You could use SeqPrep for this. see https://github.com/jstjohn/SeqPrep or my fork https://github.com/dakl/SeqPrep which adds two parameters for more stringent handling of mismatches (set to N, essentially).

ADD COMMENT • link 10.1 years ago by Danielk ▴ 640

score 0 · Answer 3 · 2014-03-28

0

Entering edit mode

10.1 years ago

Joseph Hughes ★ 3.0k

You should be able to find everything you need for removing adapters, splitting the sample based on barcodes and joining the pairs with a combination of the tools in the fastx_toolkit and ea-utils.

ADD COMMENT • link 10.1 years ago by Joseph Hughes ★ 3.0k