Question

How to process Illumina MiSeq paired-end reads for analysis

0

Entering edit mode

4.6 years ago

snow_seq • 0

Hello,

I am trying to pre-process paired-end reads for immunoglobulin repertoire analysis.

I have R1 and R2 fastq files from 2x250bp MiSeq. I am not sure whether I am supposed to trim indexed adapters before merging, what to tool is best to use for merging, and how many sequences I should anticipate should successfully merge. I have already attempted trimming adapters and then joining, as well as directly joining, but in either case, less than 50% of my sequences form pairs.

Does anyone have a general outline of how to process paired-end reads? What steps should be done before using sequences for alignment to a reference database? How many sequences are typically lost at the merge step?

Thank you in advance.

rna-seq paired-end reads pre-processing • 2.0k views

ADD COMMENT • link updated 4.6 years ago by Antonio R. Franco ★ 5.1k • written 4.6 years ago by snow_seq • 0

0

Entering edit mode

What steps should be done before using sequences for alignment to a reference database?

This is the key sentence of your question for getting the right answer. What kind of alignment? If you want to use something like BWA or bowtie. Just quality trim and do the alignment will be sufficient. The tools can handle paired-end data.

ADD REPLY • link 4.6 years ago by gb ★ 2.2k

0

Entering edit mode

Yes, that is my question. I need to merge the reads to assess the entire target gene region.

ADD REPLY • link 4.5 years ago by snow_seq • 0

0

Entering edit mode

Again, depends on the type of alignment if you use BWA you don't need to merge. If you only want to blast, for now I would say use only the forward reads or a combination of merged reads and forward reads.

ADD REPLY • link 4.5 years ago by gb ★ 2.2k

score 0 · Answer 1 · 2019-09-13

I am wondering why you need to merge the two files. Most of program and resources are able to handle fastq file by separate.

In addition, if using trimmers like Trim-galore or the like, you can ensure that both files are "synchronized" automatically

The processing of paired-end reads is easy. In most programs you need to declare the using of paired reads, and need only to write their file names. After the trimming to get rid of poor quality reads and the excess of adapters, you can use your fastq file immediately