Exome Seq Alignment - What to do with unpaired reads post trimming
1
3
Entering edit mode
8.3 years ago

Hi,

I have some paired end exome sequencing, and after checking out the samples in fastqc, I found there was an enrichment for Illumina adapters. I used Trimmomatic to remove the adapters (using Illumina Clip), which gives me paired reads, and unpaired reads for forward and reverse respectively post trimming.

I'm following the GATK best practises for variant calling, which recommends BWA MEM for alignment. I can use the paired, trimmed reads for to align, but there's quite a large proportion of reads which are trimmed and unpaired. The bedrock of my question is how can I use these effectively?

Would the best way be to do an alignment with the pairs, then unpaired, and combine the SAM files in some way? Or even merge the BAMs?

Anyone had a similar experience? Advice is welcome!

Thanks

ngs sequencing exome • 3.0k views
ADD COMMENT
6
Entering edit mode
8.3 years ago
Brice Sarver ★ 3.8k

I do this regularly with exome datasets.

Reads where one mate fails but are otherwise fine are included in a single-end file. It's still good data! Map them the same way you do the PE data, and subsequently combine the BAMs using Picard's MergeSamFiles or another approach.

Some cleaning approaches make this easy. I use expHTS to manage my cleaning now - does a fantastic job and is blazingly fast with stream-based processing. Documentation is forthcoming, but the developers are happy to help.

ADD COMMENT

Login before adding your answer.

Traffic: 1580 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6