Why do all of my PCR reads not map to the reference sequence?
1
0
Entering edit mode
18 months ago
Saran ▴ 50

Hello,

I was given fastq files of 150 bp illumina sequenced reads after PCR was performed on the region of interest and I was given a reference sequence of the gene. I used BWA mem to align my sequences to the reference and only got 60% mapped. I was wondering if I am using the correct algorithm for PCR produced reads and some explanation as to what could cause not all of them to map? I did not adapter trim because BWA does soft-clipping and I know some reads did sequence into the adapters because they were short. Could someone else explain as to why not all would map? Could this be PCR/sequencing error or am I doing something wrong?

Thank You

PCR sequencing BWA illumina • 499 views
ADD COMMENT
2
Entering edit mode
18 months ago
GenoMax 142k

Did you check to see if there are phiX sequences in your data? Often times phiX is spiked in to normalize the base distribution when sequencing a low nucleotide diversity sample (amplicons would qualify for this). Generally phiX data is removed during demultiplexing but if your samples were not multiplexed phiX reads may still be there. If that is not the case grab a few reads (convert them to fasta format) that do not map and then blast them at NCBI to find out what they are.

ADD COMMENT

Login before adding your answer.

Traffic: 2732 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6