remove sequencing error based on phred score
0
0
Entering edit mode
7.9 years ago

Hi guys, I have a fastq file from PacBio SMRT sequencing, first I will map reads to a reference, then I want to replace the low phred score base(<30) with consensus base. Is there any algorithm can do this? Thanks.

next-gen • 1.8k views
ADD COMMENT
0
Entering edit mode

What is the rationale behind doing this and what is it that you want to ultimately do?

You could have saved some money and created simulated PacBio sequences using the reference.

ADD REPLY
0
Entering edit mode

Thanks, what I submitted for PacBio sequencing are PCR products of a variant pool with similarity of 99% in DNA. I want to identify true variants from the pacbio reads, so I want to use the phred score to determine the mismatch base is true or not. If you have any other method or algorithm can do this, please tell me?

Thanks again.

ADD REPLY
0
Entering edit mode

Do you have consensus reads from PacBio data (now called "reads_of_insert" or what used to be called CCS reads)? If that is the case then there is good chance that the base you have in the read is real (and has support from multiple passes). Those may be the SNP's you are interested in.

Instead of replacing bases you should try to align (or cluster) your reads using the reference.

ADD REPLY
0
Entering edit mode

Yes, I have the CCS reads. Thank you very much.

ADD REPLY

Login before adding your answer.

Traffic: 2893 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6