PacBio long read error correction tools
1
0
Entering edit mode
6.1 years ago
freddiejung ▴ 60

Dear all,

in my previous post (https://www.biostars.org/p/303214/#303734), some people suggested that it's better to correct errors in log reads from PacBio sequel using short reads from illumina ahead of the genome assembly.

I happened to find that the assembly with only short reads contains lots of mis-assembled loci. The assembled sequences did not match FISH data. Currently we suspect that the extremely similar repeated sequences dispersed among the genome caused the mis-assembly.

In this case, I felt that error-correcting software would not work well like LoRDEC or HALC that needs the assembly derived from short reads as input. Is this right?

If so, what kind of software is better to correct errors?

PacBio Long-read error-correction short-read • 2.5k views
ADD COMMENT
0
Entering edit mode

What is the scientific/biological question you are looking for an answer to?

Depending on your question there are different initial error correction bfx pipelines that are reccomeded for PacBio data.

ADD REPLY
0
Entering edit mode

Did anyone use FMLRC (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5807796) or is HALC better?

ADD REPLY
0
Entering edit mode
6.1 years ago

The build-in error correction procedure of the CANU pipeline works quite well. That is however not using illumina data but only PacBio.

ADD COMMENT

Login before adding your answer.

Traffic: 2494 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6