Question

proovread error correction (help!)

1

Entering edit mode

9.2 years ago

midox ▴ 290

Hello,

I wanted to know is what proovread performs error correction with reference genome (or reference sequence) or de Novo?

because they wrote "the reference sequences"

Thank you for your reply

error-correction Assembly error • 2.7k views

ADD COMMENT • link updated 2.0 years ago by Ram 43k • written 9.2 years ago by midox ▴ 290

Ram · Answer 1 · 2015-02-11

4

Entering edit mode

9.2 years ago

Medhat 9.7k

proovread:

tool to correct the LR "long read from SMRT" using high quality SR "short reads " generated by illumina sequencer

Primarily proovread has been designed to correct PacBio subreads. You get these reads either from PacBio's SMRT-Portal or by using dextract from Gene Myers PacBio assembler DAZZLER

For correction of long reads, proovread needs high coverage short read data. Typically these are HiSeq (75-150bp) and MiSeq reads (200-300bp)

Update:

To use it

/path/to/proovread --long-reads=<FASTA/FASTQ> --short-reads=<FASTA/FASTQ> --threads [default 4]

    -l, --long-reads=<FASTA/FASTQ>
        PacBio subreads or other erroneous sequences to be corrected.

    -s, --short-reads=<FASTA/FASTQ>
        High confidence short reads. Specify mulitple times for multiple
        libs, need to have same format (FASTA/FASTQ and offset 33,64)

          -s lib1.fq -s lib2.fq # use reads from two libs

note;

 [--sam/--bam=<SAM/BAM>]
        External SAM or sorted BAM file instead of --short-reads.

[2]:

ADD COMMENT • link updated 2.0 years ago by Ram 43k • written 9.2 years ago by Medhat 9.7k

0

Entering edit mode

Thank you for your help.

Yes I know but is that they use a reference genome or reference sequence or not?

ADD REPLY • link updated 2.0 years ago by Ram 43k • written 9.2 years ago by midox ▴ 290

1

Entering edit mode

Reread what medhat posted and the paper. There is no reference sequence involved. It uses short reads to correct long reads (i.e., a hybrid approach).

ADD REPLY • link updated 2.0 years ago by Ram 43k • written 9.2 years ago by Devon Ryan 104k