proovread error correction (help!)
1
1
Entering edit mode
9.2 years ago
midox ▴ 290

Hello,

I wanted to know is what proovread performs error correction with reference genome (or reference sequence) or de Novo?

because they wrote "the reference sequences"

Thank you for your reply

error-correction Assembly error • 2.7k views
ADD COMMENT
4
Entering edit mode
9.2 years ago
Medhat 9.7k

proovread:

tool to correct the LR "long read from SMRT" using high quality SR "short reads " generated by illumina sequencer

Primarily proovread has been designed to correct PacBio subreads. You get these reads either from PacBio's SMRT-Portal or by using dextract from Gene Myers PacBio assembler DAZZLER

For correction of long reads, proovread needs high coverage short read data. Typically these are HiSeq (75-150bp) and MiSeq reads (200-300bp)


Update:

To use it

/path/to/proovread --long-reads=<FASTA/FASTQ> --short-reads=<FASTA/FASTQ> --threads [default 4]

    -l, --long-reads=<FASTA/FASTQ>
        PacBio subreads or other erroneous sequences to be corrected.

    -s, --short-reads=<FASTA/FASTQ>
        High confidence short reads. Specify mulitple times for multiple
        libs, need to have same format (FASTA/FASTQ and offset 33,64)

          -s lib1.fq -s lib2.fq # use reads from two libs

note;

 [--sam/--bam=<SAM/BAM>]
        External SAM or sorted BAM file instead of --short-reads.

[2]:

ADD COMMENT
0
Entering edit mode

Thank you for your help.

Yes I know but is that they use a reference genome or reference sequence or not?

ADD REPLY
1
Entering edit mode

Reread what medhat posted and the paper. There is no reference sequence involved. It uses short reads to correct long reads (i.e., a hybrid approach).

ADD REPLY

Login before adding your answer.

Traffic: 2362 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6