Questions about the Getting-started of wtdbg2
1
0
Entering edit mode
3.5 years ago
boymin2020 ▴ 80

Hi,

This is my first time to assemble long reads from nanopore sequencing. I also have the short reads generated by Illumina sequencer. Here is my plan, to use wtdbg2 to get the draft genome fasta file, then to use pilon to polish. However, I have been blocked at the getting-started part of wtdbg2. I am totally confused by the input and output files in the following command lines. Are they just in one pipeline or just independent examples?

#quick start with wtdbg2.pl
./wtdbg2.pl -t 16 -x rs -g 4.6m -o dbg reads.fa.gz

# Step by step commandlines

# assemble long reads
./wtdbg2 -x rs -g 4.6m -i reads.fa.gz -t 16 -fo dbg

# derive consensus
./wtpoa-cns -t 16 -i dbg.ctg.lay.gz -fo dbg.raw.fa
wtdbg2 nanopore Illumina • 1.6k views
ADD COMMENT
0
Entering edit mode

reads.fa.gz is the input sequence file. Substitute with your own.

dbg.raw.fa would be the final consensus fasta file.

ADD REPLY
1
Entering edit mode
3.5 years ago
h.mon 35k

The wtdbg2.pl is a Perl script that wraps the whole wtdbg2pipeline in one command. As such, it assemble the reads (with wtdbg2), derive the consensus (with wtpoa-cns), map (with minimap2) and filter (with samtools) the reads back to the consensus, to obtain a polished assembly (again, with wtpoa-cns).

The two commands bellow the wtdbg2.pl (wtdbg2 and wtpoa-cns) correspond to the first two steps of the Perl pipeline.

So you can run the perl script, and be done with it, or run each command separately.

ADD COMMENT
0
Entering edit mode

Thanks, h, your answer helped me a lot.

ADD REPLY

Login before adding your answer.

Traffic: 2460 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6