I am working on viral genome and the reads i generated is from ion torrent method. After performing denovo assembly using trinity tool i got contigs. Next step is to construct a draft genome from the contigs. Is there any tools/ softwares to create a draft genome from the contigs.
Can anyone please help me..
A set of contigs for an organism is considered the draft genome already (at least that is what I think). Do you mean to build a scaffold for these contigs - join contigs to improve your draft genome N50?
There are several assembly pipelines geared towards viral assembly. As viral samples are a mix of host and viral sequences, one key step is "contaminant" removal. Depending on the pipeline (and also if host and viral reference genomes are available, and if viral material has been enriched prior to library preparation), host sequences removal can be performed before or after assembly. Check some of the available pipelines:
I have some experience with eukarypte genome assembly from Illumina reads, but I've never worked on viruses or Ion torrent data. Still, a few things to note.
I'm not sure Trinity is the most suitable tool, since it is intended for transcriptome, not genome assembly (and does a great job at it). Why did you choose it? You might be better off with a genome assembler like SPAdes.
The next step after creating contigs is usually scaffolding, in which you create longer, gapped fragments out of your contigs, called scaffolds. To do that you can use paired-end data (preferably long distance, e.g. mate pair) or long reads. SPAdes will perform scaffolding as part of the assembly, if you provide the relevant data.
Achieving a full draft genome (i.e. a single sequence per chromosome) is usually challenging, although maybe with small and simple viral genomes this might be possible. You'll have to use either an existing reference genome of a closely related virus or additional information from genetic, physical or optical maps. I also hear that people interested in viral genomics use ONT sequencing, as it can sometimes sequence the whole genome in one read, and thus no assembly is needed. However, this technology has its own problems and will require you to sequence all over again.
What is the estimated genome size of your virus? to what depth did you sequence it, and with what library?