Question: How many short-reads are enough to do a valid bacterial genome assembly using Velvet and to perform a good annotation?
0
gravatar for bioinformatics_bel
2.1 years ago by
US, Alaska
bioinformatics_bel20 wrote:

I need to know how many short-reads are enough to do a valid bacterial genome assembly using Velvet and to perform a good annotation? Is it correct to use only a pair of .fastqs? The second part of the question is how to use assembly Velvet output for the genome annotation? GeneMarks software takes a fasta as input and nothing else.

genome assembly • 575 views
ADD COMMENTlink modified 2.1 years ago by Kevin Blighe52k • written 2.1 years ago by bioinformatics_bel20
0
gravatar for Kevin Blighe
2.1 years ago by
Kevin Blighe52k
Kevin Blighe52k wrote:

I don't believe that you require a huge amount of reads, certainly not more than one would normally get. With assembly, the key, as always, is to have long reads that can capture as much variation in the [Edit:] genome as possible. There is a direct inverse relationship of read-length on sensitivity and assembled genome size, as one would expect: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3988101/

Technically, you can assemble a [Edit:] genome from one sample but Velvet is designed to accept multiple samples combined, and these can have varying read lengths and insert sizes. The chosen k-mer size is important, too, and one can obviously choose a larger k-mer with longer reads.

I don't believe that Velvet does any annotation. Trinity does assembly and annotation, and Rockhopper is another tool that will identify novel transcripts that you can then 'annotate' indirectly via a BLASTx search.

Kevin

ADD COMMENTlink modified 2.1 years ago • written 2.1 years ago by Kevin Blighe52k

thank you..............................

ADD REPLYlink written 2.1 years ago by bioinformatics_bel20

Apologies, I realised that you are referring to genome assembly, not transcriptome. The same idea applies, however, in reference to read lengths. Also, programs like Rockhopper and Trinity, which I mentioned above, are designed for transcritome assembly.

For genome assembly methods, including annotation, I would encourage you to read this: https://academic.oup.com/bioinformatics/article/30/19/2709/2422249/Evaluation-and-validation-of-de-novo-and-hybrid

Best of luck

Kevin

ADD REPLYlink written 2.1 years ago by Kevin Blighe52k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2086 users visited in the last hour