Question: Questions About The Asm-File Produced By The Velvet Program.
gravatar for Dane
7.2 years ago by
Dane10 wrote:

Hey all,

To preface, for the project I'm involved with I need to extract paired end info from an assembly - that is, for each paired end read: which contigs it connects, and the orientation and distance it suggests.

To this end, I've assembled on Velvet, created an AMOS file, and then created files for each individual contig from this (using the asmbly_splitter script). However, I'm having trouble interpreting the format of the file, and I've found very little documentation.

For example, the file output looks like:



Can anyone help me understand what these results represent? And how to extract the information I need from this?


paired velvet • 1.4k views
ADD COMMENTlink modified 7.1 years ago by Jeremy Leipzig17k • written 7.2 years ago by Dane10
gravatar for Jeremy Leipzig
7.2 years ago by
Philadelphia, PA
Jeremy Leipzig17k wrote:

from your wording "contigs it connects", it sounds like you really want to know which pairs are allowing contigs to be scaffolded

you basically want to isolate tiles with mates that appear in two different CTGs (contigs)

the weird thing is that contigs in scaffolds SCF are designated by TLEs

outside of scaffolds they are simply CTGs and the read segments making them up are TLEs

awesome stuff i know

I would start by finding SCF blocks and work your way to finding SRC's that belong to the same FRG (fragment, I can't remember if velvet produces FRG tags - they tie paired ends together). If not then you will have to rely on consecutive reads being pairs like 5769434 and 5769435.

ADD COMMENTlink modified 7.2 years ago • written 7.2 years ago by Jeremy Leipzig17k

Thanks. But I still have puzzles about the asm file. I don't know the meaning of element 'clr' in the TLE, and many others. Could you please tell me where to find the exact instruction about the asm file. Actually, I want to finish the edges bundling on the Contigs, and I followed the steps listed in the 'Opera' paper and others. But as a new research in bio-information, it's too difficulty for me, do you know any better or simple methods to do it?

ADD REPLYlink written 7.1 years ago by Dane10
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1376 users visited in the last hour