de novo chromosome level assembly
1
3
Entering edit mode
28 days ago
mthm ▴ 80

I have the contigs assembly of a non model species. the assembly itself is made de novo since there is no close reference genome for this genus. I have used RagTag before for chromosome level assembly but it requires reference chromosome assembly, which in this case, is from a very distant genus. What would be your suggestion, do I use that distant reference chromosome or run de novo chromosome assembly? and in the latter case, I found CSA (Chromosome-Scale Assembler), are there better suggestions in terms of robustness?

genome assembly chromsome • 1.0k views
ADD COMMENT
0
Entering edit mode

Plant, animal, expected genome size ?

ADD REPLY
0
Entering edit mode

Thanks for reply, it is an insect genome, expected genome size 1-2 Gb

ADD REPLY
0
Entering edit mode

what kind of data types do you have at hand?

ADD REPLY
2
Entering edit mode
28 days ago

I would personally NOT use the distant one to do reference guided scaffolding, the risk that things have changed (chromosome wise) over such an evolutionary distance is just too high to rely on the distant reference.

My suggestion would thus be to go for denovo scaffolding (if feasible off course, == if you have the data for it). Depending on your available data there is likely several options so best to first fill us in on what data you might have before we can do tool suggestion.

ADD COMMENT
0
Entering edit mode

Thank you for your answer. Data are whole genome sequencing using both short reads (Illumina hiseq 10) and long reads (PacBio) techniques. The genome is done using a hybrid assembly with wtdbg2 and MaSuRCA hybrid assembler following polishing the data and removing contaminants. My data is from a cricket family and you are right, the genome size, the chromosome numbers and even the sex chromosome mechanism varies a lot in this suborder.

ADD REPLY
2
Entering edit mode

I would reassemble with hifiasm and polish if needed with the short reads. wtdb2 is known for creating very short fragmented contigs.

ADD REPLY
0
Entering edit mode

Thanks for the suggestion, I first thought it is for contig assembly but apparently can also be used for scaffolding at chromosome level

ADD REPLY
0
Entering edit mode

any tool suggestions would be appreciated

ADD REPLY
1
Entering edit mode

colindaven already suggested HiFiasm, perhaps you can also have a look at flye (it used to have a mode to even do ref-guided assembly, but not sure if that's still supported?)

ADD REPLY
0
Entering edit mode

How can I attribute the scaffolds to chromosome numbers?

ADD REPLY
0
Entering edit mode

you mean assemble them into pseudo-chromosomes or identify which scaffold is derived from which 'real' chromosome?

ADD REPLY
0
Entering edit mode

I suppose the first one, since I don't have the information of the real chromosome

ADD REPLY
1
Entering edit mode

Look at Ragtag on github, assuming you have a close reference to align against. Else your main option is spending more on Hi-C data. Else maybe optical mapping with Bionano.

ADD REPLY

Login before adding your answer.

Traffic: 4228 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6