assemble in chromosome level for first time
1
0
Entering edit mode
7.7 years ago
reza ▴ 300

i have whole genome of camel that sequenced via illumina hiseq, i want to assemble this genome in chromosome level. there is no chromosome level assembled genome for camelid, is there any way to do this project?

i have many question in bioinformatics and i am here to learn, thank you in advance

Assembly next-gen • 1.3k views
ADD COMMENT
0
Entering edit mode

What you are trying to do is called "de novo genome assembly". I'd recommend you look around a bit on that topic - it's quite interesting. Check out de bruijn graphs and Dan Zerbino's work - that should give you a starting point.

ADD REPLY
1
Entering edit mode
7.7 years ago
igor 13k

As @Ram already mentioned, what you are looking for is "de novo assembly". See previous discussion here where you can hopefully learn a lot:

I should point out that you are not going to get full chromosomes based on Illumina short reads alone. It's usually not possible to get contigs longer than a few megabases (mammalian chromosomes are hundreds of megabases).

In general, de novo assembly is extremely difficult. There are already existing assembled camel genomes that you can use:

Why not align to those?

ADD COMMENT
0
Entering edit mode

i want to assemble in chromosome level for downstream annotation and analysis but i think that it is not possible with short reads, for this job, chromosomes must sequenced separately.

ADD REPLY
2
Entering edit mode

There are three camel genomes available (arabian, bactrian and wild bactrian) at NCBI. So at a minimum you should be able to align your data to those. As indicated by @igor even if you are not able to assemble your data at chromosome level, you should be able to get a reasonably good picture of what the genome you sequenced looks like by doing this analysis.

ADD REPLY
0
Entering edit mode

I assume by annotation, you need to assign alignments to a particular gene. If that's the case, you don't need complete chromosomes. Often people use both DNA and RNA for published assembly. Even highly fragmented genome assembly is likely to have reasonable gene annotation.

Even if you sequence chromosomes separately, you are not likely to get a fully assembled chromosome using short reads.

ADD REPLY

Login before adding your answer.

Traffic: 1779 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6