Question: De novo assembled transcript mapping to genome scaffold
0
gravatar for Bioinfonext
2.8 years ago by
Bioinfonext170
Korea
Bioinfonext170 wrote:

Hi,

I did mapping of illumina raw reads to CDS transcripts and I extracted unmapped raw reads.

after that I did assembly of these unmapped raw reads. Now I want to map these assembled transcript to genome scaffold to find how many of them are able to map to genome and whether they map to different position to each others.

FINALLY will extract only those DE NOVO ASSEMBLED TRANSCRIPT which are able to map to genome but not annotated as CDS.

So my finally reference for differential gene expression will be CDS plus de novo assembled transcripts.

rna-seq • 1.7k views
ADD COMMENTlink modified 2.8 years ago by wrf50 • written 2.8 years ago by Bioinfonext170

It's helpful that you have explained what you have tried, but it would also help if you would explain your goal, and what kind of data you are working with. Also, it's worth noting that you did not technically ask a question, so perhaps it would be worthwhile rephrasing your text.

ADD REPLYlink written 2.8 years ago by Brian Bushnell16k

Actually I notice that some of the gene sequences were present in the genome scafold but not present in the CDS trasncripts.

when I map raw read to genome than around 95 % raw read able to map to genome but when I map to CDS than only 60% were able to map.

So now I want to confirm how many of the de novo assembled trascript from (unmapped raw read to CDS), can able to map on the genome scaffold. this is Transcriptome data from Plant roots at different development stages.

ADD REPLYlink written 2.8 years ago by Bioinfonext170

it will also confirm the quality of de novo assembled transcript

ADD REPLYlink written 2.8 years ago by Bioinfonext170

You will get a much better response rate if you try to use proper English with correct capitalization. Sloppy linguistics is generally frowned upon in this forum.

ADD REPLYlink modified 2.8 years ago • written 2.8 years ago by Brian Bushnell16k
2
gravatar for wrf
2.8 years ago by
wrf50
Germany
wrf50 wrote:

For mapping de novo assembled transcripts to a genome, you may want to try GMAP, here. The usage is sort of like bowtie or blast, where you make the database from the scaffolds, then align in a second step.

ADD COMMENTlink written 2.8 years ago by wrf50

GMAP is an excellent first step for mapping -- way, way faster than blat -- but it achieves speed by playing fast and loose, often not achieving results as good as blat -- e.g. failing to align terminal exons, returning multiple tiny exons, etc where blat does not.

You can use GMAP to find the gene coords, but then I would strongly recommend using Exonerate to re-align the transcript into the GMAP region, and output GFF results.

ADD REPLYlink written 2.8 years ago by apa@stowers450
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1190 users visited in the last hour