Question: De Novo Assembly Errors
gravatar for thorerges
4.8 years ago by
United States
thorerges60 wrote:

Recently, my lab had a private vendor has assembled transcriptome contigs of several genes from a mammal. I do not have access to the raw files - only assembled contigs for genes.  

I discovered on contig has a 100bp insert, right in the middle of the gene. I did a BLAST search of this 100bp insert (the rest of the gene aligns to pre-existing homologs in other species), but it returned nothing. I suspect it is a sequencing artifact. 

Is there a way I can be sure that this insert is an artifact? Could it be the product of retrotransposition? What is the usual procedure for this kind of problem?

ADD COMMENTlink modified 4.8 years ago by Devon Ryan94k • written 4.8 years ago by thorerges60
gravatar for Devon Ryan
4.8 years ago by
Devon Ryan94k
Freiburg, Germany
Devon Ryan94k wrote:

Normally you'd need to map the original reads back to the contig and then have a look at the results. If you don't have the original sequencing (you should be able to ask the vendor for that), however, then that'd be impossible. At that point all you can do is a few wet lab experiments (e.g., PCR the region around the insert and check the size/sequence).

ADD COMMENTlink written 4.8 years ago by Devon Ryan94k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 968 users visited in the last hour