Question: Problems identifying heterozygous transcripts using de novo transcriptome assembly via Trinity (Galaxy)
10 months ago by
aih50 wrote:

Hello, I have been annotating a genomic region that is heterozygous in our model animal. There are 10 genes I'm interested in, and each has 2 distinct alleles (the sequences are not identical: 30-300 nucleotide differences over ~1000-1500 bases). We assembled a transcriptome and aligned it, along with the RNA-seq reads, to this genomic region for both haplotypes. The RNA-seq reads were aligned strictly (no mismatches allowed), and all genes have complete read coverage.

The de novo transcriptome assembly does not produce separate transcripts for the two alleles. Usually only one transcript is generated, and it maps to both alleles. In some cases the transcript is identical to one allele; in others it is a chimera of the two. From what I have read, I would expect the read depth to be sufficient for Trinity to recover the separate alleles when it assembles transcripts, but I can't find an option or function that makes Trinity more stringent in how it assembles the possible transcripts.

The reason I am trying to figure this out is that we want to assemble de novo transcriptomes for other samples that do not have a reference genome, but at this point I am not confident that Trinity can assemble the distinct alleles appropriately, even when there are many differences. Does anyone have experience running Trinity so that it is more sensitive to allelic variants? Thank you for your help!
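
For scale, the divergence figures quoted above can be put into numbers with a small sketch. It is only an illustration: the function names are hypothetical, the k-mer length of 25 is an assumption about the assembler's word size, and the toy sequences are made up. It computes percent divergence from a difference count and the fraction of k-mers two allele sequences share, which is one rough way to gauge how separable two alleles might look to a k-mer based assembler.

```python
def percent_divergence(n_diffs, length):
    """Percent of positions that differ between two aligned alleles."""
    return 100.0 * n_diffs / length


def kmers(seq, k=25):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmer_fraction(allele_a, allele_b, k=25):
    """Fraction of allele_a's distinct k-mers that also occur in allele_b.

    k=25 is an assumed word size, not a value taken from the post.
    """
    a, b = kmers(allele_a, k), kmers(allele_b, k)
    return len(a & b) / len(a) if a else 0.0


if __name__ == "__main__":
    # The extremes quoted in the question: 30 differences over 1500 bases
    # and 300 differences over 1000 bases.
    print(percent_divergence(30, 1500))
    print(percent_divergence(300, 1000))

    # Toy alleles (made up) differing at the last base; small k for readability.
    print(shared_kmer_fraction("AAACCCGGGTTT", "AAACCCGGGTTA", k=4))
```

With real allele sequences in place of the toy strings, a high shared fraction would suggest long stretches where the two alleles are indistinguishable at the k-mer level.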

modified 8 months ago by Biostar ♦♦ 20 • written 10 months ago by aih50