Studying variances in assembled transcripts
0
0
Entering edit mode
9.4 years ago
rmihael • 0

Hi all. This is my first steps in bioinfomatics, so please forgive me for potentially trivia questions.

I'm trying to study variations in transcripts of a same gene on different developmental stages of dmel flies. In particular I want to verify the hypothesis that 3' UTRs are shortened at latest developmental stages. Obvious intermediate step of this study is obtaining transcript from assembled RNA-Seq data. This is where I have my problems.

After mapping with Tophat and assembling with Cufflinks my final result is transcripts.gtf file. I can use gffread as

gffread -g dmel.bowtie.index.fa -w transcripts.fa cufflinks_out/transcripts.gtf

to get aligned transcripts, but these transcripts will be from reference genome, so stripped of any potential variations.

So, the question is, how can I get these "raw" transcripts that were aligned by cufflinks from sequencing data? It sounds like a de novo alignment problem, but there IS reference genome and annotation available, so may be it's possible to take advantage of it?

RNA-Seq • 2.1k views
ADD COMMENT

Login before adding your answer.

Traffic: 1770 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6