Question: need to identify CDS (translation) for gene models predicted by Cufflinks and Stringtie
gravatar for Ann
3.6 years ago by
Concord NC USA
Ann2.2k wrote:

I have transcript models in GTF format made from assembling RNA-Seq read alignments using stringtie and also cufflinks. The reference genome is a new blueberry assembly I'm helping to annotate.

For the next steps, I need to find the genomic coordinates of the start and end of translation for the longest open reading in each transcript model.

Is there a program that can help me do that?

Previously, some-one in my group used a perl program called Tau to do this, but I don't have a copy of it. (Embarrassing!)

It seems like a straightforward task. Surely some-one has written something like this and put it on github or somewhere?

If not, do you think this would be a good project for a student learning bioinformatics?





rna-seq cufflinks stringtie • 1.3k views
ADD COMMENTlink modified 3.6 years ago by Chirag Nepal2.2k • written 3.6 years ago by Ann2.2k
gravatar for Chirag Nepal
3.6 years ago by
Chirag Nepal2.2k
Chirag Nepal2.2k wrote:

You could try transdecoder


ADD COMMENTlink written 3.6 years ago by Chirag Nepal2.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1139 users visited in the last hour