Question: Should cndas provided by Ensembl be filtered by ccds or biotype prior to running kallisto?
gravatar for holgerbrandl
2.0 years ago by
holgerbrandl30 wrote:

I typically download cdnas directly from Ensembl (like with wget, build a kallisto index, and run kallisto quant to estimate isoform abundance.

However, Ensembl tends to provide very detailed transcript models. Furthermore, the provided cdna files from Ensembl also contain lots of non-coding biotypes from NMD to retained intron.

So I was wondering if a better practice would be filtering the provided cdna.fasta for just those transcripts with a CCDS id or filtering by biotype (such as "protein coding")?

As an example a ccds-filter would cut down the number of cdnas of;g=ENSG00000077782 from 41 to 9.

How sensitive is kallisto with respect to overly complex/redundant gene architectures?

kallisto isoforms • 386 views
ADD COMMENTlink written 2.0 years ago by holgerbrandl30
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1308 users visited in the last hour