Question: Is Kallisto good for identifying non-coding RNA in a RNAseq data?
gravatar for priyankaraina10
6 days ago by
priyankaraina1010 wrote:


I generated a co-expression map using Kallisto for aligning reads. However, after completing my analysis, when I checked the gene_biotype for the gene list, I only found 3% non-coding RNA. I repeated the analysis with another data from (Skymap), I got similar results.

My question is this normal or Kallisto is not good in aligning non-coding RNA.

Thank you

ADD COMMENTlink modified 6 days ago by kristoffer.vittingseerup1000 • written 6 days ago by priyankaraina1010

Kallisto uses a transcriptome genome index, so you can only detect what is present in the annotation.

ADD REPLYlink written 6 days ago by WouterDeCoster35k

Adding to what WouterDeCoster said: (i) a lot of non-coding RNAs are relatively small, which means that depending on which sequencing protocol was they might not have been sequenced at all; (ii) they might also be repetitive leading to multimapping reads which I am not sure Kallisto would be able to handle even if they are kept by the mapper.

ADD REPLYlink written 6 days ago by A. Domingues1.7k

True Kallisto is better suited for long-noncoding RNAs (lncRNAs)

ADD REPLYlink written 5 days ago by kristoffer.vittingseerup1000

You typically map short RNA species sich as microRNAs against a microRNA-only reference such as miRbase to avoid axactly this probem.

ADD REPLYlink written 5 days ago by ATpoint11k

I don't think miRNA should be a problem with standard RNA-seq as the miRNA are typically shorter than fragments analyzed due the size-selection step in the library prepreation.

ADD REPLYlink written 4 days ago by kristoffer.vittingseerup1000

Sorry, I misread the initial question, thinking that OP was quantifying miRNA-seq instead of miRNAs from standard RNA-seq. You are right, standard RNA-seq typically does size-selection for non-short RNA species.

ADD REPLYlink written 4 days ago by ATpoint11k
gravatar for kristoffer.vittingseerup
6 days ago by
European Union
kristoffer.vittingseerup1000 wrote:

Yes. Kallisto does not distinguish between coding and non-coding. It only requires that the transcripts are already annotated (so if you are looking for novel lncRNAs you should use a different pipeline).

What is the frequency of lncRNAs in the transcript annotation database to start with?

ADD COMMENTlink written 6 days ago by kristoffer.vittingseerup1000
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1533 users visited in the last hour