The most frequent isoform of each gene specific to a cell line
5.1 years ago
ashkan ▴ 130

How can I find the most frequent isoform in each cell line? For example, I have RNA-seq data from HeLa cells and want to keep only one isoform (transcript) per gene, namely the one that is specific to HeLa cells.

rna-seq • 1.7k views

I have not noticed it.

5.1 years ago

I don't know if there is a database for that, but what I would do is:

Use publicly available data sets:

  1. Take HeLa cell RNA-seq data and quantify the transcripts. A simple library size normalisation would be enough.
  2. Take RNA-seq data from a few other tissues or cell types and do the same. (There are many data sets available.)
  3. Calculate the fold changes for the transcripts (HeLa cells vs. the other cell types) and plot the distribution.
  4. Choose a cutoff based on that distribution. For example, require that a transcript be expressed at least 3 times higher in HeLa cells than in the other tissues. These are the HeLa-specific transcripts. Then take the most abundant transcript for each gene.

You will end up with the most abundant tissue-specific transcript for each gene.
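
The steps above could be sketched in Python with pandas roughly as follows. This is only a minimal illustration, not a tested pipeline: the function name `hela_specific_top_isoforms`, the pseudocount, the toy counts and the sample names are all assumptions made up for the example, and it presumes you already have a transcript-by-sample count table from whatever quantifier you used. The plotting part of step 3 is left out; the cutoff of 3 is simply passed in as `min_fold`.

```python
import pandas as pd


def hela_specific_top_isoforms(counts, gene_of, hela_cols, other_cols,
                               min_fold=3.0, pseudo=1.0):
    """Return one HeLa-specific transcript per gene (hypothetical helper).

    counts  : DataFrame indexed by transcript id, one column per sample
    gene_of : Series mapping transcript id -> gene id
    """
    # Steps 1-2: simple library size normalisation (counts per million)
    cpm = counts / counts.sum(axis=0) * 1e6

    # Step 3: fold change of mean HeLa expression over mean expression elsewhere.
    # The pseudocount (an assumption) avoids division by zero.
    hela = cpm[hela_cols].mean(axis=1)
    other = cpm[other_cols].mean(axis=1)
    fold = (hela + pseudo) / (other + pseudo)

    table = pd.DataFrame({
        "gene": gene_of.reindex(counts.index),
        "hela_cpm": hela,
        "fold": fold,
    })

    # Step 4: keep transcripts above the cutoff, then the most abundant per gene
    specific = table[table["fold"] >= min_fold]
    return specific.loc[specific.groupby("gene")["hela_cpm"].idxmax()]


# Toy data, invented purely to show the call
counts = pd.DataFrame(
    {
        "hela": [500, 400, 50, 50],
        "tissue_a": [100, 100, 400, 400],
        "tissue_b": [100, 100, 400, 400],
    },
    index=["g1_t1", "g1_t2", "g2_t1", "g2_t2"],
)
gene_of = pd.Series(["g1", "g1", "g2", "g2"], index=counts.index)

top = hela_specific_top_isoforms(counts, gene_of, ["hela"], ["tissue_a", "tissue_b"])
print(top)
```

Genes with no transcript above the cutoff simply drop out of the result, so you only get an isoform for genes that have a HeLa-specific one.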

P.S. This seems like a lot of work, but it's fun to do.

