Question

The most frequent isoform of each gene, specific to a cell line

0

7.4 years ago

ashkan ▴ 160

How can I find the most frequent isoform in each cell line? For example, I have RNA-seq data from HeLa cells and want to keep only one isoform (transcript) per gene, namely the one that is specific to HeLa cells.

rna-seq • 2.2k views

1

Really, another question without following up on all previous threads?

7.4 years ago by WouterDeCoster 47k

0

I have not noticed it.

7.4 years ago by GouthamAtla 12k

score 0 · Answer 1 · 2016-12-07

I don't know if there is a database for that, but here is what I would do:

Use publicly available data sets:

Take HeLa cell RNA-Seq data and quantify the transcripts. A simple library size normalisation would be enough.
Take RNA-Seq data from a few other tissues and do the same. (There are many data sets available.)
Calculate the fold changes for the transcripts (HeLa cells vs. other cell types) and plot the distribution.
Choose a cutoff based on that distribution. Let's say a transcript is expressed 3 or more times higher in HeLa cells than in the other tissues. These are the HeLa-specific transcripts. Then pick the most abundant of them for each gene.

You will end up with the most abundant tissue-specific transcript for each gene.
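The steps above could be sketched roughly as follows. This is only a minimal illustration on made-up counts, not a tested pipeline. The function names (`cpm`, `specific_top_isoforms`), the transcript and gene IDs, the pseudocount, and the choice to compare against the mean of the other samples are all assumptions of mine. The fixed cutoff of 3 is just the example value from above; in practice you would pick it after looking at the fold-change distribution.

```python
def cpm(counts):
    """Library size normalisation: scale raw counts to counts per million."""
    total = sum(counts.values())
    return {tx: 1e6 * c / total for tx, c in counts.items()}


def specific_top_isoforms(target, others, tx2gene, min_fold=3.0, pseudo=1.0):
    """Return {gene: transcript}: the most abundant target-specific isoform.

    target  -- raw transcript counts for the sample of interest (e.g. HeLa)
    others  -- list of raw count dicts for the comparison samples
    tx2gene -- mapping from transcript ID to gene ID
    """
    target_cpm = cpm(target)
    other_cpms = [cpm(o) for o in others]
    best = {}
    for tx, expr in target_cpm.items():
        # Assumption: compare against the mean expression in the other samples.
        other_mean = sum(o.get(tx, 0.0) for o in other_cpms) / len(other_cpms)
        # The pseudocount avoids division by zero for unexpressed transcripts.
        fold = (expr + pseudo) / (other_mean + pseudo)
        if fold < min_fold:
            continue  # not specific enough to the target sample
        gene = tx2gene[tx]
        # Among the specific transcripts, keep the most abundant one per gene.
        if gene not in best or expr > target_cpm[best[gene]]:
            best[gene] = tx
    return best


# Toy data (hypothetical IDs and counts, for illustration only).
tx2gene = {"G1.a": "G1", "G1.b": "G1", "G2.a": "G2", "G2.b": "G2"}
hela = {"G1.a": 600, "G1.b": 100, "G2.a": 200, "G2.b": 100}
other1 = {"G1.a": 100, "G1.b": 400, "G2.a": 250, "G2.b": 250}
other2 = {"G1.a": 200, "G1.b": 300, "G2.a": 300, "G2.b": 200}

result = specific_top_isoforms(hela, [other1, other2], tx2gene)
print(result)  # {'G1': 'G1.a'}
```

Note that G2 drops out here because none of its transcripts pass the cutoff, so with this approach not every gene gets a transcript. With real data the counts would come from a transcript quantification tool rather than hand-written dictionaries.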

P.S. This seems like a lot of work, but it's fun to do.