Question: How to compute survival analysis between LncRNA and one type cancer (i.e.Breast Cancer) based on TCGA
0
gravatar for ybling2008
2.4 years ago by
ybling20080
ybling20080 wrote:
  1. I want compute survival analysis between LncRNA and Breast Cancer based on TCGA. but i don't know how get the LncRNA information. 2.Should I must download the raw RNA-seq data from the TCGA database one patient by one patient?
  2. How to identify the LncRNA based on the RAW data

Thank you

ADD COMMENTlink modified 2.4 years ago by EagleEye6.0k • written 2.4 years ago by ybling20080
2
gravatar for EagleEye
2.4 years ago by
EagleEye6.0k
Sweden
EagleEye6.0k wrote:

You can make use of already processed TCGA dataset, in TANRIC they also cover survival analysis extensively for all cancer types from TCGA.

http://ibl.mdanderson.org/tanric/_design/basic/index.html

http://ibl.mdanderson.org/tanric/_design/basic/query.html

If you want some independent dataset, you can have a look into this resource (but I am not sure whether this covers LncRNAs).

http://watson.compbio.iupui.edu/chirayu/proggene/database/?url=proggene

TCGA survival analysis tutorial,

Survival analysis of TCGA patients integrating gene expression (RNASeq) data

ADD COMMENTlink modified 2.4 years ago • written 2.4 years ago by EagleEye6.0k

My final purpose is to establish a model like as below. It is a paper. So I want to get LncRNA information. And then put them divide into two groups. one is train group, the other is validation group. Based Cox regression, construct a model of some lncRNA equation。

"Transcriptome sequencing uncovers a three–long noncoding RNA signature in predicting breast cancer survival"

ADD REPLYlink written 2.4 years ago by ybling20080

You can still follow the tutorial mentioned above. Divide the RNAseq patients samples into two group training and the validation set.

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by EagleEye6.0k

but my question the dataset from the Tutorial you give don't covers LncRNAs.

ADD REPLYlink written 2.4 years ago by ybling20080

The RNAseq data you download from TCGA will have all classes of RNAs, you just have to separate them based on their property using available annotations like, Gencode, Ensembl, UCSC etc.,

Example as discussed in these posts you can obtain the annotation file and match ensembl IDs from TCGA RNAseq data with your annotation file containing the classification of RNAs (protein_coding, lincRNAs, etc.,)

A: Identify lncRNA in list of Ensembl ID's

The easy way is to download GTF file from Gencode and follow these steps to get annotation as table format,

A: Converting gtf format to bed format

Orelse if you are familiar with UCSC table browser follow these steps,

A: I need to download a list of all human genes with their respective Esemble gene

ADD REPLYlink written 2.4 years ago by EagleEye6.0k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1095 users visited in the last hour