I'm trying to download data from the TCGA for gene expression analyses in R, but I'm in doubt if I should use FPKM, FPKM-UQ or counts? When the dataset is in counts, I suppose it's raw data, isn't it? So what's the best unit to compare multiple datasets? I'm planning to use limma or Dseq2 for GE analysis and found that with Dseq2 I need to use count(non-normalised???) data... is that correct? so what's the best package and working strategy?
Please I need help again. I need to know which column in my table corresponds to the FPKM-UQ values. Thank you for your help!
No value in that output relates to FPKM-UQ
I need a table with only FPKM-UQ and Genes values. How to identify the FPKM-UQ values in an S4 matrix? thank you
Sorry, I have no information about which data you have or what you are aiming to do.
Cancer data. I intend to compare the values of FPKM-UQ expressed in normal tissue and primary tumor.
I see, but what data have you retrieved right now? If you are relatively new to programming, I may suggest using TCGAbiolinks in R / Bioconductor. If you have no programming experience, then perhaps use cBioPortal
I used TCGAbiolinks. Yes, I am a beginner in programming. I used this code
Sure thing, be sure, therefore, to follow the extensive tutorials: https://bioconductor.org/packages/release/bioc/html/TCGAbiolinks.html
For now, you will want
3. Downloading and preparing files for analysis