I am currently working on gene based HtSeq count data from the MMRF data portal. https://research.themmrf.org/rp/download
Does anyone have experience with this data?
My question is, is this data normalized. The only information available in the ReaMe file is: MMRF_CoMMpass_IA9_E74GTF_HtSeq_Gene_Counts.txt - Simple matrix file ENSG_ID, defined by ensembl v74 GTF, in first column and HtSeq counts for each respective specimen in successive columns
I tried drawing a boxplot for the data and the samples median don't show much difference.
Any help will be greatly appreciated.