I downloaded the TCGA dataset for LUAD (Lung Cancer), I find each file with miRNA sequence data like readCount and parts per million value and their are around 123 samples. But where do I find the label, whether this sample is cancerous or not?
TCGA-05-4244-01A-01T-1108-13.hg19.mirbase20.mirna.quantification.txt TCGA-05-4244-01A-01T-1108-13.mirna.quantification.txt TCGA-05-4249-01A-01T-1108-13.hg19.mirbase20.mirna.quantification.txt TCGA-05-4249-01A-01T-1108-13.mirna.quantification.txt
- What is the difference in between isoform and mirna?
- What is the difference in between hg19.mirbase20.mirna and mirna? Should I include both files in my training model?
- Where do I find the label, whether this file data corroborates to a healthy tissue or a cancerous one?