After download batches of gene expression files from TCGA gdc (about 600 hundreds files), each file contains only one sample. I wanna know which file is a tumor sample, which file is a normal sample. I want to find differentially expressed genes via these genes next step. However, I don't know how to find sample type information in TCGA, anyone can help me? What I want is like this, a wiki of TCGA, because I didn't find similar tools in gdc.
I hope I can download a tab-delimited file contain these information:
study barcode disease disease_name sample_type sample_type_name analyte_type library_type center center_name platform filename
TCGA TCGA-56-7222-01A-11R-2045-07 LUSC Lung squamous cell carcinoma TP 01 RNA RNA-Seq UNC-LCCC UNC-LCCC ILLUMINA UNCID_2195465.13daa1a0-a236-474e-b621-eb131be34af1.120305_UNC16-SN851_0133_AC0JB6ACXX_6_GGCTAC.tar.gz