I recently downloaded the TCGA colorectal clinical data information from GDC portal. From this I got the following files.
I combined both the files and a total of 628 patients data is available. Among them I see
563 - Alive 65 - Dead
times bcr_patient_barcode patient.vital_status 49 TCGA-5M-AAT4 Dead 290 TCGA-5M-AAT6 Dead 154 TCGA-3L-AA1B Alive 1200 TCGA-5M-AATE Alive 648 TCGA-A6-2671 Alive
All the 628 patients have information available about
Similarly, I checked the cbioportal TCGA Provisional colorectal clinical data cbioportal colorectal. Here the
patient_vital_status is of different numbers.
502 - Alive 130 - Dead 8 - NA
And in this, almost 60 patients had
Days_to_Last_followup. I'm interested in doing survival analysis. Now very confused to select the right one for the analysis.
times bcr_patient_barcode patient.vital_status NA TCGA-5M-AAT4 Dead NA TCGA-5M-AAT6 Dead 154 TCGA-3L-AA1B Alive 1200 TCGA-5M-AATE Alive 648 TCGA-A6-2671 Dead
So, from the data above both
cbioportal show different information.
cbioportal clinical data is the updated one as it shows more patients ad
Dead. But why some patients in cbioportal clinical info doesnt have
Days_to_Last_followup? Which of the above is the right one for the Analysis?