I am analysing clinical data of patients with cancer from GDAC (downloaded from FireHose).
Some columns are dates in year. For example, I have "date of initial pathologic diagnosis" or "year of tobacco smoking onset", which can be dates like 2011, 2009, 2005, etc.
What can I do with these? I would want to analyse the period of time in years between this data and "now" (i.e. when the data was collected). For instance, I don't think the year of smoking onset is interesting as such, but if you know how many years the patients has been smoking, it would definitely be interesting...
Any way can I do something with those? How can I know in which year the data was collected? Have you ever used this type of data?