I am working on TCGA Datasets and have few questions regarding this. My work is mainly involved around identifying genes which are disease-causing, i.e directly or by secondary activation. going forward, I have few doubts and need your opinion on the following
- Can TCGA datasets be used to predict prognosis for Indian Population as well? (Results from the TCGA analysis) since all samples are sourced from a particular geographic location - Demographic differences.
- What sample size is the best sample size? or How much is good or how much is too much. your opinion/ view on this is valuable and It would be really helpful if you can assist me in any kind of literature available on this.
Thanks a lot for your kind help.
PS: I am a non-native English speaker, Please pardon any mistakes.