For further analysis, I want to create the same genetic data set from scratch in Python, in two different "formats": one representing it as SNP data, the other as methylation data. I have no idea where or how to start. Has anybody worked with this before, or is there a good website that explains how to do it? Thanks for your help!
I want to run subsequent mathematical analyses on a data set that does not exist yet and that must have specific features: one version contains SNP data and the other contains methylation data (the type of methylation does not matter). Both versions describe the same underlying data, once as SNPs and once as methylation. With real data, it is not known in advance whether a given effect is present, so I want to model the data from scratch. Does anybody know how to create SNP/methylation data from scratch in Python? Is there anything specific I need to take into account when creating the data?
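
To make the goal concrete, here is a minimal sketch of what such a paired simulation might look like, using only Python's standard `random` module. Everything in it is an assumption made for illustration rather than an established method: the function name `simulate_paired_data`, coding genotypes as minor-allele counts (0, 1, 2) drawn under Hardy-Weinberg equilibrium, representing methylation as beta values in [0, 1] drawn from a Beta distribution, and linking the two versions through a planted per-allele shift `effect`. "Same data" is interpreted here as the same individuals measured at the same sites.

```python
import random


def simulate_paired_data(n_samples=100, n_sites=50, effect=0.1, seed=0):
    """Return (genotypes, methylation), two n_samples x n_sites lists of lists.

    All modelling choices are illustrative assumptions, not a standard.
    """
    rng = random.Random(seed)  # seeded generator, so runs are reproducible

    # Assumed: one minor-allele frequency per site, uniform on [0.05, 0.5].
    maf = [rng.uniform(0.05, 0.5) for _ in range(n_sites)]
    # Assumed: one baseline methylation level per site, away from 0 and 1.
    baseline = [rng.uniform(0.2, 0.8) for _ in range(n_sites)]
    # Assumed: concentration of the Beta noise (larger means less noise).
    concentration = 50

    genotypes, methylation = [], []
    for _ in range(n_samples):
        g_row, m_row = [], []
        for j in range(n_sites):
            # Genotype = number of minor alleles: two independent allele
            # draws, which corresponds to Hardy-Weinberg equilibrium.
            g = int(rng.random() < maf[j]) + int(rng.random() < maf[j])

            # Planted effect: each minor allele shifts the expected
            # methylation level by `effect` relative to the heterozygote.
            mean = baseline[j] + effect * (g - 1)
            mean = min(max(mean, 0.01), 0.99)  # keep Beta parameters valid

            # Beta value in [0, 1] with expectation `mean`.
            m = rng.betavariate(mean * concentration,
                                (1 - mean) * concentration)

            g_row.append(g)
            m_row.append(m)
        genotypes.append(g_row)
        methylation.append(m_row)
    return genotypes, methylation


if __name__ == "__main__":
    snps, meth = simulate_paired_data(n_samples=5, n_sites=4, seed=42)
    for g_row, m_row in zip(snps, meth):
        print(g_row, [round(m, 2) for m in m_row])
```

Setting `effect=0` gives a null data set in which methylation is independent of genotype, so the same analysis can be run with and without the planted effect. Real data has further structure that this sketch ignores, for example correlation between neighbouring sites.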