How to deal with missing genotype (GT) data for machine learning
10 weeks ago

Hi,

I have genotype data with missing calls (./.) and I want to encode it for use with machine learning algorithms.

My genotypes are 0/0, 0/1, 1/1 and ./. How can I encode them for ML algorithms?

Thanks.

missing_data • 490 views

What is the scientific question you're trying to answer?

You can either 1) encode them as NA, the special value used to mark missing information ("Not Available"), or 2) impute the missing genotypes prior to any downstream analysis.
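To make the two options concrete, here is a minimal sketch in plain Python. It assumes diploid calls written as VCF-style GT strings and encodes each one as the number of non-reference alleles (0, 1 or 2), with NaN standing in for NA. The function names `encode_gt` and `impute_most_common` are made up for this illustration, and filling with the most common genotype is only a naive placeholder for option 2, not what dedicated genotype imputation software does.

```python
import math
from collections import Counter


def encode_gt(gt):
    """Map a GT string such as '0/1' to a non-reference allele count, or NaN if missing."""
    # Treat phased ('|') and unphased ('/') calls the same way.
    alleles = gt.replace("|", "/").split("/")
    if any(a == "." for a in alleles):
        return math.nan  # option 1: keep the call as missing (NA)
    return float(sum(int(a) != 0 for a in alleles))


def impute_most_common(column):
    """Naive placeholder for option 2: fill NaN with the most frequent observed value."""
    observed = [x for x in column if not math.isnan(x)]
    if not observed:
        return list(column)  # no observed calls for this variant; leave it missing
    fill = Counter(observed).most_common(1)[0][0]
    return [fill if math.isnan(x) else x for x in column]


# One variant across five hypothetical samples.
calls = ["0/0", "0/1", "./.", "1/1", "0/0"]
encoded = [encode_gt(g) for g in calls]
imputed = impute_most_common(encoded)
```

Whether to keep the NaN values or fill them depends on the algorithm: some ML methods accept missing values directly, while others need a complete numeric matrix.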

How can I impute the missing genotypes? Are there any methods for imputation? Can you help me?

Yes, there are multiple posts on Biostars about this. This one is probably a good place to start:

Genotype Imputation