Hi Guys,
I have a data frame "mydata" with 200 columns and over 15,000 rows.
1. I would like to compare two columns at a time, starting from the first result column (XR_res1), and check whether their contents match. For example, I want to compare column XR_res3 with column XR_res5 and get a concordance column "result" containing a match or mismatch decision.
2. Group all 15,000 rows by "Personal_ID" and, using the match or mismatch decision in the result column, get the share of mismatches per ID in percent.
Thanks
Personal_ID | XR_res1 | XR_res2 | XR_res3 | XR_res4 | XR_res5 | XR_res6 |
---|---|---|---|---|---|---|
001 | pos | pass | pos | neg | pos | neg |
001 | pos | pass | neg | pass | pass | pass |
001 | neg | neg | neg | pos | neg | pass |
002 | pass | pos | pos | pass | pass | pos |
002 | pos | pass | pass | neg | pass | pos |
003 | pass | neg | neg | pos | pass | pos |
003 | pos | neg | pass | pass | pos | pos |
003 | pass | pos | pos | pass | pass | neg |
003 | neg | pos | neg | pass | pos | neg |
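
To make the two steps concrete, here is a minimal sketch in Python with pandas. The question does not say which language is in use, so pandas is an assumption, and the helper names `add_concordance` and `mismatch_percent` are hypothetical. It compares XR_res3 with XR_res5 on the sample rows above, writes "match" or "mismatch" into a "result" column, then reports the mismatch percentage per Personal_ID.

```python
import pandas as pd

# Sample rows copied from the table above (only the columns needed here).
# Personal_ID is stored as a string so the leading zeros are kept.
mydata = pd.DataFrame(
    {
        "Personal_ID": ["001", "001", "001", "002", "002",
                        "003", "003", "003", "003"],
        "XR_res3": ["pos", "neg", "neg", "pos", "pass",
                    "neg", "pass", "pos", "neg"],
        "XR_res5": ["pos", "pass", "neg", "pass", "pass",
                    "pass", "pos", "pass", "pos"],
    }
)


def add_concordance(df, col_a, col_b, out="result"):
    """Return a copy of df with a match/mismatch column for col_a vs col_b.

    Note: two missing values compare as unequal, so they count as a mismatch.
    """
    out_df = df.copy()
    same = out_df[col_a] == out_df[col_b]
    out_df[out] = same.map({True: "match", False: "mismatch"})
    return out_df


def mismatch_percent(df, id_col="Personal_ID", result_col="result"):
    """Return the percentage of mismatching rows for each value of id_col."""
    is_mismatch = df[result_col].eq("mismatch").astype(float)
    # The mean of a 0/1 column is the fraction of mismatches in each group.
    return is_mismatch.groupby(df[id_col]).mean().mul(100).round(1)


# Step 1: concordance column for one pair of columns.
compared = add_concordance(mydata, "XR_res3", "XR_res5")
# Step 2: mismatch share per Personal_ID, in percent.
pct = mismatch_percent(compared)

print(compared)
print(pct)
```

On the nine sample rows this gives 33.3% mismatches for 001, 50% for 002 and 100% for 003. Other column pairs can be checked by calling `add_concordance` with different column names and a different `out` name.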