Hey guys, I'm trying to make a script that will take a fasta file with many sequences from one patient at different time points and then randomly sample one sequence from each time point. For example here are some sequence title names:
01P03Pr01 01P03Pr02 01P03Pr03 01P03Pr04 01P03Kr01 01P03Kr02 01P03Kr03 09P03Pr01 09P03Pr02 09P03Pr03 09P03Kr01 09P03Kr05
Then these are the random sequences that were taken out of the larger fasta file and put in a new one:
01P03Pr02 01P03Kr01 09P03Pr03 09P03Kr05
Hopefully that makes sense. I'm a beginner with coding in python and want to improve so I would appreciate a nudge or some help. I'm using random.sample in my script and i'm not really sure where to start in terms of whether or not to make a dictionary or index, or none of those. Any help would be appreciated!!!