I'm trying to test the performances of different read correctors on metagenomic data. My analysis step are as followed: 1 - Create a metagenomic dataset without sequencing errors using my own genome set (I want to control the divergence between species in the sample). 2 - Insert errors into the reads 3 - Execute multiple read correctors on this dataset. 4 - Compare their results with the reads from step 1.
Everything is working except the step 1. I tried to use CAMISIM. But the light documentation is a huge problem (I talking a lot with the creator to understand the details). Then, I tried to use InSilicoSeq that is easy to execute. But this is impossible (for now) to have perfect reads at the end of step 1.
So, do you have suggestions for this step ?