23 months ago
esimonova.me ▴ 20
I see a range of papers where, instead of real sequencing data, people use simulated datasets or pseudoreplicates to show that their bioinformatics pipeline reaches high precision, recall, and F1-score. Can anyone please discuss the advantages and disadvantages of this approach to data analysis, and why it is used? I am new to bioinformatics, so I have not been able to work it out so far.
Also, do "pseudoreplicate" and "simulated data" mean the same thing?
Example of a paper with simulated data: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-2928-9