I see a range of papers where, instead of real sequencing data, people use simulated datasets or pseudoreplicates to show that their bioinformatics pipeline reaches high precision, recall, and F1-score. Can anyone please discuss the advantages and disadvantages of this approach for data analysis, and why it is used? I am new to bioinformatics, so I have not been able to figure it out so far.
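For context, here is my rough understanding of how these metrics are computed when the true variants are known, as they would be in a simulated dataset. This is only a minimal sketch: the function name `score_calls` and the variant tuples are made up for illustration and do not come from any paper or tool, and I am assuming a call only counts as correct when chromosome, position, and both alleles match the truth exactly.

```python
def score_calls(truth, called):
    """Compare called variants against a known truth set.

    Both arguments are iterables of (chrom, pos, ref, alt) tuples.
    Returns (precision, recall, f1).
    """
    truth, called = set(truth), set(called)
    tp = len(truth & called)   # true positives: called and really present
    fp = len(called - truth)   # false positives: called but not in the truth set
    fn = len(truth - called)   # false negatives: in the truth set but missed

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Hypothetical example: variants planted by a simulator vs. pipeline output.
truth = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "GA"),   # an insertion
    ("chr2", 300, "T", "C"),   # this one is missed by the caller
}
called = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "GA"),
    ("chr3", 10, "A", "T"),    # a spurious call
}

precision, recall, f1 = score_calls(truth, called)
print(precision, recall, f1)
```

As far as I can tell, this kind of comparison is only possible when the full set of true variants is known, which seems to be the point of using simulated data.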

Also, do the two terms mean the same thing, i.e. is a pseudoreplicate the same as simulated data?

Example of paper with simulated data


