Any tools available to generate fake raw reads from assembled genomes?
3
3
Entering edit mode
7.8 years ago
Jenez ▴ 540

Hello fellow bioinformaticians!

I was wondering if anyone had any clue as to whether or not a tool exists for generating dummy/fake reads from an already assembled genome? So far my searches have yielded very little...

Essentially, the ideal software would take a single-contig assembled genome and generate simulated raw reads based on some input criteria such as desired average coverage, level of base quality, etc... .

People are not always nice enough to release the raw reads from their sequencing projects, hence simulated raw reads is the second best thing.

I realize this might be a very difficult task, but assuming no tool exists for this, do you think it would be possible to make a decent one?

Thank you for the input!

Assembly raw reads dummy reads fake reads • 3.2k views
ADD COMMENT
1
Entering edit mode

I second using randomreads.sh from BBMap but in case you want more options take a look at this thread: NGS reads simulation

ADD REPLY
0
Entering edit mode

Oh shit I certainly did not google this well enough. Thanks!

ADD REPLY
4
Entering edit mode
7.8 years ago

BBMap's 'randomreads' does exactly what you need.

ADD COMMENT
0
Entering edit mode

Thanks mate!. I need to better my search skills...

ADD REPLY
1
Entering edit mode
7.8 years ago
ivivek_ngs ★ 5.2k

If you are in need for generating simulated RNA-Seq raw reads then you have polyester that should be able to do the trick for you.

ADD COMMENT
1
Entering edit mode
7.8 years ago
Gabriel R. ★ 2.9k

I have used ART http://www.niehs.nih.gov/research/resources/software/biostatistics/art/ I have been satisfied with the results.

ADD COMMENT

Login before adding your answer.

Traffic: 2453 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6