Question: To Simulate Heterozygous Variations In Genome
2
gravatar for Pascal
6.8 years ago by
Pascal1.4k
Barcelona
Pascal1.4k wrote:

A bit technical question here... I want to simulate an heterozygous indel starting from a piece of reference genome (FASTA file) and I thought about the following workflow to do it. Could you comment it? Is it a good idea or is there a better approach?

  1. copy of reference file (let's call it ref.fasta) to ref_insert.fasta

  2. edit ref_insert.fasta and insert a sequence of 10bp for instance,

  3. concatenate both fasta files ref.fasta into a new fasta file (ref_diploid.fasta)

  4. simulate short reads with a simulator (my favorite one is wgsim) disabling mutations generator,

  5. align and then detect variants.

genome variant • 2.0k views
ADD COMMENTlink written 6.8 years ago by Pascal1.4k
3
gravatar for Stefano Berri
6.8 years ago by
Stefano Berri4.0k
Cambridge, UK
Stefano Berri4.0k wrote:

You can start this way, but then I would leave the mutation generation on (your sample will never be exactly like the reference genome). Also, you can insert a few of different size and/or in random locations, so that you get an idea of how likely it is to pick it up.

ADD COMMENTlink written 6.8 years ago by Stefano Berri4.0k

Of course, you're right. Good point, thank you Stefano.

ADD REPLYlink written 6.8 years ago by Pascal1.4k
1
gravatar for Attila Berces
6.8 years ago by
Attila Berces10 wrote:

At Omixon we developed tools for simulating reads with various mutations. The input is a multi-fasta reference file and a VCF file. It can simulate Illumina and Ion Torrent reads with specified length and distance in case of paired data. We offer it next month as a free on-line service, but if you do not want to wait we can give you a command line version for personal use. you can fill the contact form on www.omixon.com.

ADD COMMENTlink written 6.8 years ago by Attila Berces10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1534 users visited in the last hour