Insert variant into fastq file
0
0
Entering edit mode
4.9 years ago

Hi,

I have a fastq file. I would like to alter it with and insertion or deletion.

What is the best way to go about doing this?

edit: must be for use with fastq

fastq ngs • 1.8k views
ADD COMMENT
0
Entering edit mode

Why would you like to do this? If you're looking to simulate a variant, exact knowledge of where the read will map (as well as what other reads might map to the same locus) would be required, otherwise the alteration may be discarded as a sequencing anomaly.

ADD REPLY
0
Entering edit mode

I would like to insert a true positive variant event into the fastq file to test whether or not the pipeline will detect the variant inserted/deleted at a given position.

ADD REPLY
2
Entering edit mode

Seems like the easier thing to do is to insert the event into the reference.

ADD REPLY
1
Entering edit mode

Second that, there are multiple tools to mutate a fasta file. From this fasta file you can then simulate fastq reads, with which you can test your pipeline. That's the most appropriate test. Inserting a variant in a read/fastq is not a valid approach.

ADD REPLY
0
Entering edit mode

Why not do it at the SAM level? Variant callers don't usually work on FASTQ directly!

ADD REPLY
0
Entering edit mode

I was only given a fastq file to work with

ADD REPLY
2
Entering edit mode

Align it, and you'll have a SAM/BAM file. Either modify that file and get the pipeline started at that stage, or (not sure if this will work) figure out which reads align to a target location and modify those reads. With the second approach, be aware that modifying the reads can lead to them aligning to a different location.

ADD REPLY
0
Entering edit mode

I'm sure that this way makes the most sense, but I need to specifically use a fastq file, per the request of the assignment

ADD REPLY
0
Entering edit mode

assignment

Like a homework assignment or an actual project/task?

ADD REPLY
0
Entering edit mode

This is an actual project/task

ADD REPLY
0
Entering edit mode

The other way you might do this, though it's pretty bass ackwards, is to make a reference file with variants, then simulate a fastq from that reference.

ADD REPLY

Login before adding your answer.

Traffic: 2975 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6