Adding Colourspace Readdata To A Bwa .Sam
1
0
Entering edit mode
12.7 years ago
Russh ★ 1.2k

Hey everyone

I have some .sam issues at the moment. I'm readdressing a BWA/GATK workflow I was using a couple of months ago and would like to be able to use GATKs base quality recalibration tools (CountCovariates / TableRecalibration).

Now, my SOLiD reads were converted to (fake)basespace prior to mapping with BWA. The resultant .bam can't be used in the GATK recalibration tools [despite my having reordered / sorted / indexed to conform to GATKs other requirements] because the original colourspace read data isn't present in the .bam.

So my question is: given a .csfasta file and a .sam file, is there a particularly efficient way to look up a read in the former and add its colourspace readdata (CS:Z:... and CQ:Z:...) to the latter, iterating over all of the reads in the .sam?

All the best, Russ

next-gen sequencing bwa • 2.3k views
ADD COMMENT
1
Entering edit mode
12.7 years ago
mylons ▴ 130

You are doing the analysis incorrectly. Converting SOLiD data to base space prior to mapping destroys the power of colorspace all together.

I answered an unrelated question about mapping here that you should check out: A: Mapping Paired End Solid Data Using Bwa

If you map the data using a colorspace aware mapper, ie bfast, this problem is removed from your workflow and you don't have to use a kludge to fix it.

ADD COMMENT
0
Entering edit mode

oh hi, thanks for your reply. I also did the mapping with SHRiMP, and was keen to compare the two. All the best.

ADD REPLY
0
Entering edit mode

I haven't used SHRiMP myself, but if computational resources aren't an issue for you compare your results to a colorspace aware mapper. My intuition is the colorspace aware algorithm will be superior

ADD REPLY

Login before adding your answer.

Traffic: 2127 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6