I'm stiil not able to figure out how Bowtie2 deal with different Phred quality encoding.
In my recent post:
enter link description here
genomax pointed me to use --phred64 flag when using Bowtie2 to map Phred+64 encoded reads.
I tried both --phred64 and --phred33 flags to map the same reads to reference. All my reads have the same quality score in each position (just short example):
@Seq=TGCAGCTTGCTGGTCATTGAGTGGAGCTGGGTCTTAGCCTTGAGATGGA
TGCAGCTTGCTGGTCATTGAGTGGAGCTGGGTCTTAGCCTTGAGATGGA
+
fffffffffffffffffffffffffffffffffffffffffffffffff
@Seq=TGCAGCTTGCTGGTCATTGAGTGGAGCTGGGTCTTAGCCTTGAGGTGGA
TGCAGCTTGCTGGTCATTGAGTGGAGCTGGGTCTTAGCCTTGAGGTGGA
+
fffffffffffffffffffffffffffffffffffffffffffffffff
Obiously the reads are Phred+64 encoded. My first command line was:
bowtie2 --very-sensitive --phred64 -x index -U ForAlign.fa.gz -S AlignFullvs_test_64.sam
In the log file i've found:
249631 reads; of these:
249631 (100.00%) were unpaired; of these:
6787 (2.72%) aligned 0 times
199271 (79.83%) aligned exactly 1 time
43573 (17.45%) aligned >1 times
97.28% overall alignment rate
Further, i used the same command but changed --phred64 to --phred33 flag:
bowtie2 --very-sensitive --phred33 -x index -U ForAlign.fa.gz -S AlignFullvs_test_33.sam
In the log file:
249631 reads; of these:
249631 (100.00%) were unpaired; of these:
7132 (2.86%) aligned 0 times
200810 (80.44%) aligned exactly 1 time
41689 (16.70%) aligned >1 times
97.14% overall alignment rate
So, as you can see, the overall statistics was almost the same, there were not any errors or at least warnings in both runs. My questions are:
- How
Bowtie2can mapPhred+64encoded reads with--phred33flag without any errors?- Are the both resulted
SAMfiles correct?
- Are the both resulted
You can also change the Q scores to
phred+33by using the following program from BBMap suite before mapping: