Why reads not match to the genome? too many wrong bases to the genome
2
0
Entering edit mode
6.6 years ago
maple964 • 0

Dear all,

I am very new in this field. I faced problems that I could not find the answer.

  1. I used IGV to see the mapped reads to the Arabidopsis genome. However, I found this. enter image description here Why reads mapped to the wrong position ? or the genome reference is a problem. Does it suggest something wrong with the mapping procedure? Why can reads be mapped to the position with so many different bases?

The following is tophat code.

tophat -p 12 -o 20170905_tophat_WT -G /usr1/thhuang/index/TAIR10_GFF3_genes_transposons.gff /usr1/thhuang/index/TAIR10_chr Trim_WT.fastq.gz

Hope to find a solution

Thank you all in advance

RNA-Seq rna-seq IGV tophat • 2.1k views
ADD COMMENT
1
Entering edit mode

Looks like you mapped against a different genome than you're using in IGV.

ADD REPLY
1
Entering edit mode

While the command line and the image say TAIR10 there is certainly the possibility that something is wrong as far as the genome build is concerned.

Surprising that IGV still allows one to select an incorrect genome.

ADD REPLY
1
Entering edit mode

This is because of the oversimplification in SAM spec where the mismatch is reported as an M so we can only figure the mismatches from the MD tag. But the MD tag is optional hence, by default, the SAM spec does not require the aligner to report how many mismatches the alignment contains...let's just think about that for a second...

If the MD tag does not match what IGV sees then there is a discrepancy. Now it becomes a user interface issue. How many mistakes until you show the error. If it is too strict it just brings about another set of problems.

ADD REPLY
1
Entering edit mode
6.6 years ago

You are most likely visualizing the alignments against a different genomic build than what the aligner used.

Ensure that the reference genomes are the same in both cases.

ADD COMMENT
0
Entering edit mode
6.6 years ago
maple964 • 0

Thank you all for the reply! I am appreciated. I downloaded a new reference and did the mapping using tophat again. The results showed normal.

ADD COMMENT

Login before adding your answer.

Traffic: 2665 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6