Hello everyone,
When I look at RNA-Seq mapping in IGV, why reads are aligned to the opposite strand?
In the example below, the gene is annotated in the reverse strand (which I understand as a synonym of negative/minus strand), but the reads are aligned to the positive strand (foward/plus strand). It's because is cDNA? (I used Group alignments by first-in-pair strand and Color alignments by first-of-pair strand options in IGV)
Sorry for the silly question, but I could not figured out it by myself...
Many thanks.
I think this has to do with the RNA-seq library prep protocol. There are "stranded" protocols that will always produce reads from the +/- strand. See this for example.
Yes, it's a strand specific protocol (dUTP).
But this is not an exception, this is the rule: for all genes I've seen, the reads are aligned to the opposite strand.
Maybe should I use strand info when mapping (i.e. in bowtie2)? Or I'm using wrong options in IGV (Group alignments by first-in-pair strand and Color alignments by first-of-pair strand)?
Maybe I'm confused here, but doesn't that simply mean that your protocol is minus-strand specific? You can check that by running your aligned bam file through Qualimap. The report will tell you the proportion of +/- alignments.
I ran infer_experiment.py (from RSeQC package) and the results pointed to RF orientation of pairs, which I think you mean "minus-strand specific".
if your data is paired end, then (depending on the protocol) one read of the pair will be sense orientation and the other read will be antisense orientation. See https://www.biostars.org/p/285757/ for more details