I noticed that the SNP in dbSNP have the attribute 'strand', I was wondering what's the meaning of the strand because the DNA is complementary, the SNP in Watson strand can also be observed in Crick strand. And if one SNP is labeled in plus strand, can I map this SNP to the gene in minus strand when the chromosome location is proper?
How can a SNP be on the antisense / -ve strand? If the opposite strand's base is to be complementary and therefore also a SNP, it seems redundant to label a strand..?
I read that were are occasionally mismatch errors across the DNA helix in that A binds opposite C or G..? Is that the reason?
The SNP is of course on both strands, but was mapped to one strand or the other because one strand of DNA is sequenced against a template of the other strand. In other words, strand A was sequenced and a SNP was found and hence mapped to a position on strand A and incorporated into a chromosome map also on that same strand. It could be that there is a gene in that same region but on strand B and so the SNP appears to be on the opposite strand of the gene.
Yes, if one SNP is labeled or mapped to the plus strand, you can indeed map that SNP to a gene located on the minus strand. Adrian's answer is good - the A/T and G/C SNPs can be problematic due to strand mapping. Often, it can be difficult or impossible to know which strand actually has been genotyped, but you would still know which SNP was genotyped and that may override any strand concerns.
So, the gene really maps to both strands because a gene is double-stranded. The reading frame of the gene may map to the minus strand of the chromosome and some SNP assays or reported SNPs map to the plus strand. If this assignment needs to be confirmed, then use a simple alignment tool or sequence similarity search (BLASTN) to find where and which strand of the gene and SNP match each other. With the hundreds of SNPs that I analyze in close detail, I need to perform this only once or twice per 100 SNPs. In other words, strand designation of the SNPs is not critical for most work on genetic variation and association studies.
I'm just a touch confused.
How can a SNP be on the antisense / -ve strand? If the opposite strand's base is to be complementary and therefore also a SNP, it seems redundant to label a strand..? I read that were are occasionally mismatch errors across the DNA helix in that A binds opposite C or G..? Is that the reason?
The SNP is of course on both strands, but was mapped to one strand or the other because one strand of DNA is sequenced against a template of the other strand. In other words, strand A was sequenced and a SNP was found and hence mapped to a position on strand A and incorporated into a chromosome map also on that same strand. It could be that there is a gene in that same region but on strand B and so the SNP appears to be on the opposite strand of the gene.