The recent UniProt website is fabulous, containing a complete collection of links to different resources. UniProt itself could be a very good resource for data mining for biological information.
I have a few questions regarding the SNP information in UniProt.
1. SNP information could be in two different section: Pathology and Biotech section and Sequence section. Is this from the annotation of disease SNP in dbSNP?
2. I want to identify disease SNP from natural variance. From xml file of UniProt (like: http://www.uniprot.org/uniprot/Q96CV9.xml), are all "feature" entries of variant with "evident" key disease related? Which is like following line:
<feature type="sequence variant" description="In GLC1E." id="VAR_021537" evidence="16 17 18">
3. How does UniProt do the mapping between dbSNP and UniProt? Just by sequence alignment of mRNA? Is the mapping complete and correct? Or there is any heuristic behind.