Can anybody guide me in details on the following data set,
1.What is difference between Sequence name mentioned below (e.g. GRCh37.p13 chr3 and TLR9 RefSeqGene etc)
What is difference between Change mentioned below (NC_000003.11:g.52256697C>T andNG_033933.1:g.8483G>T and NG_033933.1:g.8483G>A )
Difference between them, how it affect on decision of SNP findings?
Genomic Placements Sequence name Change
GRCh37.p13 chr 3 NC_000003.11:g.52256697C>T GRCh37.p13 chr 3 NC_000003.11:g.52256697C>A GRCh38.p7 chr 3 NC_000003.12:g.52222681C>T GRCh38.p7 chr 3 NC_000003.12:g.52222681C>A TLR9 RefSeqGene NG_033933.1:g.8483G>A TLR9 RefSeqGene NG_033933.1:g.8483G>T
Gene: TLR9, toll like receptor 9 (minus strand) Molecule type Change Codon/Amino acid SO Term TLR9 transcript NM_017442.3:c.1635G>T CCG>CCT Coding Sequence Variant toll-like receptor 9 precursor NP_059138.1:p.Pro545= P[Pro]>P[Pro] Synonymous Variant TLR9 transcript NM_017442.3:c.1635G>A CCG>CCA Coding Sequence Variant toll-like receptor 9 precursor NP_059138.1:p.Pro545= P[Pro]>P[Pro] Synonymous Variant
Kindly guide me on same in details.
Hello,
if you describe the change based on chromosome you always take the positiv strand. If you describe the change based on the gene/transcript you take into account on which strand this gene encodes. So if a gene like TLR9 encodes on the negative strand C becomes G and A becomes T.
To your second question: "g." means you count every base that belongs to your reference (genomic). "c." means you just count every coding base with "A" of the ATG start codon is 1. Every base that belongs to non coding regions like Intron are not counted.
BTW: Have a look at hgvs to read more about the nomenclature.
fin swimmer
Aren't
c.
coordinates cDNA? So it's not just coding bases but UTR as well.Edit: Nevermind, you're correct. I apparently need more coffee today :)
Thank you very much for your kind guidance.
Please use the "ADD COMMENT" button next time, so threading is maintained.