How to find the location of snps in a gene, whether it is in the promoter, intron, exon or 5'UTR region?
0
0
Entering edit mode
3.2 years ago
Abbas.M • 0

Hello, I have a plant data of 300 varieties, on which I already performed GWAS. After GWAS, I have annotated the most significant snps to find candidate genes. I found some candidate genes that are predicted to be associated with my trait.

Now, I want to find the haplotypes of those candidate genes in my entire plant population. For this, I want to find all the snps in my candidate genes, and its location in my gene with respect to exon, intron, promoter, 5'UTR etc. Also I want to know, individuals(accessions) carrying those haplotypes? Can anyone suggest me, how to accomplish my task? Any tools, or programs that can do it?

I already used haploview for it, but it can only give haplotype blocks and frequency. It cannot give the full information of individuals carrying a specific haplotype. I tried it in BLAST also, by selecting pairwise with dots for identities. For this, I blasted my gene in NCBI, and selected Pairwise with dots for identities. It gives the mismatched bases in red color, and identical bases are shown as dots, as given below. But I am not sure, these are snps, or sequencing errors. Can we use blast for finding snps? Highly appreciated, if get some precious suggestions.

Alignment statistics for match #1
Score   Expect  Identities  Gaps    Strand
3055 bits(1654) 0.0 1747/1792(97%)  6/1792(0%)  Plus/Plus
Query  1        ATGGAGAAAAAGCAAGGTTTTTTCTCAGCTCTCAAAGAGGAAGTAATTCGTGGGCTTTCA  60
Sbjct  4420511  ............................................................  4420570

Query  61       CCTTCCCGCTCGAGGACCAACAGCCCCGGAAGAGCCCGGTCACCTATTGCCATTCTGTTG  120
Sbjct  4420571  ............................................C...............  4420630

Query  121      CGGAGAAAGAAAAGCGGCCACTACAACTACGGAGGCGCTTACCTGGTACAACCGGAGCCC  180
Sbjct  4420631  .....................A........................CG............  4420690

Query  181      TTGATCGCGAGGTACGGTGTCGGGGAAGCGTTAGCTCCGCTCATGGAAGGTCCCGACCCG  240
Sbjct  4420691  ....C........................T..............................  4420750

Query  241      GACGGAGGCGAAACCGGGGATTCCAAGAGGCTTGGGTTGGGGCTAGGACAATGGGTTATG  300
Sbjct  4420751  ........T...........C................C......................  4420810

Query  301      GGACAGTTATCGAGGACTCCATCCATGGCTTCCTTGAGTTGCAAAAGGTCCGATCTAAGG  360
Sbjct  4420811  ............................................................
SNP R gene genome • 555 views
ADD COMMENT

Login before adding your answer.

Traffic: 3001 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6