Batch query reference nucleotide
0
0
Entering edit mode
7.4 years ago
jennagu • 0

Hi. I have a long list of genomic locations and I would like to batch query for the corresponding reference nucleotide.

For example, here is a list of four hg19 genomic locations:

  • chr12:103246681
  • chr12:103246654
  • chr12:103246593
  • chr12:103237454

I want the reference nucleotide for each location. The answer is actually G for all four of them. I know how to get this answer by searching one at a time in the UCSC browser, but I need a way to search for hundreds at a time.

Any ideas???

Thanks!

SNP genomic reference sequence • 1.3k views
ADD COMMENT
1
Entering edit mode

If you have programming experience, there are many ways to do it. If you don't, have you tried UCSC Table Browser? Import your coordinates as a custom track in 'chr start end' format and you can get sequences from a list of coordinates very easily. Because UCSC uses 0-based coordinate, chr12:103246681 should be entered as 'chr12 103246680 103246681'.

One of your coordinates returns a T.

>hg19_ct_UserTrack_3545_0 range=chr12:103237454-103237454 5'pad=0 3'pad=0 strand=+ repeatMasking=none
T
>hg19_ct_UserTrack_3545_1 range=chr12:103246593-103246593 5'pad=0 3'pad=0 strand=+ repeatMasking=none
G
>hg19_ct_UserTrack_3545_2 range=chr12:103246654-103246654 5'pad=0 3'pad=0 strand=+ repeatMasking=none
G
>hg19_ct_UserTrack_3545_3 range=chr12:103246681-103246681 5'pad=0 3'pad=0 strand=+ repeatMasking=none
G
ADD REPLY

Login before adding your answer.

Traffic: 3792 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6