Question: Get Snps For A List Of Gene Symbols In A Text File As A Single Column
gravatar for confusedious
7.2 years ago by
confusedious420 wrote:

Hello everyone,

I have assembled a list of all genes (~1744) that fall within a GO category of interest and would like to assemble a list of SNPs that fall within these genes. I have previously used an R package for this, but have found that this particular function doesn't work any longer (I'm guessing something has changed at the NCBI end and they aren't understanding one another anymore).

I have the list of gene symbols in a text file as a single column.

Does anyone know a quick and easy way to do this? I am sure there is one!

I would like to avoid scripting if at all possible, I am only familiar with R and even then only at a very elementary level.

Thank you for any help you might be able to offer.

ncbi ensembl snps genes ucsc • 2.2k views
ADD COMMENTlink modified 7.2 years ago by Pierre Lindenbaum131k • written 7.2 years ago by confusedious420
gravatar for Pierre Lindenbaum
7.2 years ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum131k wrote:

You could use the UCSC mysql server:

$ echo -e "EIF4G1\nEIF4G2" |\
    awk 'BEGIN {printf("select distinct,R.chrom,R.chromStart,R.chromEnd,X.geneSymbol from (snp137 as R,knownGene as G, kgXref as X ) where R.chrom=G.chrom and NOT(R.chromStart>=G.txEnd or R.chromEnd<=G.txStart) and and X.geneSymbol in (\"-9999\"");} {printf(",\"%s\"",$0);} END {printf(");\n");}' |\
    mysql --user=genome -A -D hg19

or the ucsc tables




extract and grep the field GENEINFO in the NCBI VCFs


ADD COMMENTlink modified 22 months ago by _r_am30k • written 7.2 years ago by Pierre Lindenbaum131k

I figured out how to do it BioMart - thank you.

ADD REPLYlink written 7.2 years ago by confusedious420
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 992 users visited in the last hour