Question: I Have An Excel File Including Some Regions And I Need The Genes Located In Each Region As Out Put
2
gravatar for Omid
8.0 years ago by
Omid560
Netherlands
Omid560 wrote:

I have an Excel file including positions like below (based on hg 18) chr1:195005320-195066067

chr16:16462059-16755404

chr17:41527505-41719986

chr3:176568786-177212247

chr15:41676019-41735779

chr1:195011143-195066066

chr6:32595201-32633890

... I want to known which genes are located in each region in a seprate column in front of each region.Thanks

gene • 1.4k views
ADD COMMENTlink modified 8.0 years ago by Boboppie530 • written 8.0 years ago by Omid560
2
gravatar for Kieren Lythgow
8.0 years ago by
Southampton
Kieren Lythgow110 wrote:

Biomart will retrieve the genes that reside in these regions just follow this link:

http://www.biomart.org/biomart/martview/0cdc89146700420ccb5d1ccf961c219f

Use the filters to specify the chromosomal coordinates and the attributes to generate the coordinates and genes within those regions. This can then be exported as a csv file directly into excel.

ADD COMMENTlink written 8.0 years ago by Kieren Lythgow110
1

Or you may first map your NCBI36 (hg18) regions to GRCh37 (hg19) using the Ensembl Assembly mapper (http://www.ensembl.org/tools.html) and then use BioMart for the most recent version of Ensembl to retrivee your genes.

ADD REPLYlink written 8.0 years ago by Bert Overduin3.6k

warning: your link is directed to GRCh37 not hg18.

ADD REPLYlink written 8.0 years ago by Pierre Lindenbaum123k

thanks but in biomart there is not hg 18.That is hg 19.

ADD REPLYlink written 8.0 years ago by Omid560

it is hg18 if you trace the Ensembl archive back to v54. you may then build your query based on hg18, although the genes in there will be from v54 too.

ADD REPLYlink written 8.0 years ago by Jorge Amigo11k
2
gravatar for Pierre Lindenbaum
8.0 years ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum123k wrote:

google for your query returns the following answer on the ucsc mailing list for the table browser: http://genome.ucsc.edu/cgi-bin/hgTables?command=start

To input a batch of regions and output a list of gene names go to the 
table browser and after selecting your assembly of interest select:

     group: Genes and Gene Prediction Tracks
     track: UCSC genes
     table: knownGene

You can paste in regions by clicking on "define regions".

Then select:

     output format: "selected fields from primary and related tables"

Once you click on "get output" this will take you to a menu where you
can specify which fields you want to retrieve from associated tables. 
Now in the "...kgXref fields" section you can select "geneSymbol" or 
other indentifiers of interest to be included in the output. Then click 
"get output".
ADD COMMENTlink modified 4 weeks ago by RamRS24k • written 8.0 years ago by Pierre Lindenbaum123k

I really appreciate Pierre, The problem is: when I select geneSymbol, in out put file I will get all names of one gene and that would be confusing specially when the region is long and there are several genes in that region.Do you know any solution for it?

ADD REPLYlink written 8.0 years ago by Omid560
2
gravatar for Boboppie
8.0 years ago by
Boboppie530
Cambridge, UK
Boboppie530 wrote:

You can also try genomic region search on metabolicMine - http://www.metabolicmine.org/beta/genomicRegionSearch.do

ADD COMMENTlink written 8.0 years ago by Boboppie530
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1909 users visited in the last hour