Question: Canonical human sequence missing in OMA database?
14 months ago by
asente wrote:


I would like to fetch 1:1 orthologs for GBRA2_HUMAN, the canonical GABAA receptor alpha-2 subunit sequence as defined by UniProt (UniProt ID: P47869). However, this sequence does not seem to be available in the OMA database. Could you please let me know why the entry might be missing, and whether there is an alternative way to obtain the orthologs for this gene?

I tried using the OMAdb R package:

getHOG(id ='GBRA2_HUMAN', members = TRUE)

THE OMA REST API request failed: Here's the original error message: Not Found (HTTP 404).

If I call mapSequence(sequence) with the canonical sequence from UniProt, I get three matched targets, none of which is the canonical human GABAA alpha-2 subunit:

[1] "A0A2R8ZZ28" "H2QPE5" "G3QVC6"

On the other hand, other human GABAA subunits seem to be present in the database (e.g. GBRA1_HUMAN, GBRA3_HUMAN).

The GBRA2_HUMAN sequence I used for searches is the following:


Many thanks.

oma R software error • 299 views
13 months ago by
University College London
Christophe Dessimoz wrote:

The reason for this is that OMA is quite stringent when mapping IDs. For human, OMA uses the genome provided by Ensembl. The protein sequence we consider for the gene you are interested in (human gene "GABRA2") is a different isoform in OMA vs UniProt. The one we use in OMA ( which is 511 AA long and maps to a different UniProt entry (

So how can you find GBRA2_HUMAN in OMA?

1) One way would be to use the approximate search functionality, e.g. using the REST API function sequence list (setting the search parameter to "approximate").

For instance, using the library OmaDB in R: mapSequence(sequence, search = "approximate")
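As a rough sketch of how the approximate search above could be run end to end, the snippet below reads the query sequence from a local FASTA file and passes it to mapSequence. The helper read_fasta_sequence and the file name P47869.fasta are assumptions made for illustration and are not part of OmaDB. The structure of the returned object is not assumed, so it is only inspected with str().

```r
# Hypothetical base-R helper (not part of OmaDB): read a single-record
# FASTA file and return the sequence as one string, without the header.
read_fasta_sequence <- function(path) {
  lines <- trimws(readLines(path, warn = FALSE))
  # Drop the ">" header line(s) and any blank lines.
  lines <- lines[nzchar(lines) & !startsWith(lines, ">")]
  paste(lines, collapse = "")
}

# Assumed file name: the canonical P47869 sequence downloaded from UniProt.
fasta_path <- "P47869.fasta"

if (file.exists(fasta_path) && requireNamespace("OmaDB", quietly = TRUE)) {
  sequence <- read_fasta_sequence(fasta_path)

  # Approximate search, as in the call above.
  hits <- OmaDB::mapSequence(sequence, search = "approximate")

  # The shape of the result is not assumed here; inspect it first.
  str(hits, max.level = 1)
}
```

Reading the sequence from a file avoids stray whitespace or line breaks that can creep in when a long sequence is pasted into the console.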

2) Another solution would be to map via the gene name GABRA2. You can use the REST API function xref list:

For instance, using the library OmaDB in R: searchProtein('GABRA2')

This returns two human isoforms in OMA. The first one, ENSP00000427603, is the isoform that was used to infer orthologs.
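A minimal sketch of how one might go from the gene-name search to 1:1 orthologs follows. It rests on assumptions that should be checked against the OmaDB documentation: that getProtein accepts the Ensembl protein ID with attribute = "orthologs", and that the result is a data frame with a rel_type column holding values such as "1:1". The helper keep_one_to_one is hypothetical and not part of OmaDB.

```r
# Hypothetical helper (not part of OmaDB): keep only 1:1 orthologs,
# assuming a data frame with a `rel_type` column.
keep_one_to_one <- function(orthologs) {
  if (!is.data.frame(orthologs) || !"rel_type" %in% names(orthologs)) {
    stop("expected a data frame with a 'rel_type' column")
  }
  # %in% treats NA as "no match", so rows with missing rel_type are dropped.
  orthologs[orthologs$rel_type %in% "1:1", , drop = FALSE]
}

if (requireNamespace("OmaDB", quietly = TRUE)) {
  result <- tryCatch({
    # Search OMA cross-references by gene name, as suggested above.
    hits <- OmaDB::searchProtein("GABRA2")
    str(hits, max.level = 1)

    # Assumption: the Ensembl protein ID is accepted as an identifier and
    # attribute = "orthologs" returns the pairwise orthologs.
    orthologs <- OmaDB::getProtein(id = "ENSP00000427603",
                                   attribute = "orthologs")
    keep_one_to_one(orthologs)
  }, error = function(e) {
    message("OMA lookup failed: ", conditionMessage(e))
    NULL
  })

  if (!is.null(result)) print(head(result))
}
```

If the returned object has a different shape, the helper stops with an error, which is reported by the tryCatch handler instead of silently returning nothing.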


Dear Christophe, many thanks for your helpful and detailed answer. Best wishes.

written 13 months ago by asente