Finding gene doubles in a list of genes
3
0
Entering edit mode
7.5 years ago
Jack ▴ 120

Hello,

Suppose you are given a list of genes like this: MLL CYP2R1 MIR548AN TM9SF2 GALC DEXI CLEC16A RMI2 SHC1 etc...

and you have to check this list for those genes that have different names but actually are the same gene. What DB's would you use for that?

genes r • 1.3k views
ADD COMMENT
0
Entering edit mode

good question....I have get many times same problem like in proteomic I get like 4 different names to the same gen and at end you don't know what to do with them because isn't like I want to split them manually.

ADD REPLY
0
Entering edit mode

Are these identifiers supposed to be from a single organism or multiple?

ADD REPLY
2
Entering edit mode
7.5 years ago

You can also get gene name synonyms with the EnsEMBL API e.g. to get gene names from UniProt for a given EnsEMBL gene:

 my $Ensgene = $gene_adaptor->fetch_by_stable_id($EnsemblID);
 my @uniprots = @{$Ensgene->get_all_DBLinks('Uniprot%')};

For human genes, the reference is HGNC from which you can download a table including synonyms.

ADD COMMENT
1
Entering edit mode
7.5 years ago
mastal511 ★ 2.1k

You need a table of gene synonyms for the organism you are working with.

Here is a link to a previous discussion on biostars: /https://www.biostars.org/p/1378/

ADD COMMENT
1
Entering edit mode

Just an addition that uniprot also provides synonyms in their flat file, can be easily parsed out.

ADD REPLY
0
Entering edit mode
7.5 years ago
biofalconch ★ 1.1k

You could use RefSeq, taking into account that

The Reference Sequence (RefSeq) collection provides a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins

You can find the download links here

Have a good day :)

EDIT 1: Also, following Jean-Karim Heriche's suggestion, you can also make ID conversions depending on what type of IDs you have. You just need a way to compare tables, might be easily done in R or mySQL

ADD COMMENT

Login before adding your answer.

Traffic: 3426 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6