multi species_gene name to description
2
0
Entering edit mode
3.7 years ago
Phismil ▴ 20

Dear colleagues,

I have a large set of gene names that belong to the "multiple species". I am wondering if you are aware of any pipeline that can print the description of the genes based on their names? To the best of my knowledge, the majority of the pipeline mentioned on the community require the explicit selection of the "reference species" and mainly perform enrichment analysis. The question in my case is only the description.

Example toy data look like:

EF1A_CHICK, PPN_DROME, RL8_DANRE

Thanks in advance

gene function transcriptome • 795 views
ADD COMMENT
2
Entering edit mode
3.7 years ago
Shalu Jhanwar ▴ 520

Have a look at UniProt. It contains multiple functional information (amino acid sequence, name or description, GO etc). Here is an example of it's functionality with the toy data you provided. You can add more columns by editing "columns" tab.

ADD COMMENT
2
Entering edit mode
3.7 years ago
GenoMax 141k

Using EntrezDirect:

$ more test
EF1A_CHICK
PPN_DROME
RL8_DANRE

$ for i in `cat test`; do printf ${i}"\t"; esearch -db protein -query ${i} | esummary | xtract -pattern DocumentSummary -element Title; done
EF1A_CHICK      RecName: Full=Elongation factor 1-alpha 1; Short=EF-1-alpha-1; AltName: Full=Elongation factor Tu; Short=EF-Tu
PPN_DROME       RecName: Full=Papilin; Flags: Precursor
RL8_DANRE       RecName: Full=60S ribosomal protein L8
ADD COMMENT
0
Entering edit mode

Thank you for your reply!

ADD REPLY

Login before adding your answer.

Traffic: 1999 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6