Browsing MGnify IDs and clusters
1
0
Entering edit mode
13 months ago
biomarco ▴ 50

I'm trying to familiarize with the MGnify database (https://www.ebi.ac.uk/metagenomics/) since I believe it would be a valuable resource for my research.

I'm encountering some difficulties with how to use it and would very much appreciate some help from you guys.

Basically I have 2 problems: 1) When I search a protein sequence into their clustered database (Sequence search button) I get to a page with many results. What's the reason why if I directly search for one of those accession ids using the search box on the homepage it doesn't find anything? (e.g. MGYP000524092069 won't work)

2) Since the search results are cluster representatives, how do I get to the whole cluster items?

metagenomics mgnify ebi • 656 views
ADD COMMENT
1
Entering edit mode
13 months ago
biomarco ▴ 50

I follow up with the solutions I found.

Question 1) To retrieve the sequences from accassion ids it seems not possible to use the MGnify website. Just look them up in the ESM Atlas as follows: https://api.esmatlas.com/fetchSequence/MGYP000611865938 It is also documented here: https://esmatlas.com/about

Question 2) To know the cluster items you have to look for the representatives in the file mgy_cluster_seqs.tsv.gz from http://ftp.ebi.ac.uk/pub/databases/metagenomics/peptide_database/current_release/

ADD COMMENT

Login before adding your answer.

Traffic: 2607 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6