How do you split metagenomic bins/MAGs into eukaryotic and prokaryotic?
1
1
Entering edit mode
2.7 years ago
O.rka ▴ 710

I have a dataset with lots of single-celled algae, diatoms, and bacteria. I can't use my normal pipeline ([MULTIPLE BINNING PROGRAMS] -> DAS Tool -> GTDB-Tk) because the latter 2 are built specifically for prokaryotes.

How can I determine which metagenome assembled genomes [MAG] are eukaryotic? The options I know about are the following:

  • EukRep - This happens at the contig level and is ML-based. I've run "positive controls" through EukRep in the past and it couldn't identify that all the contigs were eukaryotic so I don't have complete faith in this methodology. There seemed to be a high false negative rate.;
  • BBSuite's sketch.sh - This worked pretty well in the past but AFAIK, it's heavily dependent on a database that isn't actively updated and really affected by the completeness of the genome (other methods could be as well);
  • Mash/Sourmash - Same as above. Though, I've had more errors with these.
  • BARRNAP - Identify 16S/18S then see what is more prominent. Not sure how I feel about this.

Is there something akin to GTDB-Tk but can handle eukaryotic MAGs?

To reiterate, I want to bin out my MAGs and then perform eukaryotic vs. prokaryotic classification (not on the contig level preferably). What are the recommended methods to do this?

protist metagenomics prokaryotic eukaryotic gtdbtk • 1.3k views
ADD COMMENT
1
Entering edit mode

Check out this preprint that came out today on binning eukaryotic genomes:

https://www.biorxiv.org/content/10.1101/2021.07.25.453713v1.full.pdf

ADD REPLY
0
Entering edit mode

Wow...what are the chances that a biorxiv/tool comes out specifically addressing the question that I was asking??? Pretty wild. Thanks for linking me out to this. I've seen some activity on GitHub about this so I'm excited to see they are going under peer review.

ADD REPLY
1
Entering edit mode
2.7 years ago
Mensur Dlakic ★ 27k

Is there something akin to GTDB-Tk but can handle eukaryotic MAGs?

GTDB-Tk will classify Eukarya as Asgard archaea, which have very limited geographic distribution so it is a safe bet that they are not in your sample. I have seen diatoms and fungi in my samples and that is how they get classified.

All your other choices are viable as well to corroborate or reject whatever you get with GTDB-Tk. I would use SSU-ALIGN instead of Barrnap.

ADD COMMENT

Login before adding your answer.

Traffic: 1842 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6