Question: Opinion on best way to taxonomically classify a genome
0
gravatar for msobol
10 months ago by
msobol20
msobol20 wrote:

Hello,

I am trying to taxonomically classify two fungal genomes. I have read a couple ways to do this.

  1. Align the genome to reference genomes using programs such as Mummer or Muscle, then build a maximum likelihood tree with RaxML.

  2. Determine common orthologs with Busco or Cegma from the genomes and reference genomes, align and tree.

In your opinion, which is the best? Is there another way to taxonomically identify an organism from their genome?

Thanks in advance, Morgan

ADD COMMENTlink modified 10 months ago by Carambakaracho1.4k • written 10 months ago by msobol20
1

Can you provide more information? From your question I can't really deduct what you want to achieve.

You have no clue which fungal genomes you have? There are apparently reference genomes available?

Even with the brief info at hand I don't think approach 1 will get you very far ... depending on the divergence between the genomes and the reference, nucleotide level will give too little resolution I'm afraid

ADD REPLYlink written 10 months ago by lieven.sterck5.5k

Based on the 18S rRNA we know it's closely related to Penicillium, however, these genomes are from environmental samples so there is uncertainty about how closely related the entire genome is to the reference. From what I can understand, you compare the genome to several reference genomes you believe are closely related then tree the alignment to see how similar/dissimilar it is to the references. I have seen this done with the entire genome itself or the orthologous genes found within the genomes, my question is asking which way is the most accurate. Does this make more sense?

ADD REPLYlink written 10 months ago by msobol20

my first thought would be to go for the orthologous genes then. It's a bit more of a straightforward approach compared to the genome option. On the other hand, if the species are very closely related the proteins might be too conserved (==similar) and might not provide enough resolution. In that case you can use the complete genome indeed.

ADD REPLYlink modified 10 months ago • written 10 months ago by lieven.sterck5.5k
0
gravatar for Carambakaracho
10 months ago by
Carambakaracho1.4k
Switzerland/Basel
Carambakaracho1.4k wrote:

There's other options than doing full genome alignments

Kraken http://ccb.jhu.edu/software/kraken/MANUAL.html

FastANI https://www.biorxiv.org/content/early/2017/11/27/225342 though only published for prokayotes, it's worth a try on fungal genomes

ADD COMMENTlink modified 10 months ago • written 10 months ago by Carambakaracho1.4k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 835 users visited in the last hour