Question: Training Augustus for Maker
gravatar for Rox
4.0 years ago by
France / Toulouse / GeT-Plage
Rox1.2k wrote:

Hi everyone !

I'm aiming to fully assemble and annotate the D. suzukii genome. For the annotation part, it is my first time, I already asked a lot of questions here, and get a lot of answers, but some points remains very unclear to me, even in the Maker tutorial ( ).

In their tutorial, they fully described the procedure to train SNAP. So we have to launch several times maker in order to produce a final pyu.hmm, corresponding to the "final" SNAP model that I'm going to give to my "final" maker.

I was looking for something similar for Augustus, but couldn't really find what I wanted.

So what I did, is to run myself Augustus, first abinitio, and then giving hints (some RNAseq data that correspond to my organism).

I was aiming to give theses 2 augustus output file to maker using for example the maker_opts.ctl :

est_gff= #aligned ESTs or mRNA-seq from an external GFF3 file
pred_gff= #ab-initio predictions from an external GFF3 file
model_gff= #annotated gene models from an external GFF3 file (annotation pass-through)
other_gff= #extra features to pass-through to final MAKER generated GFF3 file

But as you see, there is a lot of places where I could provide my augustus GFF, and I'm not really confident about how theses parameters are similar, or how they affect the final annotation... Also not sure that I'm doing the right thing, and that I'm really "training" augustus or not.

Can I have some advices please ?



maker annotation augustus • 5.9k views
ADD COMMENTlink modified 3.9 years ago by Juke344.5k • written 4.0 years ago by Rox1.2k

Hi, Roxane!

I am annotating my plant genome now and am working with Maker and the same detailed tutorial. My question is re: SNAP now. How do you actually know that it is enough to train and you can run your final Maker run. I have tried to run it several time and there is a difference in the number of genes every time. It is actually a kind of sinusoid graph - number of genes are going up and down... So when do you stop? Or how do you know that SNAP is trained? Do you wait until the plateau? How many times did you do the training and why?

ADD REPLYlink modified 21 months ago • written 21 months ago by alslonik140
gravatar for Juke34
3.9 years ago by
Juke344.5k wrote:

Hello ! Me again ...

You cannot train Augustus within MAKER. You can only create an annotation with MAKER that will be used to train Augustus aside.

Here links that explain how train Augustus:
gene set filter/selection for training ab initio annotation tools

But I advise you to not train it ! Yes, I'm quite sure that the Drosophila melanogaster hmm profile already available into Augustus will be better that the one you will create.

ADD COMMENTlink modified 7 months ago • written 3.9 years ago by Juke344.5k

Sorry for the late reply, I was busy with an other project, I'm currently reading the links you advised me, and they helped me a lot. Thanks !

ADD REPLYlink written 3.4 years ago by Rox1.2k
gravatar for mafireyi
3.9 years ago by
South Africa
mafireyi80 wrote:

Hi. You can use BRAKER to train Augustus. See a previous discussion on this here

A: Braker output into maker (Annotation)

ADD COMMENTlink written 3.9 years ago by mafireyi80

As they says in the paper, If you have only EST, BRAKER is better, but If you have EST and proteins, using MAKER still gives a better result.

ADD REPLYlink written 3.9 years ago by Juke344.5k

and if u have proteins from a closely related organism?

ADD REPLYlink written 3.9 years ago by mafireyi80

Yes ! Most of times it is like that, you don't have the proteins of the species you want to annotate.

ADD REPLYlink written 3.8 years ago by Juke344.5k
gravatar for Juke34
3.9 years ago by
Juke344.5k wrote:

You can also use BUSCO to train Augustus

ADD COMMENTlink written 3.9 years ago by Juke344.5k

I have run BUSCO with the "--long" option and it seemed to have completed successfully, but I don't see where the optimized augustus metaparameters are stored. I have looked in the augustus config directory and the BUSCO output directory. Is there somewhere else it is placed?

ADD REPLYlink written 3.8 years ago by kcamnairb40

It should be in $AUGUSTUS_CONFIG_PATH/species/yourspecies folder I guess.

ADD REPLYlink written 3.8 years ago by Juke344.5k

Did you ever figure this out? I'm trying to train Augustus with BUSCO as well, and despite augustus.log file claiming "Writing exon model parameters [1] to file ..." in AUGUSTUS_CONFIG_PATH, there is no output that I can find.

ADD REPLYlink written 3.5 years ago by bifobear0

I know it's been a really long time since this was posted but I had to deal with this myself a couple of days ago.

So, the Augustus training parameters generated by BUSCO when you use the --long option, are saved in the current working directory (i.e. the directory from which you called BUSCO); they are NOT saved in the Augustus config/species/ directory.

More specifically, it should be in the run_XXX/augustus_output/retraining_parameters/ directory, where XXX is the value you gave to the -o option when running

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by Panos1.7k

It depends which version you use. Version 3 is like you said, version 1 is like I said, version 2 I don’t remember well but I think it was like version 1.

ADD REPLYlink written 2.4 years ago by Juke344.5k

The species folder where the hmm models are stored are located within the Augustus installation folder. You can try the command

which augustus

And you will know where to look at.

ADD REPLYlink written 3.4 years ago by Juke344.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 732 users visited in the last hour