Hi, I'm running the MAKER annotation pipeline for the first time. According to the MAKER tutorial, the output folder should contain several files and a directory:

contig-dpp-500-500.gff
contig-dpp-500-500.maker.proteins.fasta
contig-dpp-500-500.maker.transcripts.fasta
run.log
theVoid.contig-dpp-500-500


But my output folder contains only the following files. Why weren't the protein and transcript sequence files generated?

contig-dpp-500-500.gff
run.log
theVoid.contig-dpp-500-500


Have you run the MAKER scripts fasta_merge to create the protein/transcript fasta files and gff3_merge to create the final gff file?

Not yet. Could you please give the syntax for both?

./fasta_merge

Synopsis:

fasta_merge -d maker_datastore_index.log
fasta_merge -o genome.all -i <fasta1> <fasta2> ...

Descriptions:

This script will take a MAKER datastore index log file, extract all
the relevant fasta files and create fasta files with relevant
categories of sequence (i.e. transcript, protein, GeneMark protien,
etc.).  For this to work properly you need to be in the same directory
as the datastore index.

Options:

-d The location of the MAKER datastore index log.
-o Alternate base name for the output files.
-i A optional list of files to process along with or instead of the
datastore.

./gff3_merge

Synopsis:

gff3_merge -d maker_datastore_index.log
gff3_merge -o genome.all.gff <gff3_file1> <gff3_file2> ...

Descriptions:

This script will take a MAKER datastore index log file, extract all
the relevant GFF3 files and combined GFF3 file.  The script can also
combine other correctly formated GFF3 files.  For this to work
properly you need to be in the same directory as the datastore index.

Options:

-d The location of the MAKER datastore index log file.
-o Alternate base name for the output files.
-s Use STDOUT for output.
-g Only write MAKER gene models to the file, and ignore evidence.
-n Do not print fasta sequence in footer
-l Merge legacy annotation sets (ignores already having seen
features more than once for the same contig)
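
Since both help texts say you need to be in the same directory as the datastore index, here is a minimal sketch of running the two scripts from there. The function name merge_maker_output is made up for illustration; only the -d option shown in the help above is used, and fasta_merge and gff3_merge are assumed to be on your PATH.

```shell
# merge_maker_output OUTPUT_DIR INDEX_LOG   (hypothetical helper, not part of MAKER)
# Change into the MAKER output directory, confirm the datastore index is
# there, then run both merge scripts against it.
merge_maker_output() {
    outdir=$1
    index=$2
    (
        # Subshell: the caller's working directory is left unchanged.
        cd "$outdir" || exit 1
        if [ ! -f "$index" ]; then
            echo "index not found: $outdir/$index" >&2
            exit 1
        fi
        fasta_merge -d "$index" &&
        gff3_merge -d "$index"
    )
}
```

For the tutorial data this would be called as merge_maker_output dpp_contig.maker.output dpp_contig_master_datastore_index.log.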

I ran the command below, but nothing was generated. Do you have any advice? Thanks for your help.

fasta_merge -d dpp_contig_master_datastore_index.log

Is the datastore index still present (i.e., was it accidentally removed)?

contig-dpp-500-500  dpp_contig_datastore/05/1F/contig-dpp-500-500/  STARTED
contig-dpp-500-500  dpp_contig_datastore/05/1F/contig-dpp-500-500/  FINISHED

Is the directory dpp_contig_datastore still there? I mean the directory, not the file dpp_contig_master_datastore_index.log.

dpp_contig_datastore  dpp_contig_master_datastore_index.log  maker_exe.log   mpi_blastdb
dpp_contig.db         maker_bopts.log                        maker_opts.log  seen.dbm

Sorry, I'm not sure why fasta_merge -d dpp_contig_master_datastore_index.log produces no output then, unless the directory dpp_contig_datastore/05/1F/contig-dpp-500-500/ is missing or the .fa or .gff files are missing from that sub-directory.
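
A rough way to check both possibilities at once is sketched below. It assumes the index log has three whitespace-separated columns (contig name, datastore path, status) as in the lines pasted above, and that the per-contig sequence files end in .fasta; check_datastore is a made-up name, not a MAKER script.

```shell
# check_datastore INDEX_LOG   (hypothetical helper, not part of MAKER)
# For every contig marked FINISHED in the index, report whether its
# datastore directory exists and how many .gff and .fasta files it holds.
check_datastore() {
    index=$1
    # Paths in the index are taken relative to the directory holding it.
    base=$(dirname "$index")
    awk '$3 == "FINISHED" { print $1, $2 }' "$index" |
    while read -r name dir; do
        path="$base/$dir"
        if [ ! -d "$path" ]; then
            echo "$name: MISSING directory $path"
            continue
        fi
        n_gff=$(find "$path" -maxdepth 1 -name '*.gff' | wc -l)
        n_fa=$(find "$path" -maxdepth 1 -name '*.fasta' | wc -l)
        # $(( )) strips the padding some wc implementations add.
        echo "$name: gff=$((n_gff)) fasta=$((n_fa))"
    done
}
```

Run it as check_datastore dpp_contig_master_datastore_index.log from inside the output directory. A contig that reports fasta=0 has no per-contig sequence files for fasta_merge to collect.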

Hi, MAKER still didn't generate the protein and transcript sequence files, even after running fasta_merge. Do you have any advice? Could you please help me? Thank you.

Sorry, I don't know what the problem is. Can you point us to the tutorial you are using and the commands you entered on your computer? Also, did MAKER generate .fa and .gff files in the sub-directory dpp_contig_datastore/05/1F/contig-dpp-500-500/?

http://gmod.org/wiki/MAKER_Tutorial_2013#Installation

The example from the tutorial above works on my computer and produces the following output. Note the deep levels of sub-directories. I am using MAKER 2.31.10.

working directory /genetics/elbers/test/
ls /genetics/elbers/test/dpp_contig.maker.output/dpp_contig_datastore/05/1F/contig-dpp-500-500
contig-dpp-500-500.gff  contig-dpp-500-500.maker.proteins.fasta  contig-dpp-500-500.maker.transcripts.fasta  run.log  theVoid.contig-dpp-500-500

genome=/genetics/elbers/maker-2.31.10/data/dpp_contig.fasta #genome sequence (fasta file or fasta embedded in GFF3 file)
est=/genetics/elbers/maker-2.31.10/data/dpp_est.fasta #set of ESTs or assembled mRNA-seq in fasta format
protein=/genetics/elbers/maker-2.31.10/data/dpp_protein.fasta #protein sequence file in fasta format (i.e. from multiple organisms)
repeat_protein=/genetics/elbers/maker-2.31.10/data/te_proteins.fasta


Note that dpp_est.fasta and dpp_protein.fasta are named differently in newer versions of MAKER than in the version used in the tutorial.

Thank you so much for the support and advice. Solved it: I got the protein and transcript files. :-)

Actually, the problem was in the opts.ctl file. I hadn't made a new maker_opts.ctl; I had edited the MAKER-generated opts.ctl, which included other configurations. That caused the problem.

Could you please let me know what exactly you changed in the maker_opts.ctl file to get the FASTA output? I am getting the same error, but I am not able to figure out the cause.

I created a new opts.ctl file which contains only the genome, est, protein, and est2genome settings.
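
If you want to see which settings your own control file really carries, one option is to print only the lines that have a value, as in the sketch below. It assumes the key=value #comment layout shown in the maker_opts.ctl excerpt earlier in this thread; list_set_options is a made-up name, not a MAKER tool.

```shell
# list_set_options CTL_FILE   (hypothetical helper, not part of MAKER)
# Strip comments, then print only options that have a non-empty value.
list_set_options() {
    sed -e 's/[[:space:]]*#.*$//' "$1" |
    grep -E '^[A-Za-z0-9_]+=.+'
}
```

Running list_set_options maker_opts.ctl on the edited file and on a freshly generated one, then comparing the two listings, should show any leftover settings. Default values set by MAKER will appear in both listings.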

