Issues with BRAKER2 (v.2.1.6)--not producing "genemark_hintsfile.gff.prot.tmp"
0
0
Entering edit mode
2.8 years ago
vizzerraa • 0

Hi, BRAKER2 (v.2.1.6) is not producing the "genemark_hintsfile.gff.prot.tmp" file after ProtHint finishes, what could be causing this issue?

The full command was

/data/selaginella/anaconda3/envs/braker2_conda_env/BRAKER-2.1.6/scripts/braker.pl --genome=/data/selaginella/DATA/annotations/braker_wallacei_test/a.lines.fa --bam=/data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam --species=wallacei --prot_seq=/data/selaginella/DATA/annotations/braker_wallacei_test/protein.fa --ALIGNMENT_TOOL_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/gth-1.7.3-Linux_x86_64-64bit/bin --GENEMARK_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/gmes_linux_64/ --AUGUSTUS_SCRIPTS_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/augustus-3.4.0/scripts --GUSHR_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/GUSHR --PROTHINT_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/ProtHint/bin --etpmode --UTR=on --softmasking --cores=8 

Please find the braker.log attached below. There is no content in any of the error files either.

#**********************************************************************************
#                               BRAKER CONFIGURATION
#**********************************************************************************
# BRAKER CALL: /data/selaginella/anaconda3/envs/braker2_conda_env/BRAKER-2.1.6/scripts/braker.pl --genome=/data/selaginella/DATA/annotations/braker_wallacei_test/a.lines.fa --bam=/data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam --species=wallacei --prot_seq=/data/selaginella/DATA/annotations/braker_wallacei_test/protein.fa --ALIGNMENT_TOOL_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/gth-1.7.3-Linux_x86_64-64bit/bin --GENEMARK_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/gmes_linux_64/ --AUGUSTUS_SCRIPTS_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/augustus-3.4.0/scripts --GUSHR_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/GUSHR --PROTHINT_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/ProtHint/bin --etpmode --UTR=on --softmasking --cores=8
# Wed Jun 16 21:54:15 2021: braker.pl version 2.1.6
# Wed Jun 16 21:54:15 2021: Configuring of BRAKER for using external tools...
# Wed Jun 16 21:54:15 2021: Trying to guess $AUGUSTUS_BIN_PATH from $AUGUSTUS_CONFIG_PATH.
# Wed Jun 16 21:54:15 2021: Setting $AUGUSTUS_BIN_PATH to /data/selaginella/anaconda3/envs/braker2_conda_env/bin/../config/../bin
# Wed Jun 16 21:54:15 2021: Setting $AUGUSTUS_SCRIPTS_PATH to command line argument --AUGUSTUS_SCRIPTS_PATH value /data/selaginella/anaconda3/envs/braker2_conda_env/augustus-3.4.0/scripts.
# Wed Jun 16 21:54:15 2021: Did not find environment variable $PYTHON3_PATH
# Wed Jun 16 21:54:15 2021: Trying to guess $PYTHON3_PATH from location of python3 executable that is available in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $PYTHON3_PATH to /data/selaginella/anaconda3/envs/braker2_conda_env/bin
# Wed Jun 16 21:54:15 2021: Did not find environment variable $JAVA_PATH
# Wed Jun 16 21:54:15 2021: Trying to guess $JAVA_PATH from location of java executable that is available in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $JAVA_PATH to /data/selaginella/miniconda3/bin
# Wed Jun 16 21:54:15 2021: Setting $GUSHR_PATH to command line argument --GUSHR_PATH value /data/selaginella/anaconda3/envs/braker2_conda_env/GUSHR.
# Wed Jun 16 21:54:15 2021: Setting $GENEMARK_PATH to command line argument --GENEMARK_PATH value /data/selaginella/anaconda3/envs/braker2_conda_env/gmes_linux_64.
# Wed Jun 16 21:54:15 2021: Did not find environment variable $BAMTOOLS_PATH (either variable does not exist, or the path given in variable does not exist). Will try to set this variable in a different way, later.
# Wed Jun 16 21:54:15 2021: Trying to guess $BAMTOOLS_BIN_PATH from location of bamtools executable that is available in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $BAMTOOLS_BIN_PATH to /data/selaginella/anaconda3/envs/braker2_conda_env/bin
# Wed Jun 16 21:54:15 2021: Did not find environment variable $SAMTOOLS_PATH  (either variable does not exist, or the path given in variable doesnot exist). Will try to set this variable in a different way, later.
# Wed Jun 16 21:54:15 2021: Trying to guess $SAMTOOLS_PATH from location of samtools executable in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $SAMTOOLS_PATH to /data/selaginella/miniconda3/bin
# Wed Jun 16 21:54:15 2021: Did not find environment variable $DIAMOND_PATH
# Wed Jun 16 21:54:15 2021: Trying to guess $DIAMOND_PATH from location of diamond executable that is available in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $DIAMOND_PATH to /data/selaginella/anaconda3/envs/braker2_conda_env/bin
# Wed Jun 16 21:54:15 2021: Setting $PROTHINT_PATH to command line argument --PROTHINT_PATH value /data/selaginella/anaconda3/envs/braker2_conda_env/ProtHint/bin.
# Wed Jun 16 21:54:15 2021: Did not find environment variable $CDBTOOLS_PATH
# Wed Jun 16 21:54:15 2021: Trying to guess $CDBTOOLS_PATH from location of cdbfasta executable that is available in your $PATH
# Wed Jun 16 21:54:15 2021: Setting $CDBTOOLS_PATH to /data/selaginella/anaconda3/envs/braker2_conda_env/bin
#**********************************************************************************
#                               CREATING DIRECTORY STRUCTURE
#**********************************************************************************
# Wed Jun 16 21:54:15 2021: create working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker.
mkdir /data/selaginella/DATA/annotations/braker_wallacei_test/braker
# Wed Jun 16 21:54:15 2021: creating file that contains citations for this BRAKER run at /data/selaginella/DATA/annotations/braker_wallacei_test/braker/what-to-cite.txt...
# Wed Jun 16 21:54:15 2021: create working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ETP.
mkdir /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ETP
# Wed Jun 16 21:54:15 2021: create working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES.
mkdir /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES
# Wed Jun 16 21:54:15 2021: create working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker/species
mkdir /data/selaginella/DATA/annotations/braker_wallacei_test/braker/species
# Wed Jun 16 21:54:15 2021: create working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors
mkdir /data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors
# Wed Jun 16 21:54:15 2021: changing into working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker
cd /data/selaginella/DATA/annotations/braker_wallacei_test/braker
# Wed Jun 16 21:54:15 2021: Creating parameter template files for AUGUSTUS with new_species.pl
# Wed Jun 16 21:54:15 2021: new_species.pl will create parameter files for species wallacei in /data/selaginella/anaconda3/envs/braker2_conda_env/bin/../config/species/wallacei
perl /data/selaginella/anaconda3/envs/braker2_conda_env/augustus-3.4.0/scripts/new_species.pl --species=wallacei --AUGUSTUS_CONFIG_PATH=/data/selaginella/anaconda3/envs/braker2_conda_env/bin/../config 1> /dev/null 2>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors/new_species.stderr
# Wed Jun 16 21:54:15 2021: check_fasta_headers(): Checking fasta headers of file /data/selaginella/DATA/annotations/braker_wallacei_test/a.lines.fa
#*********
# WARNING: Detected whitespace in fasta header of file /data/selaginella/DATA/annotations/braker_wallacei_test/a.lines.fa. This may later on cause problems! The pipeline will create a new file without spaces or "|" characters and a genome_header.map file to look up the old and new headers. This message will be suppressed from now on!
#*********
# Wed Jun 16 21:54:18 2021: check_fasta_headers(): Checking fasta headers of file /data/selaginella/DATA/annotations/braker_wallacei_test/protein.fa
#*********
# WARNING: Detected whitespace in fasta header of file /data/selaginella/DATA/annotations/braker_wallacei_test/protein.fa. This may later on cause problems! The pipeline will create a new file without spaces or "|" characters and a genome_header.map file to look up the old and new headers. This message will be suppressed from now on!
#*********
# Wed Jun 16 21:54:18 2021: Assuming that this is not a DNA fasta file because other characters than A, T, G, C, N, a, t, g, c, n were contained. If this is supposed to be a DNA fasta file, check the content of your file! If this is supposed to be a protein fasta file, please ignore this message!
#*********
# WARNING: in file /data/selaginella/anaconda3/envs/braker2_conda_env/BRAKER-2.1.6/scripts/braker.pl at line 1341
file /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genome.fa contains a highly fragmented assembly (210069 scaffolds). This may lead to problems when running AUGUSTUS via braker in parallelized mode. You set --cores=8. You should run braker.pl in linear mode on such genomes, though (--cores=1).
#*********
#**********************************************************************************
#                               PROCESSING HINTS
#**********************************************************************************
# Wed Jun 16 21:54:19 2021: Running ProtHint to produce hints from protein sequence file (this may take a couple of hours)...
# Wed Jun 16 21:54:19 2021: Running Genemark-ES for ProtHint...
# Wed Jun 16 21:54:19 2021: Executing GeneMark-ES
# Wed Jun 16 21:54:19 2021: changing into GeneMark-ES directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES
cd /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES
# Wed Jun 16 21:54:19 2021: Executing gmes_petap.pl
perl /data/selaginella/anaconda3/envs/braker2_conda_env/gmes_linux_64/gmes_petap.pl --verbose --cores=8 --ES --gc_donor 0.001 --sequence=/data/selaginella/DATA/annotations/braker_wallacei_test/braker/genome.fa  --soft_mask auto 1>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES.stdout 2>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors/GeneMark-ES.stderr
# Wed Jun 16 22:42:10 2021: change to working directory /data/selaginella/DATA/annotations/braker_wallacei_test/braker
cd /data/selaginella/DATA/annotations/braker_wallacei_test/braker
# Wed Jun 16 22:42:10 2021: Calling prothint.py...
# Wed Jun 16 22:42:10 2021: starting prothint.py
/data/selaginella/anaconda3/envs/braker2_conda_env/ProtHint/bin/prothint.py --threads=8 --geneMarkGtf /data/selaginella/DATA/annotations/braker_wallacei_test/braker/GeneMark-ES/genemark.gtf /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genome.fa /data/selaginella/DATA/annotations/braker_wallacei_test/braker/proteins.fa
# Wed Jun 16 23:19:07 2021: Appending hints from /data/selaginella/DATA/annotations/braker_wallacei_test/braker/prothint_augustus.gff to /data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.gff
# Wed Jun 16 23:19:07 2021: Generating hints with ProtHint finished.
# Wed Jun 16 23:19:07 2021: Converting bam files to hints
# Wed Jun 16 23:19:07 2021: Checking bam headers
# Wed Jun 16 23:19:07 2021: create header file /data/selaginella/DATA/annotations/braker_wallacei_test/braker/wallacei_header.sam
/data/selaginella/anaconda3/envs/braker2_conda_env/bin/bamtools header -in /data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam > /data/selaginella/DATA/annotations/braker_wallacei_test/braker/wallacei_header.sam
# Wed Jun 16 23:19:09 2021: Deleting SAM header file /data/selaginella/DATA/annotations/braker_wallacei_test/braker/wallacei_header.sam (will not be needed from here on)
# Wed Jun 16 23:19:09 2021: Deleting /data/selaginella/DATA/annotations/braker_wallacei_test/braker/wallacei_new_header.sam
# Wed Jun 16 23:19:09 2021: make hints from BAM file /data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam
/data/selaginella/anaconda3/envs/braker2_conda_env/bin/../config/../bin/bam2hints --intronsonly --in=/data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam --out=/data/selaginella/DATA/annotations/braker_wallacei_test/braker/bam2hints.temp.0.gff 1> /data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors/bam2hints.0.stdout 2>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors/bam2hints.0.stderr
# Wed Jun 16 23:19:30 2021: add hints from BAM file /data/selaginella/DATA/annotations/braker_wallacei_test/wallacei.sorted.bam to hints file
cat /data/selaginella/DATA/annotations/braker_wallacei_test/braker/bam2hints.temp.0.gff >>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.temp.gff
# Wed Jun 16 23:19:30 2021: Checking for hints of src=C and with grp tags that should not be joined according to multiplicity
mv /data/selaginella/DATA/annotations/braker_wallacei_test/braker/tmp_no_merge_hints.gff /data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.temp.gff
# Wed Jun 16 23:19:30 2021: filter introns, find strand and change score to 'mult' entry
perl /data/selaginella/anaconda3/envs/braker2_conda_env/BRAKER-2.1.6/scripts/filterIntronsFindStrand.pl /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genome.fa /data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.temp.gff --score 1>>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.gff 2>/data/selaginella/DATA/annotations/braker_wallacei_test/braker/errors/filterIntronsFindStrand.stderr
# Wed Jun 16 23:19:31 2021: rm /data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.temp.gff
# Wed Jun 16 23:19:31 2021: Preparing hints for running GeneMark
# Wed Jun 16 23:19:31 2021: Filtering intron hints for GeneMark from /data/selaginella/DATA/annotations/braker_wallacei_test/braker/hintsfile.gff...
cat /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genemark_hintsfile.gff.prot | sort -n -k 4,4 | sort -s -n -k 5,5 | sort -s -k 3,3 | sort -s -k 1,1 | /data/selaginella/anaconda3/envs/braker2_conda_env/augustus-3.4.0/scripts/join_mult_hints.pl > /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genemark_hintsfile.gff.prot.tmp
cat /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genemark_hintsfile.gff.rnaseq.tmp /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genemark_hintsfile.gff.prot.tmp > /data/selaginella/DATA/annotations/braker_wallacei_test/braker/genemark_hintsfile.gff

Thank you for your help.

annotation • 943 views
ADD COMMENT

Login before adding your answer.

Traffic: 1869 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6