Question: building database in snpEff
0
gravatar for reza
2.2 years ago by
reza210
Iran
reza210 wrote:

i created database for dromedary camel in snpEff as following:

  • i downloded referense file (fasta) and gff3 file from NCBI

  • converting gff3 to gtf using gffread

i followed database building Instructions (http://snpeff.sourceforge.net/SnpEff_manual.html#databases).

i ran "vi snpEff.config" and added following lines:

C.dromedary genome, version DD1.14

DD1.14.genome : C.dromedary

i renamed my fasta file to "DD1.14.fasta" and gtf file to "genes.gtf" then put them in snpEff/data/genomes and snpEff/data/DD1.14, respectively. finally i ran "java -jar snpEff.jar build gtf22 -v DD1.14". for cheking added database ran "java -jar snpEff.jar databases | grep -i DD1.14" command and i get following line:

DD1.14 C.dromedary http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_DD1.14.zip

Now when i want to annotate my vcf file using builded database i get error.

java -Xmx16G -jar snpEff.jar eff -v DD1.14 SNPs.vcf > snpeffout.vcf

00:00:00 Reading config file: /home/dr_hedayati/Downloads/snpEff/snpEff/snpEff.config

00:00:00 done

00:00:00 Reading database for genome version 'DD1.14' from file '/home/dr_hedayati/Downloads/snpEff/snpEff/./data/DD1.14/snpEffectPredictor.bin' (this might take a while)

00:00:00 Database not installed

Attempting to download and install database 'DD1.14'

00:00:00 Reading configuration file 'snpEff.config'. Genome: 'DD1.14'

00:00:00 Reading config file: /home/dr_hedayati/Downloads/snpEff/snpEff/snpEff.config

00:00:00 done

00:00:00 Downloading database for 'DD1.14'

00:00:00 Connecting to http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_DD1.14.zip

00:00:01 ERROR while connecting to http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_DD1.14.zip

java.lang.RuntimeException: java.lang.RuntimeException: File not found on the server. Make sure the database name is correct.

at org.snpeff.util.Download.download(Download.java:178)

.

.

.

00:00:01 Logging

00:00:03 Checking for updates...

anybody know what is my wrong? why database not installed?

snp snpeff software error • 2.2k views
ADD COMMENTlink modified 2.2 years ago by Biostar ♦♦ 20 • written 2.2 years ago by reza210

Hello,

Did you determine what the problem was? I have received the same error when trying to use my own database.

Thank you

ADD REPLYlink written 22 months ago by HannahT0

Have you built your own database? check it using below command.

"java -jar snpEff.jar databases | grep -i your_database_name

ADD REPLYlink written 22 months ago by reza210

It appears that I have not! But now I am at the point that I get the following script to run: java -jar snpEff.jar build -gff3 -v Sorghum

And it appears to work, without major error messages, but I never get an output snpEffectPredictor.bin to work with downstream. The problems I was having earlier were related to directory names and the config file entry, but now I can see that the 'build' command in finding the directory and the genes.gff file.

But I am missing something about why this is not running. Am I not completing the build?

Thank you for replying

ADD REPLYlink written 22 months ago by HannahT0

it is not clear to me what you did exactly. convert your gff file to gtf and then try it again. check your reference and annotation files directories as mentioned in snpeff manual.

ADD REPLYlink written 22 months ago by reza210
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1508 users visited in the last hour