How do I download entire human genome for local blast formatting and searching?
1
0
Entering edit mode
7.0 years ago
b10hazard • 0

I'm trying to make a copy of the entire human genome for local blast queries on my machine. I understand that I need to download it from the NCBI FTP server here...

ftp://ftp.ncbi.nih.gov/genomes/Homo_sapiens

After downloading it I need to format it into a database that blast can search. I do this using NCBI's makeformatdb command. It is run like this...

makeblastdb -in entire_human_genome.fasta -dbtype nucl -title human_genome -out human_genome.db

My question is... Where do I get the fasta file containing the entire human genome? Do I download the fasta files for all 22 chromosomes, the X chromosome, and then the Y chromosome and concatenate them? Is there a fasta file on the site containing the entire genome? A fasta file that is not partitioned by chromosome?

blast ncbi • 3.4k views
ADD COMMENT
2
Entering edit mode

Do I download the fasta files for all 22 chromosomes, the X chromosome, and then the Y chromosome and concatenate them?

yes

see also Where Can I Download Human Reference Genome In Fasta Format? Hgref.Fa File

ADD REPLY
1
Entering edit mode
7.0 years ago
GenoMax 141k

You can get the individual chromosome files from the FTP link you posted above (*ref*fa* files) and then cat the files together or get the entire genome in a single file from UCSC here.

ADD COMMENT

Login before adding your answer.

Traffic: 3349 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6