You have the FTP site of the NCBI where all databases are available (Url, if the link does not work : ftp://ftp.ncbi.nlm.nih.gov/blast/db/).
Then, in the README, you can find all descriptions of these databases.
nr.*tar.gz | Non-redundant protein sequences from GenPept, Swissprot, PIR, PDF, PDB, and NCBI RefSeq
Hi, is there a way to download just a file with the taxonomy information. i mean, a tab delimiter with: name_of_protein organism_source(plant, bacteria, other) i need getting the organism source, but if i take a look for nr db directly have a huge header for each protein and dont exist any pattern a priori to getting that.