Which file contains the genome assembly?
4 months ago
Shivani • 0

I am new to this, so I can't work out which of the files at this URL contains the genome assembly. I don't have any background in genome assembly, but I want to learn. Please help.

The URL:

https://ftp.ncbi.nlm.nih.gov/genomes/genbank/vertebrate_mammalian/Acinonyx_jubatus/all_assembly_versions/GCA_001443585.1_aciJub1/

4 months ago
cfos4698 ▴ 490

All of the provided files are explained in the README.txt file (https://ftp.ncbi.nlm.nih.gov/genomes/genbank/vertebrate_mammalian/Acinonyx_jubatus/all_assembly_versions/GCA_001443585.1_aciJub1/README.txt).

The genome assembly is provided in the '*_genomic.fna.gz' file:

   *_genomic.fna.gz file
FASTA format of the genomic sequence(s) in the assembly. Repetitive
sequences in eukaryotes are masked to lower-case (see below).
The FASTA title is formatted as sequence accession.version plus
description. The genomic.fna.gz file includes all top-level sequences in
the assembly (chromosomes, plasmids, organelles, unlocalized scaffolds,
unplaced scaffolds, and any alternate loci or patch scaffolds). Scaffolds
that are part of the chromosomes are not included because they are
redundant with the chromosome sequences; sequences for these placed
scaffolds are provided under the assembly_structure directory.
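
If you want to check what is inside the file once you have downloaded it, here is a minimal Python sketch (not an official NCBI tool) that reads the gzipped FASTA and reports, for each sequence, its accession, its length, and how many bases are soft-masked (lower-case), as the README describes. The function name `summarize_fasta` is just something I made up for illustration, and it assumes a plain gzip-compressed FASTA file:

```python
import gzip


def summarize_fasta(path):
    """Return a list of (accession, length, soft_masked_bases), one per sequence.

    Assumes `path` points to a gzip-compressed FASTA file such as the
    *_genomic.fna.gz file described in the README.
    """
    records = []
    name, length, masked = None, 0, 0
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # A new title line closes the previous record, if there was one.
                if name is not None:
                    records.append((name, length, masked))
                # The title is "accession.version description"; keep the first token.
                parts = line[1:].split()
                name = parts[0] if parts else ""
                length, masked = 0, 0
            else:
                length += len(line)
                # Lower-case letters mark repeat-masked (soft-masked) bases.
                masked += sum(1 for base in line if base.islower())
    # Store the last record in the file.
    if name is not None:
        records.append((name, length, masked))
    return records
```

For this assembly the file should be named after the directory, i.e. something like `GCA_001443585.1_aciJub1_genomic.fna.gz` (I am assuming the usual NCBI naming pattern here, so check the directory listing). Calling `summarize_fasta()` on that path and taking `len()` of the result gives the number of top-level sequences in the assembly.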