Files ending with .fna.gz, .gbff.gz, .gff.gz, .faa.gz & gpff.gz
1
1
Entering edit mode
8.4 years ago
Naresh ▴ 60

Hi, What you mean by

  1. genomic .fna.gz
  2. genomic .gbff.gz
  3. genomic .gff.gz
  4. protein .faa.gz - protein sequences
  5. protein .gpff.gz

I tried to know for others, but i could not get any results.

Please guide me

Thanks Naresh

Assembly sequence • 31k views
ADD COMMENT
7
Entering edit mode
8.4 years ago
Michael 55k

All files are text files, compressed using the linux/unix program gzip, use gunzip, to extract, zcat to write the content without saving it to a file.

The following are conventions, which a lot of people, not all, follow:

  • fna = FastA format file containing Nucleotide sequence (DNA)
  • gbff = Genbank Genome file containing genome sequence and annotation
  • gff = general feature format containing genomic regions, the "genes, transcripts, etc"
  • faa = FastA format file containing Amino-acid sequence (Protein, peptide)
  • gpff = Genbank Protein file containing protein sequence and annotation

See https://www.ncbi.nlm.nih.gov/genome/doc/ftpfaq/ for more explanation.

ADD COMMENT
0
Entering edit mode

Thank you for your wonderful explanation. Thanks alot.

ADD REPLY

Login before adding your answer.

Traffic: 1716 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6