Question: Files ending with .fna.gz, .gbff.gz, .gff.gz, .faa.gz & gpff.gz
1
gravatar for Naresh
3.6 years ago by
Naresh60
Korea, Republic Of
Naresh60 wrote:

Hi, What you mean by

  1. genomic .fna.gz
  2. genomic .gbff.gz
  3. genomic .gff.gz
  4. protein .faa.gz - protein sequences
  5. protein .gpff.gz

I tried to know for others, but i could not get any results.

Please guide me

Thanks Naresh

sequence assembly • 12k views
ADD COMMENTlink modified 3.6 years ago by Dr. Mabuse47k • written 3.6 years ago by Naresh60
5
gravatar for Dr. Mabuse
3.6 years ago by
Dr. Mabuse47k
Bergen, Norway
Dr. Mabuse47k wrote:

All files are text files, compressed using the linux/unix program gzip, use gunzip, to extract, zcat to write the content without saving it to a file.

The following are conventions, which a lot of people, not all, follow:

  • fna = FastA format file containing Nucleotide sequence (DNA)
  • gbff = Genbank Genome file containing genome sequence and annotation
  • gff = general feature format containing genomic regions, the "genes, transcripts, etc"
  • faa = FastA format file containing Amino-acid sequence (Protein, peptide)
  • gpff = Genbank Protein file containing protein sequence and annotation

See https://www.ncbi.nlm.nih.gov/genome/doc/ftpfaq/ for more explanation.

ADD COMMENTlink written 3.6 years ago by Dr. Mabuse47k

Thank you for your wonderful explanation. Thanks alot.

ADD REPLYlink written 3.6 years ago by Naresh60
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1449 users visited in the last hour