Question

get gene annotation for nucleotide sequence of the NCBI RefSeq database

0

Entering edit mode

6.8 years ago

dabid • 0

I have fasta files of different genomes of bacteria taken from the NCBI RefSeq database. I want to get the annotation of these genomes as the ones that can be shown in the genbank file format. What I mean by annotation is cds (gene start/end positions, description, and others). Anyway, I want to extract cds (nucleotide sequence) that have title/description of prophages.

gene ncbi annotation cds DNA • 2.2k views

ADD COMMENT • link 6.8 years ago by dabid • 0

0

Entering edit mode

Where did you download the fastas? Could you give an example? In general, you will find the annotation on the same folder you found the fastas.

ADD REPLY • link 6.8 years ago by h.mon 35k

0

Entering edit mode

As I mentioned, I downloaded these fasta files from NCBI (RefSeq database). I just to know that genbank files have such information. But don't know how to extract the genbank files specific to my downloaded fasta files

ADD REPLY • link 6.8 years ago by dabid • 0

score 1 · Accepted Answer · 2017-07-11

1

Entering edit mode

6.8 years ago

dabid • 0

Maybe it is not an exact answer for my question, but as a turn around, what I needed doing is downloading the bacteria genomes that I need from the NCBI RefSeq database as a genbank files. Then, it's easy to process these genbank files using Biopython library to get CDs, etc.

ADD COMMENT • link 6.8 years ago by dabid • 0