I want to gather CDs from NCBI or Ensembl. For some species there is NO curated RefSeq assembly, only the link to "ftp directory for GenBank assembly" There I found GBFF files AND genomic.fna files. If I only want CDs like I usually recover in RefSeq links, which one is the correct one?
I think I can convert GBFF files into fasta using other programs. I don't know how to filter CDs from the genomic.fna file (since I think there are more than CDs in this file). Thanks for your help,