Question

NCBI Genome Download Sequence Report Indexing Sequences FNA file Tool Exist?

0

Entering edit mode

2.3 years ago

davidawarshawsky • 0

I'm looking to find an optimal global alignment for a coral genome to other mRNA sequences in other organisms. I pressed the download button to get the genome sequences(FASTA), Sequence Report(JSONL), and Assembly data report (JSONL). When it downloaded and I unzipped it. In data/GCA_025960835.1/sequence_report.jsonl it shows the different sequences and their information. I just want one of them but now I have to go get the file with the accession number or write a couple of functions to pull out the sequence I want. Is there a python solution to do this out the box that doesn't need much coding?

Thanks, David

FNA JSON NCBI • 823 views

ADD COMMENT • link updated 2.3 years ago by Ram 45k • written 2.3 years ago by davidawarshawsky • 0

score 0 · Answer 1 · 2023-06-14

0

Entering edit mode

2.3 years ago

GenoMax 154k

While a python solution can be found this is easily achieved by using faSomeRecords tool from Jim Kent (UCSC) utilities. See more here: How do I extract Fasta Sequences based on a list of IDs?

ADD COMMENT • link 2.3 years ago by GenoMax 154k