NCBI Genome Download Sequence Report Indexing Sequences FNA file Tool Exist?
1
0
Entering edit mode
2.3 years ago

I'm looking to find an optimal global alignment for a coral genome to other mRNA sequences in other organisms. I pressed the download button to get the genome sequences(FASTA), Sequence Report(JSONL), and Assembly data report (JSONL). When it downloaded and I unzipped it. In data/GCA_025960835.1/sequence_report.jsonl it shows the different sequences and their information. I just want one of them but now I have to go get the file with the accession number or write a couple of functions to pull out the sequence I want. Is there a python solution to do this out the box that doesn't need much coding?

Thanks, David

FNA JSON NCBI • 823 views
ADD COMMENT
0
Entering edit mode
2.3 years ago
GenoMax 154k

While a python solution can be found this is easily achieved by using faSomeRecords tool from Jim Kent (UCSC) utilities. See more here: How do I extract Fasta Sequences based on a list of IDs?

ADD COMMENT

Login before adding your answer.

Traffic: 3445 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6