How to create own SRAdb
1
1
Entering edit mode
4.8 years ago
Deepak Tanwar ★ 4.2k

Hello,

As SRAdb is outdated, I would like to create my own SRAdb. What is the best way to do that? I am aware of the files I would need to make one: ftp://ftp.ncbi.nlm.nih.gov/sra/reports/

But, I am not an expert in SQL and don't know how to deal with XML files.

SRAdb SRA GEO • 2.2k views
ADD COMMENT
0
Entering edit mode

Contact Sean Davis and try to work with his team. No point in developing a new one? - waste of time and research funds (?)

ADD REPLY
1
Entering edit mode
ADD REPLY
0
Entering edit mode

ADD REPLY
0
Entering edit mode

As far as I understood, Sean Davis seems to be working on a replacement of SRAdb. As I have already set up the tool to work with SRAdb, I would like to have an updated one.

ADD REPLY
0
Entering edit mode

There are functions in R for dealing with both SQL and XML. The ideal situation would be to pull the data in real time from SRA, no? This is how GEOquery (also by Sean Davis) functions.

ADD REPLY
0
Entering edit mode

The ideal situation would be to pull the data in real time from SRA: Yes!

I was looking for the code which Sean Davis and his team wrote to make this database. But, couldn't find it (that would have been an easy solution)...

ADD REPLY
1
Entering edit mode

They have direct access to SRA, you might just ask them to update the file. I've sent Sean Davis a tweet to ask for his input here.

ADD REPLY
0
Entering edit mode

Unfortunately, I do not have direct access to SRA. I mine the data from multiple publicly available locations at NCBI in what is a non-trivial exercise.

ADD REPLY
1
Entering edit mode

If SRAdb is no longer being maintained can a note be added to the GitHub/Bioconductor page that effect? There is currently no indication there.

ADD REPLY
0
Entering edit mode

That's incredibly annoying, I had presumed that since you were at NIH you'd have more direct access.

ADD REPLY
0
Entering edit mode

Bold text and ALL CAPS can come across as rude in online communication. For what it's worth, it definitely helps to learn SQL (and even XML). It doesn't take too long and a learning opportunity seems to have presented itself to you!

ADD REPLY
0
Entering edit mode

Sorry, I updated mine comment!

ADD REPLY
3
Entering edit mode
4.8 years ago

Due to growing size of SRA metadata, we have been transitioning to a new approach for building, updating, and distributing the SRA metadata. As a first step, the data are available as public resource on Google BigQuery.

https://seandavi.github.io/2019/06/omicidx-on-bigquery/

Access BigQuery via any language, basically, via the bq command line, or via the cloud console. BigQuery requires a Google Cloud Platform account, but the free credits will cover thousands of queries.

If you are simply interested in searching SRA programmatically, you can try using NCBI Eutils. There is documentation on NCBI.

ADD COMMENT
1
Entering edit mode

There are some existing projects that are dependent on SRAdb being the in same format, e.g.:

https://github.com/deweylab/MetaSRA-pipeline

Is there a way to regenerate the SRAdb locally so that the code/data for MetaSRA can be updated, or is the bigquerty the only way forward?

Cheers

ADD REPLY

Login before adding your answer.

Traffic: 2583 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6