Finding the paper published from the SRA run files
4 months ago

Hey folks, I used this command to download the results of my query:

esearch -db sra -query '("BACTERIA_NAME"[Organism] OR BACTERIA_NAME[All Fields]) AND "BACTERIA_NAME"[orgn] AND ("strategy wgs"[Properties] AND "library layout paired"[Properties] AND "filetype fastq"[Properties])' | efetch -format runinfo -mode text > first_file.tsv

Now I have a bunch of SRA runs, but I would like to find their respective published papers. How do I do it?

best ~scholar aldre

SRA • 570 views

The link between an SRA record and PubMed is not always present. The onus is probably on the submitters to indicate when an accession has been published. PubMed may do some linking back, but it does not seem to be foolproof.


I think your command is not really returning anything?

esearch -db sra -query '("BACTERIA_NAME"[Organism] OR BACTERIA_NAME[All Fields]) AND "BACTERIA_NAME"[orgn] AND ("strategy wgs"[Properties] AND "library layout paired"[Properties] AND "filetype fastq"[Properties])' 
<ENTREZ_DIRECT>
  <Db>sra</Db>
  <WebEnv>MCID_65818b6aa1b3ae3e6450690c</WebEnv>
  <QueryKey>1</QueryKey>
  <Count>0</Count>
  <Step>1</Step>
</ENTREZ_DIRECT>
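
The `<Count>` element in that output is a quick way to catch an empty search before piping it into `efetch`. Below is a minimal sketch of such a check using only `sed`; the function names `esearch_count` and `has_hits` are made up for illustration, and the query in the usage comment is just an example.

```shell
# Read esearch output on stdin and print the value of its <Count> element.
esearch_count() {
  sed -n 's:.*<Count>\([0-9][0-9]*\)</Count>.*:\1:p'
}

# Succeed only when the search on stdin matched at least one record.
has_hits() {
  n=$(esearch_count)
  [ -n "$n" ] && [ "$n" -gt 0 ]
}

# Usage (example query; needs EDirect and network access):
#   esearch -db sra -query '"Staphylococcus aureus"[orgn]' | has_hits \
#     || echo "query matched no records" >&2
```

With the output shown above, `esearch_count` would print 0 and `has_hits` would fail, which matches the suspicion that the placeholder query returns nothing.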

I used "BACTERIA_NAME" as a placeholder to keep the question general enough for people from different backgrounds, but that only created more confusion, sorry. The actual command is:

esearch -db sra -query '("Staphylococcus aureus"[Organism] OR Staphylococcus aureus[All Fields]) AND "Staphylococcus aureus"[orgn] AND ("strategy wgs"[Properties] AND "library layout paired"[Properties] AND "filetype fastq"[Properties])' | efetch -format runinfo -mode text > first_file.tsv

4 months ago

The PubMed IDs are empty because the submitters never updated them.

4 months ago
GenoMax 141k

With EntrezDirect, this works when the link information is present. With new SRA accessions, the data are generally likely to be unpublished when they first become publicly available.

Here is an older example that works.

$ esearch -db sra -query SRR14673325 | elink -target pubmed | efetch -format docsum | xtract -pattern DocumentSummary -element FullJournalName,ELocationID
Nature communications   doi: 10.1038/s41467-022-35713-4
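
To apply the same lookup to every run in the file from the question, something like the following sketch might work. It assumes EDirect is on the PATH and that the runinfo file is comma-separated with the run accession in the first column under a `Run` header (which is what `efetch -format runinfo` writes, despite the `.tsv` name). The function names `runs_from_runinfo` and `paper_for_run` and the output file `papers.tsv` are made up for illustration.

```shell
# Print the unique run accessions from a runinfo file, skipping header
# rows (in case the header is repeated) and blank lines.
runs_from_runinfo() {
  awk -F, '$1 != "Run" && $1 != "" && !seen[$1]++ { print $1 }' "$1"
}

# For one run accession, print "run<TAB>journal<TAB>doi" for each linked
# PubMed record, or "run<TAB>NA" when no link is found.
# Requires EDirect and network access.
paper_for_run() {
  run_acc=$1
  # </dev/null keeps esearch from consuming the caller's stdin inside a loop.
  hits=$(esearch -db sra -query "$run_acc" </dev/null \
    | elink -target pubmed \
    | efetch -format docsum \
    | xtract -pattern DocumentSummary -element FullJournalName,ELocationID)
  if [ -n "$hits" ]; then
    printf '%s\n' "$hits" | while IFS= read -r line; do
      printf '%s\t%s\n' "$run_acc" "$line"
    done
  else
    printf '%s\tNA\n' "$run_acc"
  fi
}

# Usage:
#   runs_from_runinfo first_file.tsv | while IFS= read -r run; do
#     paper_for_run "$run"
#   done > papers.tsv
```

Expect many NA rows, since the link only exists when the submitters or PubMed recorded it. For large files it is probably worth setting an NCBI API key and pausing between requests to stay within the E-utilities rate limits.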