How to pull metadata from SRA based on Run id?
18 months ago
Sidra • 0

I have a lot of SRR IDs. I am currently using this command:

esearch \
  -db sra \
  -query SRR1751234 | \
efetch \
  -format runinfo \
  -mode xml | \
xtract \
  -pattern SraRunInfo \
  -element BioProject BioSample Model

The output looks like this:

PRJNA186035 SAMN02345261 Illumina MiSeq

But I also need the collection date and country. Can anyone help me? I need a command line that can read my file of SRR IDs and give a summary of those attributes.

SRA metadata • 528 views

Sidra, you have already been warned not to open multiple questions that ask for essentially the same information. You should also not delete prior posts that have received comments or answers. Deleting old posts and asking the same question again is not appropriate.

As for the information you seek, it may simply not be available for some SRA accessions. There is nothing one can do about it.

Below is all the metadata that is available for the example you posted above. You can pick out the fields you need based on the header row.

Run,Assay Type,attribute_package,AvgSpotLen,Bases,BioProject,BioSample,BioSampleModel,Bytes,Center Name,collected_by,Collection_Date,Consent,DATASTORE filetype,DATASTORE provider,DATASTORE region,Experiment,geo_loc_name_country,geo_loc_name_country_continent,geo_loc_name,IFSAC+_Category,Instrument,Isolation_Source,Lat_Lon,Library Name,LibraryLayout,LibrarySelection,LibrarySource,ontological_term,Organism,PFGE_PrimaryEnzyme_pattern,Platform,ReleaseDate,Sample Name,Serovar,source_type,SRA Study,STRAIN,Subspecies
SRR1751234,WGS,environmental/food/other,459,836057844,PRJNA186035,SAMN02345261,Pathogen.env,562746707,CFSAN,FNE,2012-06-26,public,"wgmlst_sig,realign,sra,fastq","s3,ncbi,gs","ncbi.public,s3.us-east-1,gs.US",SRX837275,Taiwan,Asia,Taiwan,fish,Illumina MiSeq,frozen whole tilapia,missing,Salmonella enterica str. FNE0019 shotgun,PAIRED,RANDOM,GENOMIC,fish (frozen):FOODON_03301083|tilapia:FOODON_03412434|whole:FOODON_03430131,Salmonella enterica subsp. enterica,JKXX01.0989,ILLUMINA,2015-01-09T00:00:00Z,FNE0019,Paratyphi B var. L(+) tartrate + (Java),food,SRP018785,FNE0019,enterica
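Picking fields by header name can be sketched in shell roughly as follows. This is only a sketch under assumptions: the table above has been saved as a comma-separated file (`metadata.csv` is a hypothetical file name), and `pick_columns` is an illustrative helper, not part of EDirect or any NCBI tool. It uses plain awk and copes with quoted fields that contain commas, such as the DATASTORE columns, but not with doubled quotes inside a field.

```shell
# pick_columns: hypothetical helper, not part of any NCBI toolkit.
# Reads CSV on stdin (first line = header) and prints the requested
# columns, tab-separated, in the order given. Unknown columns print "NA".
pick_columns() {
  awk -v want="$1" '
  # Split one CSV line into out[], honouring double-quoted fields.
  function split_csv(text, out,    n, i, c, inq, cur) {
    n = 0; cur = ""; inq = 0
    for (i = 1; i <= length(text); i++) {
      c = substr(text, i, 1)
      if (c == "\"") inq = !inq                  # toggle quoted state
      else if (c == "," && !inq) { out[++n] = cur; cur = "" }
      else cur = cur c
    }
    out[++n] = cur
    return n
  }
  NR == 1 {
    # Map each requested name to its position in the header.
    n = split_csv($0, hdr)
    m = split(want, names, ",")
    for (j = 1; j <= m; j++) {
      idx[j] = 0
      for (i = 1; i <= n; i++) if (hdr[i] == names[j]) idx[j] = i
    }
    next
  }
  {
    n = split_csv($0, f)
    row = ""
    for (j = 1; j <= m; j++) {
      val = (idx[j] > 0 && idx[j] <= n) ? f[idx[j]] : "NA"
      row = row (j > 1 ? "\t" : "") val
    }
    print row
  }'
}

# Example call (metadata.csv is an assumed file name):
#   pick_columns Run,Collection_Date,geo_loc_name_country < metadata.csv
```

Column names must match the header exactly, including case. Accessions that lack a field in the header will simply show `NA` or an empty value.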

I wish there were a way for moderators to merge duplicates retaining relevant comments. This user has opened 3 posts for the same questions and all of them have received input of value from other users.
