Question: SRA/ GEO format help
0
gravatar for Mamta
3.8 years ago by
Mamta410
United States
Mamta410 wrote:

HI all,

 

I just downloaded some SRA datasets. I got the Text file which has sample number, GEO accession (starting with GSM) and the fastq files are labelled SRR----. I donot know how I know which fastq file belong to which sample. Moreover when i look up the Geo accession-  it shows the SRR under run. And there are two run IDs for one GSM. Does that mean I have to combine them? If yes- whats the best way to do this?

 

Thanks so much!!

Mamta

rnaseq sra geo • 2.2k views
ADD COMMENTlink written 3.8 years ago by Mamta410

SRA Hierarchy: SRP - project/study, SRS - sample (one or more experiments (SRX)), SRR- runs, experiments has one or more runs. I usually use the NCBI SRA toolkit to download, and convert SRA files to FASTQ and FASTA format. Do you just want to know which sample (SRS) the run (SRR) belongs to? 

ADD REPLYlink modified 3.8 years ago • written 3.8 years ago by camachofrancine90

HI,

I did use the SRA toolkit. But the problem is the matrix or the text files does not have the SRR id which the fastq files have. So do i have to annotate it manually by visiting each sample on the SRA? Like how to know which fast files belongs with which sample ID.

Thanks,

Mamta

 

ADD REPLYlink written 3.8 years ago by Mamta410

Can you tell me how you are converting your .sra files to fastq/fasta format? When I use SRA toolkit, it keeps the SRR number the same. So, if SRR1750023.sra is converted to a fasta file the name will be SRR1750023.fa.

If you want to know the sample that a given SRR is from, you can use the command line to access metadata. In other words, I can get the SRRID and it's metadata including which sample it belongs to. To do this you need the SRP (project) number.

Let's say I want the metadata for SRP  = SRP001599. To do this you can run:

wget -O ./SRP001599_info.csv 'http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?save=efetch&db=sra&rettype=runinfo&term= SRP001599'

This will give you a csv of the metadata associated with the Project. 

 

ADD REPLYlink modified 3.8 years ago • written 3.8 years ago by camachofrancine90
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1583 users visited in the last hour