How many FASTQ files does one SRA Run correspond to?
1
0
Entering edit mode
2.1 years ago
hamarillo ▴ 80

Hi, I was wondering if anyone can confirm this:

one Sequence Read Archive Run ID always corresponds to one single-end FASTQ file or one pair of paired-end FASTQ files.

All four SRA Run IDs SRR16918933, SRR16918934, SRR16918935, SRR16918936 have as GEO_Accession (exp) GSM5686879

In other cases, there's only one SRA Run ID for a GEO Accession and it's usually a pair of paired-end FASTQ files or one single-end FASTQ file.

So of course, I'm thinking the reason the Run ID record exists is exactly to mark one sequencing run (lol), but does anyone know if there is any case in which this would result in more than one single-end FASTQ file or one pair of paired-end FASTQ files???

Thanks!

fastq sra • 786 views
ADD COMMENT
0
Entering edit mode
2.1 years ago
GenoMax 154k

GEO accession GSM5686879 is part of a series that contains 14 samples: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE188617

Each of those samples appears to be submitted as duplicates or triplicates (separate submission) : https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA779703&o=acc_s%3Aa

GSM5686879 appears to have been submitted as 4 separate submissions (possibly technical replicates of sequencing, please confirm).

Each submission should have a unique SRA accession # that you can see in the table above. This is paired end data and the index appears to have been submitted as a third file (I1) for each sample, which is the third file you are probably referring to.

ADD COMMENT

Login before adding your answer.

Traffic: 2417 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6