Question: How can i get a merge fastq file from sratoolkit like the file i directly download from ncbi?
gravatar for Little.G
15 months ago by
Little.G0 wrote:

Excuse me, everybody. I am a new one here and also this field. I need help a lot .T-T

Parts of my story... I downloaded SRR__.sra file from ncbi using sra toolkit. i used fastq-dump to convert SRR__.sra to fastq file and splitted file into 3 files : 1.SRR____1.fastq 2.SRR____2.fastq 3.SRR___.fastq which is the combined version of 1+2

My question are.... 1) If i don't misunderstand, SRR___1.fastq and SRR___2.fastq are from the paired-end sequencing,right?

2)How can i merge the two file together? i want a single file to do the downstream process like mapping etc.

3)How can i get a file as a directly downloading file from ncbi website using sratoolkit? Was the directly downloading file already merged? (In the file, every sequence has optional description after seq name)

Please give me some suggestions. Thank you you guys in advance!

rna-seq • 822 views
ADD COMMENTlink modified 15 months ago by mmfansler320 • written 15 months ago by Little.G0
gravatar for h.mon
15 months ago by
h.mon28k wrote:

Download the fastq files directly from ENA, it is simpler, less error-prone and much faster then fastq-dump.

You don't need to merge the forward and reverse reads, all mapping programs can take the two files separately.

ADD COMMENTlink modified 15 months ago • written 15 months ago by h.mon28k

Thank you so much. It's really faster !!

Finally,If i want to analyze for DEGs, is there a step should i combine the two reads together? I used to practice on a single read (in my class). I compared gene expression in rice planting in two conditions(triplicate each).

ADD REPLYlink written 15 months ago by Little.G0

I will say it again: no, you don't need to merge / combine forward and reverse reads (or R1 and R2), mapping programs (such as STAR, HISAT2 or Subread) take both R1 and R2 files as arguments, and take into account the paired nature of the reads.

ADD REPLYlink written 15 months ago by h.mon28k
gravatar for mmfansler
15 months ago by
MSKCC | New York, NY
mmfansler320 wrote:
  1. Yes.
  2. Don't do that. Instead, use the --split-spot flag to extract an interleaved FASTQ. E.g., fastq-dump --split-spot SRR390728.sra
  3. fastq-dump will automatically download from the Sequence Read Archive if you provide the accession instead of a local file. So, in the end you want:

    fastq-dump --split-spot SRR390728

More details in the manual.

ADD COMMENTlink written 15 months ago by mmfansler320
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1292 users visited in the last hour