Question: understanding the output files of fasterq-dump --split-files
2.2 years ago, inbal.tzipermanl wrote:

I am using fasterq-dump to download from SRA, with --split-files to split paired-end reads. As a result I get one or two files. When I get two files, one is named *_1.fastq and the other is named *_2.fastq, *_3.fastq, or *_4.fastq. I cannot find out what these numbers mean.

The command I am using: fasterq-dump --split-files -O /media/lab/fastq ERR016705

For example: ERR016705 has 2 files (_1 and _4), while ERR015587 has 2 files (_1 and _2).

2.2 years ago, genomax (United States) wrote:

Have you looked at the headers of the fastq files? Even though the files themselves are named _1 and _4, the headers should tell you that these are the R1 and R2 files.
(Note: Illumina sequencing proceeds in the order Read 1 --> Index 1 --> Index 2 --> Read 2. Sometimes people dump the index sequences into individual files as well, and in that case the output files are named in the order File 1 --> File 2 --> File 3 --> File 4.)
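
If it helps, here is a rough Python sketch for checking the headers and read lengths in each output file. The function name summarize_fastq and the max_records limit are made up for illustration, and the code assumes plain four-line FASTQ records (header, sequence, '+', quality) with no wrapped sequence lines. Very short reads in a file may suggest index sequences rather than biological reads, but that is only a heuristic.

```python
from collections import Counter


def summarize_fastq(path, max_records=1000):
    """Return the first header line and a Counter of read lengths.

    Hypothetical helper: assumes each record is exactly four lines
    (header, sequence, '+' separator, quality string).
    """
    first_header = None
    lengths = Counter()
    records_seen = 0
    with open(path) as handle:
        while records_seen < max_records:
            header = handle.readline().rstrip("\n")
            if not header:  # end of file
                break
            sequence = handle.readline().rstrip("\n")
            handle.readline()  # skip the '+' separator line
            handle.readline()  # skip the quality line
            if first_header is None:
                first_header = header
            lengths[len(sequence)] += 1
            records_seen += 1
    return first_header, lengths
```

For the accession in the question you could call summarize_fastq("ERR016705_1.fastq") and summarize_fastq("ERR016705_4.fastq") and compare the two headers and length distributions side by side.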
