Split fastq.gz file
1
0
Entering edit mode
18 months ago
sandy • 0

Hello everyone,

I download a single fastq.gz file from a study posted on GEO, and this is a paired read single cell sequencing. Somehow this study compressed R1 and R2 file into one fastq.gz file, I used zcat to check the reads and they are really long, like 98 characters. I want to split them to a R1_fastq.gz and R2_fastq.gz files. Please give me some advice on what should I do.

fastq • 1.0k views
ADD COMMENT
0
Entering edit mode

Can you post the accession number so we can examine the record? From where and how did you download the file?

ADD REPLY
0
Entering edit mode

Hi, thanks for your replay. I download this study: GSE138669 from GEO. I used download link generated by SRA explorer using ascp download the fastq.gz.

ADD REPLY
0
Entering edit mode

Please show a head.

ADD REPLY
0
Entering edit mode

sure!

This is:

zcat SRR10254552_GSM4115872_SC18_Homo_sapiens_RNA-Seq.fastq.gz|head -10 
@SRR10254552.1 NS500211:194:HNFHHBGXY:4:21402:13612:10917
CACCGCGAGGGCGGAGCTGCGTTGTCCTCTGCACAGATTTCGGTGGTACTCTGAAGGCGGAGCACAGTTCTCCTCAGGTCAGACCCGGGCGGGCGGGC
ADD REPLY
1
Entering edit mode
18 months ago
GenoMax 141k

Looks like this is a 10x dataset which can be a hit-or-miss proposition to download intact from SRA. Your best bet is to either use the count file provided in the GEO record or use Data Access tab to download the bam format file that has original data https://trace.ncbi.nlm.nih.gov/Traces/?view=run_browser&acc=SRR10254549&display=data-access . Then use bam2fastq utility provided by 10x to reconstruct the original fastq data.

ADD COMMENT
0
Entering edit mode

Thank you!

ADD REPLY
0
Entering edit mode

Sorry to bother you again. To download bam format file, can you give me some suggestions about how to download it? Should I use wget, prefetch, or ascp?

Thank you

ADD REPLY
1
Entering edit mode
ADD REPLY
0
Entering edit mode

Thank you!

ADD REPLY

Login before adding your answer.

Traffic: 1703 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6