Entering edit mode
4.9 years ago
sidrah.maryam
▴
70
Hello everyone, I have single end sequence fastq files that were processed using cellranger. Now I need to align them using cellranger count function. But being single end, the naming convention followed could not be standard. Is there a way to do so? I tried using SRR9320581_S1_L001_R1_001.fastq.gz file convention, but it gave pipestance failed error.
Any help is highly appreciated. Thank you
cellranger countneeds files in a special format. There are three separate files (if you have > 1 sample).R1file contains the cell barcodes and UMI,I1file contains Illumina index for the sample,R2file contains actual reads. It appears that these investigators may have submitted these fastq files in a concatenated format. Try to see if you can separate the reads using--split-filesoption forfastq-dump.okay. I will try that. Thanks a lot.
I have the same issue, did the split-files solved your problem ?
Just my two cents here. In my case
cellranger countwas failing becauseR1was in the wrong format. I also had single-end reads, whereR1was supposed to be the cell barcode and UMI, andR2should be the single-end reads. In my case I had to reruncellranger mkfastqfrom the raw BCLs. Note that theI1file mentioned is optional forcellranger countso don't worry if you don't have it.In my experience the
--split-filesoption forfastq-dumpis only relevant if you're downloading data from SRA, not if you have your raw data in-house.Please read the question - OP has SRR identifiers, hence their data is from SRA. That is why GenoMax mentioned the
--split-files.