24 days ago

I am trying to download some Placozoan RNA-seq reads from SRA (SRR8193747, SRR8193748, SRR8193749) with fastq-dump so that I can assemble them:

 nohup fastq-dump --split-3 SRR8193748 SRR8193747 SRR8193749 &


According to SRA, the reads are paired, so I expected to get *_1.fastq and *_2.fastq files (and maybe some unpaired reads as well), as usual. In this case, however, I am getting only one file per run: SRR8193747.fastq, SRR8193748.fastq, and SRR8193749.fastq.

The output from fastq-dump says:

Rejected 5037041 READS because of filtering out non-biological READS
Written 5037041 spots for SRR8193748
Written 5157059 spots for SRR8193747
Written 8889366 spots for SRR8193749


Is there something wrong with the SRA files or with my setup?

sra fastq-dump rna-seq • 215 views
24 days ago

Weird indeed. The metadata for one of the runs shows a length of 0 for read 2, which could explain the behavior you're seeing, but not why it is like that: https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR8193748

One thing I could think of is that they actually had single-end data but submitted it as paired-end?
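
The log you posted may already hint at this: for SRR8193748 the number of rejected reads equals the number of spots written, which would fit one empty read per spot. Here is a rough sketch for lining those numbers up from a saved log. The function name `summarise_dump_log` is made up, and it assumes the log lines look exactly like the ones in your post and that a "Rejected" line precedes the "Written" line of the run it belongs to.

```shell
# Hypothetical helper: summarise a saved fastq-dump log (e.g. nohup.out).
# Assumes "Rejected N READS ..." appears before the matching "Written N spots for ACC" line.
# Prints one line per run: accession <TAB> spots written <TAB> reads rejected.
summarise_dump_log() {
    awk '
        /^Rejected [0-9]+ READS/ { rejected = $2 }   # remember the latest rejected count
        /^Written [0-9]+ spots for / {
            printf "%s\t%d\t%d\n", $5, $2, rejected  # $5 = accession, $2 = spots
            rejected = 0                             # reset for the next run
        }
    ' "$1"
}
```

Something like `summarise_dump_log nohup.out` would then show, per run, whether the rejected count matches the spot count.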


Yes, that may be it. I found the publication, and the methods say: "... for the haplotype H4 RNA libraries 32 – 37 million single 150 bp reads were obtained." I guess that means single-end.
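
One way to cross-check this against the downloaded files would be to tally the read lengths: if the data are single-end 150 bp, I would expect lengths of about 150 rather than something like 300. A minimal sketch, assuming plain four-line FASTQ records (no wrapped sequence lines); the function name `fastq_length_counts` is made up for illustration.

```shell
# Hypothetical helper: tally read lengths in a four-line-per-record FASTQ file.
# Prints "length <TAB> count", sorted by length.
fastq_length_counts() {
    awk '
        NR % 4 == 2 { n[length($0)]++ }                    # 2nd line of each record is the sequence
        END { for (l in n) printf "%d\t%d\n", l, n[l] }
    ' "$1" | sort -n
}
```

For example: `fastq_length_counts SRR8193748.fastq`.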


It does in my book ;)