Entering edit mode
5.0 years ago
qstefano
▴
20
Hi biostars,
align function by Rsubread package allows to set parameters according to reads layout (single or paired), since i download fastq files from SRA, if the layout is paired, i obtain a single fastq files containing both pairs.
The question is: if i don't specify if fastq file is paired or not, align would detect "automatically" this and consequently align reads correctly? or align the reads as if it were a single layout?
any reply will be appreciated, thanks.
You can easily split the files using
reformat.shfrom BBMap suite.You could have also used
--split-filesoption withfastq-dumpto get pair of files.@genomax just as an aside, would
reformat.shproduce an emptyR2.fqif the input is unpaired reads?Yes. If inputs can't be recognized by
/1 or /2(old Illumina data) or1:N: or 2:N:(current format) at the end of read headers.That's clear, thanks!
That is for interleaved files. Be sure that fastq-dump did not simply concatenate the mates into a single long read which it sometimes does. Confirm that you have one entry per mate otherwise alignment is going to be a mess.