Using PRINSEQ lite with zipped paired-end illumina reads
0
0
Entering edit mode
7.2 years ago
Medhat 9.0k

I have a directory contains 42 paired-end read Illumina they are zipped,

for example;

s245-11kb_1_R1.fq.gz     s245-11kb_1_R2.fq.gz 

Is there a way to filter and trim them using prinseq-lite, all in the same time and also zipped?!

from the manual I can see that:

fastq

Input file in FASTQ format that contains the sequence and quality data. Use stdin instead of a file name to read from STDIN (-fasta stdin). This can be useful to process compressed files using Unix pipes

fastq2

For paired-end data only. Input file in FASTQ format that contains the sequence and quality data. The sequence identifiers for two matching paired-end sequences in separate files can be marked by /1 and /2, or _L and _R, or _left and _right, or must have the exact same identifier in both input files. The input sequences must be sorted by their sequence identifiers. Singletons are allowed in the input files.

it did not talk about the zipped version in fastq2 or how to use multiple files 

I am using stand alone version 

 

 

sequencing next-gen quality reads Illumina prinseq • 3.7k views
ADD COMMENT
1
Entering edit mode

Can you use process substitution? ./prinseq <your parameters> <(gzip -d -c s245-11kb_1_R2.fq.gz)? This will unzip file on the fly and write it out to stdin.

ADD REPLY
0
Entering edit mode

That is ok but it is a directory contain 42 file not only the one I write 

ADD REPLY

Login before adding your answer.

Traffic: 2487 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6