Entering edit mode
4.5 years ago
JohnJACK
•
0
I have been assigned some coursework to write a python script that can filter out the read pairs where either one of the reads have a quality score below 30 from illumina sequencing reads in fastq format.
My task is to write a script that will output two files in the fastq format where one has reads that have a score above 30 and the other has read pairs where either one has a score below 30.
How should I go about approaching this problem? Thanks. (sorry for the bad english)
Have you looked at the biopython cookbook?
It's not completely clear where your problem is. Is it that you don't know how to do it in Python? I'd say (before one of the others does it), give us some of your ideas and someone might guide you from there. So, if it is just that the Python part is the problem, then outline the steps that you want/need to do. If it is not that, then please clarify what you're struggling with.
Isn't the point of assigning the course work to make you think how your should approach this problem?
At a very high level you could the following (one way and perhaps not the most efficient way):