Entering edit mode
8.7 years ago
shinken123
▴
150
Hi
I am working with an old sanger reads, and I have the fasta and qual files separated. I will like to remove some sequences from the qual file using a list of names of those sequences but I am having problems with this. Do you have a solution for this?
Thank you very much
Thanks for your answer. I want to remove sequences that are in my qual files but are not in my fasta files. I have the list of names of these sequences, and now I want to use it to remove that sequences for the qual files.
An example of my sequences are:
From my fasta file:
from my qual file:
But of course, these are sequences that are in both files. Another problem that I have is that some names of the sequences in the qual file differ a little from the sequences of the fasta. However now I have the list of the names of the sequences that are in the qual file but not in the fasta file and I will like to use them to remove that data form the qual file. Any idea?
How did you end up with sequences in qual files but not fasta files? I suspect the simplest and safest approach here would be to start with the raw files and redo whatever processing step led them to diverge. But barring that, I imagine that even as we speak someone is writing an awk script that will work on qual files alone...