Hello list, I have very large xml blast outputs (in order to use with blast2go) and I need to reduce these consequently according to a selected criterium (for instance the E-value). Did anybody write a script for it ? If not, I would share mine when it will be done !!
EDIT : Thank you for the answers. I'm not used in xml file handling, and I accept your proposal Egon ! So here is a subset of my blast xml output: EDIT 2: I removed the example: too big, and combined with the new "markup formatting" it makes the reading of the question very annoying.
It's big but I had to show you several situations (a multi-hsp hit for example). So I want to be able to filter this file using the hsp statistics.
Here is a detailed example of the output shrinkage I would like to conduct:
How would you do if you wanted to delete from the xml the path (including the markups) within
</Hsp> if the HSP evalue (
<Hsp_evalue>here</Hsp>) is > 1e-20
</Hit> if all the HSP evalue is > 1e-20
-- all the HSP evalue is > 1e-20
-- OR there is the
<Iteration_message>No hits found</Iteration_message> message
Thanks again for help and advice.