I am trying the HiCUP pipeline (Babraham, Cambridge, UK). It works very well for all enzymes with sites that have the usual A, G, C and T bases in the recognition site, but fails for an enzyme like ApoI at the genome digestion stage. The recognition sequence for ApoI is R^AATTY and when I use the standard command:
perl hicup_digester mm9_merge.fa -re1 R^AATTY,ApoI *.fa
I get the following error message in the terminal:
Restriction enzyme: 'R^AATTY' should only contain the characters: 'A','G','C','T' or '^' Please change configuration file and/or command-line parameters and/or installation accordingly
This is independent of which version is used. I am using hicup version v0.5.9 but the same error appears in the latest version. BTW, I am using Ubuntu 14.04 operating system on Dell Optiplex 7050.
A solution to this problem would be highly appreciated since the manual for the HiCUP pipeline gives no information when enzymes like ApoI are used, having 'R' & 'Y' in the recognition sequence. I am unfamiliar with the Perl scripting so modifying the hicup_digester perl script to have it work for genome digestion with ApoI is not likely.
Depends on the definiton of 'smarter'. In my experience, the smartest solution is the one that produces the intended result, while consuming as little efford and time as possible, so that you can focus on your actual analysis, but you are of course free to decide what you do ;-D