6.2 years ago by
Since my last post I've started to re-code many of my bioinformatic assistant scripts, both to follow the changes on the CAZy website but also to make them even faster.
To extract protein sequences from the CAZy database please use my tools here: http://research.ahv.dk/cazy
You can specify any family, sub-family, and organism and I'll also add the opportunity to specify E.C number. It should take less than a minute for instance to extract the big GH13 family of ~13.000 sequences.
These scripts are by far superior to the Park et al server, as these does not rely on a local copy of CAZy but takes the available data directly from here. I do also have a full copy available updated every 6 months, for those who wants to set up their own database to BLAST or the like.
Happy research Alexander