Hello, I'm using the DIAMOND (Link) program to align my protein sequences against the CAZY database (Link, also protein). The question is: what identity and p-value should I apply to have a good alignment? I am afraid to put high values and discard results that would be satisfactory. As well as, I'm afraid to put low values and get incorrect results.
My set of protein sequences are derived from Shotgun sequencing and have a minimum size of 21 amino acids and maximum of 96 amino acids.
Thank you in advance!