I am working on a pipeline to generate optimized multiple sequence alignments (MSAs) from NCBI pulls. As I go, I am comparing how the alignment length, consensus sequence, and pairwise % identity change and improve with each step.
It has led to a question that I can't find a good reference for in the literature: what would you consider a good pairwise % for an MSA? I was thinking over 50%?
I'm open to thoughts, and to what you think are other benchmarks for a good, strong MSA.
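
For concreteness, here is a minimal sketch of one way the pairwise % could be computed, in case the definition matters for the answer. The function names are hypothetical, and the gap handling is an assumption: columns where both sequences have a gap are skipped, and a gap opposite a residue counts as a mismatch. Other tools may define it differently.

```python
from itertools import combinations


def pairwise_identity(a, b, gap="-"):
    """Percent identity between two aligned rows of equal length.

    Assumption: columns where both rows are gaps are ignored;
    a gap opposite a residue counts as a mismatch.
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    matches = 0
    compared = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap and y == gap:
            continue  # shared gap column carries no information
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def mean_pairwise_identity(rows):
    """Average percent identity over all unordered pairs of rows."""
    pairs = list(combinations(rows, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(pairwise_identity(a, b) for a, b in pairs) / len(pairs)


# Toy alignment, made up purely for illustration.
msa = ["ACGT-A", "ACGTTA", "AC-TTA"]
print(round(mean_pairwise_identity(msa), 1))  # 77.8
```

Under a different gap convention (for example, dropping every column that contains any gap) the same alignment would give a different number, so a threshold like 50% only means something once the definition is fixed.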