I did an MSA for a group of proteins and some sequences are really wrong. The average of aa in these sequences is about 76, but some unique sequences have 226 aa, others 31 or 54, that is, outliers. I want to avoid using another tool in the pipeline just to remove those sequences. Is there a simple metric that I can use to cut this sequences that I can use? Probably something that cut sequences that deviates from the average length of the sequences. But I need something more rigorous that this to justify this choice. Someone could help with this?