I have some coding sequences of the ppc gene family from plant species. The sequences were translated into amino acids and aligned by Clustal. I found a region in the amino acid alignment where only one sequence seems to have a piece of insertion, but I'm not sure if that reflects a real insertion or it is from assembly error (for example chimera). This sequence seems alright otherwise, it has both start and stop codon for the cds.
I have a image for part of the alignment: https://drive.google.com/open?id=1gAgweM_1conDUa3vKAwKE0if2Ow6s8gp
Anybody have idea how to distinguish between an insertion and assembly error? Thanks!
I think I understand your point. That's useful suggestion, thank you!