I'm currently working on 454 sequencing of collection of antibodies. However, whenever there is continuous A/C/G/T (AAAAAA), a frameshift may happen by chance. As a result, the nucleotide sequence identity (to germ line) is very high but the protein sequence identify is very low.
Any idea how to correct the frameshift using python script? Or any recommendation of matured tools?
Also, please bear in your mind that the antibody sequence has very high mutation rate.
Thanks a lot!