I have a number of protein models of varying lengths in PDB format, and I'm trying to use machine learning to predict their energy. I have the energy value for each model.
The problem is that most machine learning algorithms require a fixed-length vector as input, but my protein models all have different lengths.
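To make the mismatch concrete, here is a minimal sketch of a naive per-residue encoding. The two sequences are made-up placeholders rather than my actual models, and `one_hot_flat` is just a hypothetical helper name. It shows that the vector length scales with the number of residues:

```python
# Toy illustration only: the sequences below are invented placeholders,
# standing in for the residue sequences of two PDB models.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def one_hot_flat(sequence):
    """Concatenate one 20-dimensional one-hot vector per residue."""
    vec = []
    for res in sequence:
        row = [0] * len(AMINO_ACIDS)
        row[AMINO_ACIDS.index(res)] = 1
        vec.extend(row)
    return vec


model_a = "MKTAYIAK"        # 8 residues (made up)
model_b = "MKTAYIAKQRQISF"  # 14 residues (made up)

len_a = len(one_hot_flat(model_a))  # 8 residues * 20 = 160 features
len_b = len(one_hot_flat(model_b))  # 14 residues * 20 = 280 features
print(len_a, len_b)
```

The two feature vectors have different lengths, so they cannot be stacked into a single input matrix as they are.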
Does anyone know of a fixed-length vector representation for proteins?