I come from a computer science and data analytics background. I have learned machine learning and am trying to solve a biological problem using a support vector machine (SVM). I have a data set of amino acid sequences. There are 20 standard amino acids in the human body, and each protein is a sequence of these amino acids. I have computed the composition of these sequences. The composition is simply the count of each individual amino acid in a sequence, expressed as a percentage of the sequence length. Now I have to build an SVM model using these compositions as features. Can anyone give me some idea of how to build the SVM model?
Thanks in advance
The sequences look like this:
>HMPREF9352_0002 rod shape-determining protein MreD [Streptococcus gallolyticus subsp. gallolyticus TX20005]
From the amino acid sequence above, I computed the amino acid composition.
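To make the composition step concrete, here is a minimal sketch of what I mean. It assumes the sequences use the one-letter amino acid codes and that any other character is ignored. The names `AMINO_ACIDS` and `composition` and the toy sequence are only for illustration (it is not the real MreD sequence).

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes.
# A fixed order gives every sequence the same feature layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Return the percentage of each standard amino acid in `sequence`.

    Characters outside the 20 standard codes are ignored (an assumption).
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        # Empty or fully non-standard sequence: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [100.0 * counts[aa] / total for aa in AMINO_ACIDS]


# Made-up toy sequence, used only to show the shape of the output.
toy = "MKLLAV"
features = composition(toy)
```

Each sequence becomes a vector of 20 percentages, and one such vector per sequence is what I plan to use as the input row for the SVM.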