Convert DNA Sequences into numerical vectors for R / Weka classification using NB / SVM
7.4 years ago

I would like to use machine learning techniques such as Naive Bayes and SVM in Weka to identify species from DNA sequence data. The issue is that I first have to convert the DNA sequences into numerical vectors.

My sequences look like this:

------------------------------------------------G
------------------------------------------GGAGATG
------------------------------------------GGAGATG
------------------------------------------GGAGATG
TTATTAATTCGAGCAGAATTAGGAAATCCTGGATCTTTAATTGGTGATG
----------------------------------------------ATG
CTATTAATTCGAGCTGAGCTAAGCCAGCCCGGGGCTCTGCTCGGAGATG
-----------------------TCAACCTGGGGCCCTACTCGGAGACG
----TAATCCGAGCAGAATTAAGCCAACCTGGCGCCCTACTAGGGGATG
CTATTAATTCGAGCTGAGCTAAGCCAGCCTGGGGCTCTGCTCGGAGATG
TTATTAATTCGTTTTGAGTTAGGCACTGTTGGAGTTTTATTAG---ATA

How can I do this? Can anyone suggest other programs besides Weka for doing ML with DNA sequences?
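One common way to turn aligned sequences like these into numbers is one-hot encoding: each alignment column becomes four 0/1 features, one per base. The sketch below is only an illustration under stated assumptions, not a recommendation from this thread. The function names `one_hot_encode` and `encode_alignment` are made up for the example, gaps (`-`) and any other symbol are encoded as all zeros (one choice among several), and the two toy sequences are shortened stand-ins rather than the real data.

```python
# Four 0/1 features per alignment column, in the order A, C, G, T.
# Assumption: gaps ('-') and ambiguous symbols are encoded as all zeros.
ALPHABET = "ACGT"


def one_hot_encode(aligned_seq):
    """Return a flat list of 0/1 values, four per alignment column."""
    vector = []
    for base in aligned_seq.upper():
        vector.extend(1 if base == letter else 0 for letter in ALPHABET)
    return vector


def encode_alignment(sequences):
    """Encode equal-length aligned sequences into a list of feature vectors."""
    lengths = {len(seq) for seq in sequences}
    if len(lengths) != 1:
        raise ValueError("sequences must all be aligned to the same length")
    return [one_hot_encode(seq) for seq in sequences]


# Shortened toy sequences for illustration only.
example = ["----ATG", "GGAGATG"]
vectors = encode_alignment(example)
```

Each sequence of length n gives a vector of length 4n, so every row has the same number of features as long as the input is a proper alignment. Counting k-mers is an alternative that does not need an alignment at all, at the cost of losing position information.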

sequence machine learning classification • 1.9k views
7.4 years ago

I don't know anything about how to use Weka; I just know it exists. However, a Google search turned up this result: https://sourceforge.net/projects/bioweka/

It might be interesting!


That program does not work. It has no manual and no examples, and it is very complicated to use.
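If a dedicated plugin is not usable, numeric vectors can also be written straight into Weka's plain-text ARFF format and loaded from there. The following is a minimal sketch, assuming the sequences have already been turned into equal-length numeric vectors and that each one has a known species label. The helper name `to_arff`, the attribute names `f0`, `f1`, ... and the labels `speciesA` / `speciesB` are hypothetical, and labels containing spaces would need quoting, which this sketch does not handle.

```python
def to_arff(vectors, labels, relation="dna"):
    """Build ARFF text: numeric feature attributes plus a nominal class."""
    if len(vectors) != len(labels):
        raise ValueError("need exactly one label per vector")
    classes = sorted(set(labels))
    n_features = len(vectors[0])

    lines = ["@relation " + relation, ""]
    # One numeric attribute per feature column.
    lines += ["@attribute f%d numeric" % i for i in range(n_features)]
    # The class attribute is nominal, listing every observed label.
    lines.append("@attribute species {" + ",".join(classes) + "}")
    lines += ["", "@data"]
    # One comma-separated row per sequence, class label last.
    for vector, label in zip(vectors, labels):
        lines.append(",".join(str(v) for v in vector) + "," + label)
    return "\n".join(lines) + "\n"


# Toy input for illustration only.
arff_text = to_arff([[1, 0], [0, 1]], ["speciesA", "speciesB"])
```

Saving `arff_text` to a file with an `.arff` extension gives something Weka can open directly, after which Naive Bayes or SVM classifiers can be run on it with `species` as the class attribute.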
