I've had a list of books in which each book belongs to a category.
- Flying a Plane - Aviation
- Painting a picture - Art
- 1001 Recipes - Cooking
I have a huge enough sample set of data. I need to categorize my newer books using some algorithm. I know it'll never be a 100% accurate but a good guess is good for me.
What should I use to implement to do something like this? Should I go with Classifier4J and it's Vector Classifier?
Are there other tools that I should look at like Weka? It would be great if someone could point me to some articles/examples to get me started.
Thanks