Hey here is my problem
Given a set of documents, I need to assign a predefined category to each document.
I was going to use the n-gram approach to represent the textual content of each document, and then train the SVM classifier for the training data that I have.
Correct me if I miss something clear.
Now the problem is that the categories must be dynamic. Meaning, my classifier should process new training data with a new category.
For example, if I prepared a classifier to classify this document as category A, category B or category C, and then I was given new training data with category D. I should be able to gradually train my classifier by providing him with new training data for " category D ".
To summarize, I do not want to combine the old training data (with 3 categories) and the new training data (with a new / invisible category) and train my classifier again. I want to train my classifier on the fly
Can this be implemented using SVM? if not, can you recommend me some classification algorithms? or any book / document that can help me.
Thanks at Advance.
Tefa
source share