I implemented LinearSVC and SVC from the sklearn framework for classifying text. I use TfidfVectorizer to get a sparse representation of the input data, which consists of two different classes (benign data and malicious data). This part works very well, but now I wanted to implement some kind of anomaly detection using the OneClassSVM classifier and training the model with only one class (outliers detection ...). Unfortunately, it does not work with sparse data. Some developers are working on a patch ( https://github.com/scikit-learn/scikit-learn/pull/1586 ), but there are some errors, so there is still no solution for using the OneClassSVM implementation.
Are there any other methods in the sklearn-framework to do something like this? I am looking through examples, but nothing of the kind fits.
Thanks!
python scikit-learn machine-learning svm
Milchbart
source share