Natural Language Processing - Truecaser Classifier

Please suggest a good machine learning classifier for data validation. In addition, is it possible to specify your own rules / functions for authentication in such a classifier? Thanks for all your suggestions.

thanks

+4
source share
2 answers

I implemented the truecaser version in Python. It can be trained for any language when you provide enough data (i.e. correctly dressed sentences).

For English, it achieves 98.38% accuracy with sample sentences from Wikipedia. A preliminary model for the English language is provided.

You can find it here: https://github.com/nreimers/truecaser

+3
source

Please take a look at this document.

http://www.cs.cmu.edu/~llita/papers/lita.truecasing-acl2003.pdf

They report 98% accuracy.

0
source

All Articles