If you are in Python / interpreted languages, check out the excellent NLTK database at nltk.org. He is well versed in the page and recently published book of O'Reilly.
If you use Java and / or require a more mature, but more difficult to understand framework, instead of GATE .
source share