What open / free data mining engines and frameworks do you know and use for text data?
Thanks for any advice!
I don’t know about engines or frameworks, but I have used a tool called Weka; it has many algorithms already implemented.
Not sure what you are looking for. Perhaps something like Lucene?
Apache Mahout: open-source machine learning on MapReduce (Apache Hadoop).
Written in Java.
Site: http://mahout.apache.org/
http://girlincomputerscience.blogspot.com.br/2010/11/apache-mahout.html
http://www.ibm.com/developerworks/java/library/j-mahout/
RapidMiner (Windows, Mac, Linux). Also Weka and R.
Weka, RapidMiner, ELKI.
NLTK, for Python.
RapidMiner: http://www.RapidMiner.com/
It appears in the KDnuggets 2011 poll of tools used for analytics and data mining: http://www.kdnuggets.com/2011/05/tools-used-analytics-data-mining.html
SPMF, a data mining library in Java: http://www.philippe-fournier-viger.com/spmf/
Apache Mahout offers many popular algorithms that can be applied to text data and scale well. Apache UIMA does not offer data mining algorithms, but it is a widely used foundation for natural language processing.