I vote for RapidMiner for three reasons, and I used both of them:
- The RapidMiner GUI makes things smoother - it's well designed.
- You can use plugins in RapidMiner, which have a ton of background power, such as R and Weka - this makes the system much more versatile than GATE for working with data and data.
- RapidMiner has a pretty good support network. I definitely recommend looking at the Vancouver Data link above because what Neal does with the text completely blew my mind - so I went and used his methods. They worked like a charm!
- RapidMiner can be deployed as a server, which means you can really combine numbers and data when you need to. There are no restrictions on the desktop.
However, here are a few things about GATE:
- GATE probably has a better semantic understanding of the text, and the built-in dictionaries are quite extensive.
- The GATE system is mature and well developed and continues to evolve.
- GATE can handle Arabic and several other languages ββthat can cause problems with RapidMiner. In fact, for direct work with the Case, GATE darn impressively. It also has many plugins, but installing them is not just plug-and-play, as in RapidMiner.
RapidMiner should release version 5.2 around the end of January 2012 (right now), so if you decide to go this route, you will have the option of a well-supported 5.1 or beta version 5.2.
William MB
source share