Does anyone have experience using Stata and Hadoop? Stata 13 now has a Java Plugin API , so I think it should be simple to get them to play well.
I am particularly interested in being able to analyze weblog data in order to get it in a form suitable for statistical analysis.
This question arose at the beginning on Statalist , but there was no answer, so I thought I would try it here, where the audience is more likely to have experience with this technology.
hadoop hive apache-pig stata
Dimitriy V. Masterov
source share