It's hard to find a lot about Hive, but I found this snippet on the Hive site that relies heavily on HBase (in bold)
Hive is based on Hadoop, which is a batch processing system. Accordingly, this system does not and cannot promise low latency on requests . The paradigm here is rigorous assignment and notification when assignments are completed as opposed to real-time queries. As a result, it cannot be compared with systems such as Oracle, where the analysis is performed on a much smaller amount of data, but the analysis is much more iterative when the response time between iterations is less than a few minutes. The response time for Hive requests for the smallest tasks can be about 5-10 minutes, and for larger tasks this can even work in hours.
Because HBase and HyperTable are all about performance (modeled on Google BigTable), they sound like they will be much faster than Hive at the cost of functionality and a higher learning curve (for example, they don't have joins or SQL-like syntax).
Chris Bunch Aug 30 '08 at 22:16 2008-08-30 22:16
source share