, nosql - , , . , H-base (hdfs), .
, "", , ( ). petascale , , "" . ( ), .
, , , , , node. Pig (http://pig.apache.org/) Hive (http://hive.apache.org/) hdfs, SQL- , mapreduce . , . , .
, , . , , , . , . ( , , ).
HDFS datasets are mostly processed at a specific point in time, after which the results are published in one batch. For example, a social networking site might collect all connection data once a day, search it for new connections between people, and, only once the job has finished for everyone in the data set, publish the results as the familiar 'X is now connected to Y' messages. None of this happens in real time.
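The daily batch described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: two small in-memory lists stand in for the daily snapshots that would really live in HDFS, a set difference stands in for the MapReduce job, and the names `normalize`, `new_connections`, and `batch_messages` are hypothetical, not part of any Hadoop API.

```python
def normalize(edges):
    """Treat connections as undirected: (a, b) and (b, a) are the same edge."""
    return {tuple(sorted(edge)) for edge in edges}


def new_connections(previous, current):
    """Return connections present in today's snapshot but not in yesterday's."""
    return sorted(normalize(current) - normalize(previous))


def batch_messages(previous, current):
    """Build every notification for the day; nothing is emitted until all are ready."""
    return [f"{a} is now connected to {b}" for a, b in new_connections(previous, current)]


# Hypothetical snapshots; in practice these would be large files on HDFS.
yesterday = [("alice", "bob")]
today = [("bob", "alice"), ("alice", "carol"), ("dave", "bob")]

# The whole result is computed first and then published in one go.
messages = batch_messages(yesterday, today)
for message in messages:
    print(message)
```

The point of the sketch is the shape of the workflow rather than the implementation: the comparison runs over the complete snapshot, and the messages only become visible after the entire run has finished, which is why a connection made in the morning may not be announced until the next batch.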