Minimum is one kind of pseudo-distributed mode . Participating moving parts:
Assuming you are working on HDFS (what you should do):
- 1 HDFS NameNode
- 1 or more HDFS Secondary NameNode (s)
- 1 or more HDFS DataNode (s)
For MapReduce (if you want it):
- 1 MapReduce JobTracker
- 1 or more MapReduce TaskTracker (s) (usually the same machines as datanodes)
HBase
- 1 HBase Master (s) ( - ).
- 1 HBase RegionServer (s) ( , datanodes)
- 1 Thrift Servers ( HBase , )
ZooKeeper
- 3 - 5 ZooKeeper node (s)
, . node , , - HDFS NameNode, Hadoop.
HDFS RegionServers 3, .
, ( ):
- 1x HDFS NameNode
- 1x JobTracker/Secondary NameNode
- 3x ZK
- 3x DataNode/RegionServer ( MapReduce, TaskTracker)
- 1x Thrift Server ( HBase - )