The Voldemort project looks beautiful, but I have not looked deeper into it yet.
In this current state, CouchDB may not be correct for "massive amounts of data." The distribution of data between nodes and routing requests is accordingly included in the roadmap, but has not yet been implemented. CouchDB's most famous manufacturing facilities use โtablesโ (โdatabasesโ on the couch) of about 200G.
HA is not supported on the basis of CouchDB, but can be easily created: all CouchDB nodes replicate database nodes among themselves in a multi-wizard setup. We put two Varnish proxies in front of the CouchDB machines, and Larn boxes are redundant with CARP . The design of CouchDBs "build from the Web" makes such things very easy.
The most pressing issue in our setup is the fact that there are still problems replicating large (multi-byte) attachments to CouchDB documents.
I suggest you also check out the traditional RDBMS route. There are huge problems with available talent beyond the RDBMS approach, and Oracle and Co have very affordable offers.
max
source share