I need to set up a data store that can store the level of PB files (files are mostly small json, images and csv files, but some of them may be ~ 100 MB of binary files).
I am studying a distributed data warehouse that is non-mastering and does not have a single point of failure.
And I found Riak and GlusterFS.
I want to ask if any of you have used both of them before?
I know that the interface (DB / Map) is very different. But it seems to me that they both use hashing and similar distributed technologies. Will they have similar performance, consistency, and availability?
filesystems cluster-computing riak
Eric Fong
source share