In some function, knowledge of all existing data is used when it includes some king of aggregation (mean, average, standard deviation) or some order (first, last).
If you want a distributed NOSQL solution that supports AF, you need to use some centralized indexing and metadata to store information about the data in all nodes, thereby having a master node and, possibly, one point of failure.
You should ask what you expect using NoSQL. Do you want circuit diagrams? Distributed data? Improved performance for very simple queries?
Depending on your needs, I see here three main alternatives:
1 - use distributed NoSQL without a single point of failure (i.e. Cassandra ) to store your data and use a map / reduce data processing and output the results for the desired function (almost any basic NoSQL Hadoop support). The caveat is that map / reduce requests are not real-time (they may take minutes or hours to complete the request) and require additional configuration and training.
2 - use a traditional DBMS that supports several servers, such as MySQL Cluster
3 - use NoSQL with a master / slave topology that supports ad-hoc queries and aggregations like Mongo
Regarding the second question: yes, you can rely on M / R to replace AF. You can do almost anything with M / R.
lstern
source share