Pervasive DataRush is a platform for parallel dataflow programming that can be used from any JVM language, including Clojure.
Pervasive DataRush uses a dataflow architecture. A program runs as a graph of computation nodes connected by queues, and the nodes exchange data through those queues. Because data is streamed, only the data needed by the currently active operations has to be in memory at any given time, which makes it possible to analyze very large data sets. Besides scaling to problems larger than available memory, dataflow graphs can exploit several forms of parallelism.
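To make the node-and-queue idea concrete, here is a minimal sketch in plain Java. It does not use the DataRush API. It only uses `java.util.concurrent`, and the class name `PipelineSketch`, the method `sumOfSquares`, and the end-of-stream sentinel are made up for illustration. Three nodes (a source, a squaring step, and a summing sink) run concurrently and are connected by small bounded queues, so only a few items are in memory at once no matter how long the stream is.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

public class PipelineSketch {
    // Hypothetical end-of-stream marker; real values here are always positive.
    private static final int EOS = Integer.MIN_VALUE;

    // Runs source -> square -> sum as a three-node pipeline.
    // queueCapacity bounds how many items may sit between two nodes.
    // n should stay small enough that i * i fits in an int.
    public static long sumOfSquares(int n, int queueCapacity) {
        BlockingQueue<Integer> numbers = new ArrayBlockingQueue<>(queueCapacity);
        BlockingQueue<Integer> squares = new ArrayBlockingQueue<>(queueCapacity);

        // Source node: emits 1..n, then the end-of-stream marker.
        Thread source = new Thread(() -> {
            try {
                for (int i = 1; i <= n; i++) {
                    numbers.put(i); // blocks while the queue is full
                }
                numbers.put(EOS);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        // Transform node: squares each value and forwards it downstream.
        Thread square = new Thread(() -> {
            try {
                while (true) {
                    int v = numbers.take(); // blocks while the queue is empty
                    if (v == EOS) {
                        squares.put(EOS);
                        break;
                    }
                    squares.put(v * v);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        source.start();
        square.start();

        // Sink node (runs on the calling thread): accumulates the results.
        long total = 0;
        try {
            while (true) {
                int v = squares.take();
                if (v == EOS) {
                    break;
                }
                total += v;
            }
            source.join();
            square.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("pipeline interrupted", e);
        }
        return total;
    }

    public static void main(String[] args) {
        System.out.println(sumOfSquares(1000, 16));
    }
}
```

The bounded queues provide back-pressure: a fast producer blocks on `put` until the consumer catches up, which is what keeps memory use independent of the stream length. This sketch shows only pipeline parallelism (different nodes working on different items at the same time); it does not attempt to reproduce how DataRush itself schedules or partitions work.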
Customers use DataRush for large-scale data analytics and data preparation (ETL).
joedubin