The desired replication strategy is not formally supported by MongoDB.
A MongoDB replica set consists of one primary with asynchronous replication to one or more secondary servers in the same replica set. You cannot configure a replica set with multiple primaries, or with replication from one replica set to another.
However, there are several possible approaches to your use case, depending on how actively you want to update the central server and the amount of data / updates that you need to manage.
Some common caveats:
Combining data from multiple stand-alone servers can cause unexpected conflicts. For example, unique indexes cannot account for documents created on other servers.
Ideally, the data you consolidate will still be separated into a distinct database per source server, so that you do not get strange crosstalk between disparate documents that share the same namespace and _id but come from different origin servers.
Approach # 1: use mongodump and mongorestore
If you just need to periodically synchronize content to a central server, one way to do this is to use mongodump and mongorestore. You can schedule periodic mongodump runs from each individual instance and use mongorestore to import them into the central server.
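As a minimal sketch of how such a scheduled sync could be wired up (the hostnames, database name, and backup paths below are illustrative placeholders, not anything from your setup):

```python
import subprocess  # used when you actually execute the commands, as shown below

def dump_cmd(source_host, db, out_dir):
    # Build a mongodump invocation for one source server.
    return ["mongodump", "--host", source_host, "--db", db, "--out", out_dir]

def restore_cmd(central_host, dump_dir):
    # Build a mongorestore invocation against the central server.
    return ["mongorestore", "--host", central_host, dump_dir]

# Example usage (requires the MongoDB tools on PATH and reachable servers):
#   for store in ["store1.example.net", "store2.example.net"]:
#       subprocess.check_call(dump_cmd(store, "sales", "/backups/" + store))
#       subprocess.check_call(restore_cmd("central.example.net", "/backups/" + store))
```

A cron job (or Windows scheduled task) running a script like this on whatever interval you can tolerate is usually enough for periodic consolidation.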
Warning:
There is a --db option for mongorestore which allows you to restore a dump into a database with a different name (if needed)
mongorestore only performs inserts into the existing database (i.e. it does not do updates or upserts). If a document with the same _id already exists in the target database, mongorestore will not replace it.
You can use mongodump options such as --query to be more selective about which data is exported (e.g. only the most recent data rather than everything)
If you want to limit the amount of data dumped and restored on each run (for example, only exporting "changed" data), you will also need to decide how to handle updates and deletions on the central server.
Given these caveats, the simplest use of this approach would be a full drop and restore (i.e. using mongorestore --drop) to ensure all changes are copied.
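The two variants above can be sketched as command builders as well. Note the assumptions: the incremental variant presumes each document carries a "lastModified" date field (adapt the query to your own schema), and mongodump requires --collection when --query is used:

```python
import json

def incremental_dump_cmd(source_host, db, coll, since_iso, out_dir):
    # Dump only documents changed since a given ISO timestamp, assuming a
    # "lastModified" field in each document (an assumption about your schema).
    # --query takes a JSON document; mongodump requires --collection with it.
    query = json.dumps({"lastModified": {"$gte": {"$date": since_iso}}})
    return ["mongodump", "--host", source_host, "--db", db,
            "--collection", coll, "--query", query, "--out", out_dir]

def full_resync_cmd(central_host, dump_dir):
    # Simplest-but-heaviest option: --drop removes each target collection
    # before restoring, so updates and deletes on the source are reflected.
    return ["mongorestore", "--host", central_host, "--drop", dump_dir]
```

The incremental variant transfers less data but leaves you responsible for propagating updates and deletes yourself; the --drop variant handles those for free at the cost of re-copying everything.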
Approach # 2: use a tailable cursor with the MongoDB oplog
If you need more real-time or incremental replication, one possibility is to create tailable cursors on the MongoDB oplog.
This approach is essentially "rolling your own replication". You would need to write an application that tails the oplog on each of your MongoDB instances and looks for changes of interest to save to the central server. For example, you might only replicate changes for selected namespaces (databases or collections).
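A bare-bones sketch of that idea, using PyMongo (assumptions: each source runs with an oplog enabled, e.g. as a single-node replica set; only inserts are handled, while updates and deletes are left as an exercise; all names are illustrative):

```python
def ns_filter(namespaces):
    # Oplog query restricting replication to selected "db.collection"
    # namespaces ("ns" is the namespace field of each oplog entry).
    return {"ns": {"$in": list(namespaces)}}

def tail_oplog(source_uri, central_uri, namespaces):
    # Tail the source's oplog and apply matching inserts to the central
    # server. No resume point, no error handling: a sketch, not production.
    from pymongo import MongoClient, CursorType  # imported here so the
    # helpers above can be used without the driver installed
    source = MongoClient(source_uri)
    central = MongoClient(central_uri)
    oplog = source.local["oplog.rs"]
    cursor = oplog.find(ns_filter(namespaces),
                        cursor_type=CursorType.TAILABLE_AWAIT)
    for entry in cursor:
        if entry["op"] == "i":  # insert: "o" holds the full document
            db_name, coll = entry["ns"].split(".", 1)
            central[db_name][coll].replace_one(
                {"_id": entry["o"]["_id"]}, entry["o"], upsert=True)
```

A real implementation would also record the last-seen oplog timestamp so it can resume after restarts, and handle the 'u' (update) and 'd' (delete) operation types.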
A related tool that may be of interest is the experimental Mongo Connector from 10gen Labs. This is a Python module that provides an interface for tailing the oplog and replicating selected data.
Warning:
To do this, you need to implement your own code and learn/understand how to work with oplog documents.
There may be an alternative product that better supports your desired replication model out of the box.
Stennie