Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Done
Fix Version/s: 2.8.0-rc0
Affects Version/s: Legacy C++ Implementation
Component/s: mongorestore
Labels:
- commands

Sprint:
MCI 2.7.8
Documentation Changes:
Completed

There are a couple different ways to make this parallelized:

On the server, use the multi-index builder to build them in parallel. As of 2.6, the createIndexes command can take >1 index at a time. As of 2.8, it also creates them in parallel, so using a single createIndexes for all indexes for a collection should make index builds much more efficient.
On the client:
1. Open one connection per collection (up to a user-specified or adaptively arrived at limit)
2. Use >1 thread per collection. Read ahead from the source file and make multiple connections to the server

Considerations:

Throttling the rate from the client to server so as not to overload either client or server.
When using >1 thread per collection, data will not be restored in the same order it was dumped.

Original Description
I am running mongorestore to recreate a copy of large(ish) production database on a separate system (~300GB). It seems from observation that the process of importing the data and re-creating the indexes is happening in serial. Given that indexes can be created in the background during normal operating conditions, that at least this bit could be done in parallel. Ideally it would be fantastic to see the collections themselves be restored in parallel since the machine(s) I'm working with have plenty of extra resources to spare for this process. Is this doable? Or perhaps there are complexities that prevent this which I am not aware of?
Thanks, as always.

is duplicated by

SERVER-12246 mongorestore has poor performance

Closed

is related to

TOOLS-18 make mongodump multi-threaded / parallel

Closed

TOOLS-68 Enable Mongoimport to be mult-threaded and take advantage of all CPU cores

Closed

TOOLS-596 expose numInsertionWorkers option to the end user

Closed

1.	Use batch inserts in mongorestore	TOOLS-282	Closed	Kyle Erf (Inactive)	2.8.0-rc0
2.	Parallelize restoring dumps per collection	TOOLS-283	Closed	Kyle Erf (Inactive)	2.8.0-rc0
3.	Use new createIndexes command in mongorestore	TOOLS-286	Closed	Kyle Erf (Inactive)	2.8.0-rc0

Assignee:: Unassigned
Reporter:: Scott D'Aquila
Votes:: 8 Vote for this issue
Watchers:: 14 Start watching this issue

Created:: Nov 01 2013 10:16:11 PM UTC
Updated:: Jul 02 2026 07:07:09 PM UTC
Resolved:: Oct 21 2014 09:09:21 PM UTC

Details

Description

Attachments

Issue Links

Sub-Tasks

Activity

People

Dates