[SERVER-19884] All config server crash Created: 12/Aug/15 Updated: 12/Aug/15 Resolved: 12/Aug/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Admin |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Improvement | Priority: | Minor - P4 |
| Reporter: | patrick wong | Assignee: | Ramon Fernandez Marina |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None |
| Remaining Estimate: | Not Specified |
| Time Spent: | Not Specified |
| Original Estimate: | Not Specified |
| Issue Links: | |
| Participants: | |
| Description |
|
If all 3 config servers crash, the entire cluster becomes non-functional even if all the data and the data servers are still intact. Restoring the whole cluster then becomes the only way to recover. For a terabyte-scale MongoDB deployment, that recovery may take a few days, which some production systems cannot afford. Data servers are easily protected by adding extra members to each replica set. However, we only have 3 config servers, and we have no way to store the delta of config changes between config server backups.

Request: once we restore an outdated config server, we need a way (a new feature or a tool) to synchronize the data between the outdated config server and the existing data servers, or to recover the lost config metadata from the existing data servers.

OR

We need a way to restore the config server to the point that the existing data servers have reached.
|
| Comments |
| Comment by Ramon Fernandez Marina [ 12/Aug/15 ] |
|
patrickwong@wisers.com, if I understand correctly, you're describing a scenario where all three config servers crash at the same time and are unable to start up again (for example, because of

In this scenario having more config servers will not help if they're subject to the same failure mode as the others, e.g., because they're running on the same machine, or on the same rack and the rack loses power. Since the purpose of having three config servers is for redundancy, every sharded deployment should make sure that no more than two config servers may be affected by any single point of failure.

That being said, as part of
Regards, |
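
The redundancy advice in the comment above (no single point of failure should be able to take out every config server) can be sketched as a small placement check. This is only an illustration under stated assumptions, not a MongoDB tool or API: the server names, the failure-domain labels, and the function `shared_failure_domains` are all hypothetical, and the mapping would have to be filled in by hand from knowledge of the deployment.

```python
from collections import defaultdict


def shared_failure_domains(failure_domains):
    """Return the failure domains that every config server depends on.

    failure_domains maps each config server name to the set of failure
    domains (machine, rack, power feed, ...) it depends on. Any domain
    returned here is a single point of failure for all config servers.
    """
    servers = set(failure_domains)
    members = defaultdict(set)
    for server, domains in failure_domains.items():
        for domain in domains:
            members[domain].add(server)
    # A domain is risky when the set of servers in it equals all servers.
    return sorted(d for d, s in members.items() if s == servers)


# Hypothetical layout: three config servers on separate hosts, one rack.
layout = {
    "cfg1": {"host-a", "rack-1"},
    "cfg2": {"host-b", "rack-1"},
    "cfg3": {"host-c", "rack-1"},
}
risky = shared_failure_domains(layout)
```

With the hypothetical layout shown, `risky` contains `rack-1`, because a power loss on that rack would stop all three config servers at once. Moving any one of them to another rack makes the result empty.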