Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 2.6.4
Component/s: Sharding
Labels:
None

Assigned Teams:

Sharding
Operating System:
ALL
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Hi
During some lab testing to measuring the failover response time and behaviour, we created diferent scenarios:
1) Graceful failover (replSetStepDown)
2) Process aborted (killed externally in this simulation by task manager)
3) System outage (removing network cable)

We were using a 2 node replica set with an extra arbitrer and a mongos on top of them (servers were started with --shardsvr and --replset). Mongos was running in a linux box, while the mongod were running in windows, all using 2.6.4.

In scenarios 1 and 2, updates were aborted with errors, and started working once the new primary was elected.

In the 3rd scenario the client gets stuck in the update command (using c# driver latest stable version). we could overcome the issue by setting the socketTimeoutMS option in the connection string, but in principle the preferred behaviour is the one shown in scenarios 1 and 2.

Test has been repeted several times to avoid mistakes.

As a side note times were: 3, 16 and 23 seconds, for the three scenarios.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

failoverTestHostSchema.png
8 kB
Aug 29 2014 11:48:16 AM UTC

Assignee:: [DO NOT USE] Backlog - Sharding Team
Reporter:: Jose Luis Pedrosa
Participants:: [DO NOT USE] Backlog - Sharding Team, Andy Schwerin, Greg Studer, Jose Luis Pedrosa
Votes:: 0 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: Aug 21 2014 10:15:16 PM UTC
Updated:: Dec 06 2022 05:02:28 AM UTC
Resolved:: Jun 13 2019 03:46:14 PM UTC

Details

Description

Attachments

Attachments

Activity

People

Dates