Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.2.1
Component/s: Sharding
Labels:
None

Assigned Teams:

Sharding
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Case:
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

We are running multiple sharded mongo clusters, and recently one of our clusters started having an autosplitting issue.

Our mongos processes have been logging the following messages:

I SHARDING [conn14835] sharded connection to shard1/mongo-blob-1:27017,mongo-blob-2:27017 not being returned to the pool
W SHARDING [conn14835] could not autosplit collection database_name.collection_name :: caused by :: 9996 stale config in runCommand ( ns : database_name.collection_name, received : 2|4||56b053c081c73af0480d60fe, wanted : 2|7||56b053c081c73af0480d60fe, recv )

These messages always appear together and seem related. Only one of our clusters is affected. The warning appears with several databases and collections, but for others autosplitting seems to remain functional.

I have tried restarting each mongod and mongos process in this specific cluster, but nothing changed. I cannot find any issues with the config servers for this cluster either. We have a replicated config server setup (the 3.2 default).

Any advice on how to proceed? I assume this issue is an indication that something is wrong with my config cluster. Are there any diagnostics commands available to check the config cluster health? I would prefer to not have to resync my config cluster, as that would give me downtime on my service. Could simply restarting the config servers be sufficient?

I welcome any advice.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

repro.js
0.5 kB
Dec 15 2016 05:44:20 AM UTC

duplicates

SERVER-28418 make the split command on mongod return a stale version error if the requested chunk bounds are not found

Closed

is duplicated by

SERVER-23500 could not autosplit collection :: caused by :: 9996 stale config

Closed

related to

SERVER-24148 splitVector should check if given chunk exists

Closed

Assignee:: [DO NOT USE] Backlog - Sharding Team
Reporter:: Goffert van Gool
Participants:: [DO NOT USE] Backlog - Sharding Team, Anthony Pastor, Esha Maharishi, Goffert van Gool, jiang chao, Ramon Fernandez, Randolph Tan
Votes:: 6 Vote for this issue
Watchers:: 21 Start watching this issue

Created:: Feb 04 2016 01:37:51 AM UTC
Updated:: Dec 06 2022 04:34:23 AM UTC
Resolved:: Jul 28 2017 07:35:13 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

PagerDuty