[SERVER-42734] when start shard and the error:DuplicateKey: E11000 duplicate key error collection: config.cache.chunks Created: 09/Aug/19 Updated: 19/Sep/19 Resolved: 19/Sep/19 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Chen Jian | Assignee: | Siyuan Zhou |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Attachments: |
|
| Operating System: | ALL |
| Sprint: | Repl 2019-08-26, Repl 2019-09-09, Repl 2019-09-23 |
| Participants: |
| Description |
|
at first, one of my primary shards running out of disk space because of duplicated key error log. |
| Comments |
| Comment by Kelsey Schubert [ 19/Sep/19 ] | ||||||
|
We haven’t heard back from you for some time, so I’m going to mark this ticket as resolved. If this is still an issue for you, please provide additional information and we will reopen the ticket. Regards, | ||||||
| Comment by Siyuan Zhou [ 13/Aug/19 ] | ||||||
|
Hi chenjian@tmxmall.com, sorry to hear the failure. We need more data to investigate the root cause of this issue. Since you mentioned the issue happened before the restart and crash, could you please post the log before the crash? We also need the content of "config" database on the crashed node and the oplog on the node. We need the data before the restore procedure below. You can dump the data with mongodump and upload it to this ticket after compression.
To recover from the failure, you need to remove all the documents of the collection in question. This is safe because the collection is a cache used by sharding and will be recreated by sharding.
During the whole procedure, the crashed node cannot become primary, so please make sure you run the commands when the replset is stable with primary other than the crashed node. After the procedure, the crashed node should become a normal secondary and can run for elections. | ||||||
| Comment by Chen Jian [ 13/Aug/19 ] | ||||||
|
Hi,Can you tell me how to restore this node first | ||||||
| Comment by Chen Jian [ 09/Aug/19 ] | ||||||
| ||||||
| Comment by Kaloian Manassiev [ 09/Aug/19 ] | ||||||
|
chenjian@tmxmall.com, what is the shard key of the pr_tmxbase.toffs_6.chunks collection? Can you please run this query against the cluster and include the output:
|