Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Replication, Sharding
Labels:
None

Operating System:
ALL
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

I have a 13 server mongodb cluster consisting of 1 query router, 3 config servers with replication and 3 shards, each with replication(primary, secondary and arbiter). Its installed on AWS-EC2 R series instances. Monit is used to restart the mongodb service incase it exceeds 95% memory usage.

My Shard3Primary failed and the Shard3Secondary became primary(as expected). The problem is that the Shard3Primary mongodb process isnt able to restart stating

Initializing full-time diagnostic data capture with directory '/data_storage/data/diagnostic.data'
2018-08-03T03:59:12.144+0000 I REPL [initandlisten] Rollback ID is 210
2018-08-03T03:59:12.145+0000 I REPL [initandlisten] Starting recovery oplog application at the appliedThrough: { ts: Timestamp(1533190391, 15335), t: 454 }
2018-08-03T03:59:12.145+0000 I REPL [initandlisten] Replaying stored operations from { : Timestamp(1533190391, 15335) } (exclusive) to { : Timestamp(1533190418, 1) } (inclusive).
2018-08-03T03:59:12.145+0000 F REPL [initandlisten] Oplog entry at { : Timestamp(1533190391, 15335) } is missing; actual entry found is { : Timestamp(1533190393, 1) }
2018-08-03T03:59:12.145+0000 F - [initandlisten] Fatal Assertion 40292 at src/mongo/db/repl/replication_recovery.cpp 218
2018-08-03T03:59:12.145+0000 F - [initandlisten]

***aborting after fassert() failure

I tried to take the mongodump on the QueryRouter and it failed too.(the same command had succeeded for earlier dumps).

I have attached the screenshots of Shard3Primary for your reference.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

1.png
357 kB
Aug 03 2018 04:25:15 AM UTC
2.png
363 kB
Aug 03 2018 04:25:15 AM UTC
3.png
347 kB
Aug 03 2018 04:25:15 AM UTC

Assignee:: Nick Brewer (Inactive)
Reporter:: Prasad Surase
Participants:: Nick Brewer, Prasad Surase
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Aug 03 2018 04:26:07 AM UTC
Updated:: Sep 15 2018 02:48:56 PM UTC
Resolved:: Aug 21 2018 09:28:20 PM UTC

Details

Description

Attachments

Attachments

Activity

People

Dates