Loading...

XML

Word

Printable

JSON

Type: Question
Resolution: Incomplete
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.2.8
Component/s: Replication, WiredTiger
Labels:
None

Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

After upgrading to 3.2.8 (from 3.0.9) startup appears to hang with lots of disk reads to the local collection.

fatrace shows lots of:

mongod(87260): R /srv/mongodb/local/collection-2--6617514398526579318.wt

The file is a ~230gb file which is likely made up of the oplog.

This all happens after the log lines:

2016-07-25T13:12:15.345+0000 I STORAGE  [initandlisten] Placing a marker at optime Jul 25 11:21:00:37f
2016-07-25T13:12:15.345+0000 I STORAGE  [initandlisten] Placing a marker at optime Jul 25 12:55:11:25b
2016-07-25T13:12:25.296+0000 I NETWORK  [websvr] admin web console waiting for connections on port 28017
2016-07-25T13:12:25.302+0000 I REPL     [initandlisten] Did not find local voted for document at startup;  NoMatchingDocument: Did not find replica set lastVote document in local.replset.election

and on another run with debug:

2016-07-25T15:12:49.643+0000 I REPL     [initandlisten] Did not find local voted for document at startup;  NoMatchingDocument: Did not find replica set lastVote document in local.replset.election
2016-07-25T15:12:49.643+0000 D REPL     [initandlisten] returning minvalid: (term: -1, timestamp: May 24 18:17:53:802)({ ts: Timestamp 1464113873000|2050, t: -1 }) -> (term: -1, timestamp: Jul 25 15:04:29:36)({ ts: Timestamp 1469459069000|54, t: -1 })
2016-07-25T15:12:49.643+0000 D REPL     [initandlisten] Recovering from a failed apply batch, start:{ ts: Timestamp 1464113873000|2050, t: -1 }

With verbose logging on it just shows WT Journal flushing at a regular rate.

Why is this happening and how long might it take for the node to come back online?

Will it try and read the whole collection / op log?

related to

SERVER-25773 Upgrade from 3.0.9 to 3.2.9 discards oplog and can't catch up

Closed

Assignee:: Unassigned
Reporter:: Paul Ridgway
Participants:: Bartosz Debski, Kelsey Schubert, Paul Ridgway, Ramon Fernandez Marina
Votes:: 1 Vote for this issue
Watchers:: 11 Start watching this issue

Created:: Jul 25 2016 03:25:40 PM UTC
Updated:: Aug 26 2016 05:36:15 PM UTC
Resolved:: Aug 26 2016 05:35:44 PM UTC

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates