Details
- Type: Bug
- Status: Closed
- Priority: Major - P3
- Resolution: Works as Designed
- Affects Version/s: 3.2.7
- Fix Version/s: None
- Operating System: ALL
Description
Hi,
I recently upgraded my MongoDB cluster from 3.0 to 3.2.7 and found that the I/O load increased a lot (about 100% on the primary nodes and 500% on the secondary nodes, judged mainly by the %util column of iostat).
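For reference, %util here is the rightmost column of iostat's extended output, sampled with something like the following (the 1-second interval is only an example):

iostat -x 1    # extended per-device stats every second; %util is the last column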
After reading the docs, I learned that journaling behavior changed a little in 3.2 (the journal is flushed every 50 ms), so I tried disabling journaling, and the I/O load dropped back to where it was in 3.0. So I guess the more frequent journal flushing is the main reason the primary nodes' I/O load rose by 100%.
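Disabling journaling for the test can be done with a restart along these lines (the dbpath is only a placeholder):

mongod --dbpath /data/db --nojournal    # same effect as storage.journal.enabled: false in the config file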
But I can't find anything in the docs that explains why the secondaries' I/O load increased by 500%. So I used strace to trace the mongod thread that is in charge of flushing the journal (for about 10 seconds).
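A summary like the ones below can be gathered with something along these lines, where <journal_thread_tid> is a placeholder for the journal thread's id (it can be found with top -H -p $(pidof mongod)):

strace -c -p <journal_thread_tid>    # attach for ~10 seconds, then Ctrl-C to print the per-syscall summary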
Primary node:
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 58.86    3.921470       13476       291       143 futex
 40.54    2.700511       18754       144           pwrite
  0.30    0.020001       10001         2           fdatasync

Secondary node:
% time     seconds  usecs/call     calls    errors syscall
------ ----------- ----------- --------- --------- ----------------
 83.04    4.272435       14682       291           fdatasync
 16.95    0.871993         998       874         9 futex
  0.01    0.000461           2       288           pwrite
From the above, the secondary makes nearly twice as many pwrite calls as the primary (288 vs. 144), and its fdatasync calls are about as numerous as its pwrite calls (291 vs. 288), far more than the primary's (2). Is this why the secondaries' I/O load increased by 500%? Is this a bug, or is it by design?
Issue Links
- is related to: SERVER-26040 High CPU/IOWAIT in MongoDB 3.2.9 (Closed)
- related to: SERVER-53667 High rate of journal flushes on secondary in 4.4 (Closed)