Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-20068

Mongodb 3.0.5 with wiredTiger causing Out of memory Issues

    • Type: Icon: Bug Bug
    • Resolution: Done
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: 3.0.5
    • Component/s: WiredTiger
    • Labels:
      None
    • ALL
    • Hide

      Deploy a mongo version 3.0.5 sharded replica set

      Environment -
      Server: Ubuntu 14.04
      Ram: 60 GB

      Run four shard servers(all primary in their replica set) on this machine with 13 GB Cache Size.

      Perform write intensive jobs and over time memory used by shard servers increase and finally lead to System Kill due to out of memory.

      Show
      Deploy a mongo version 3.0.5 sharded replica set Environment - Server: Ubuntu 14.04 Ram: 60 GB Run four shard servers(all primary in their replica set) on this machine with 13 GB Cache Size. Perform write intensive jobs and over time memory used by shard servers increase and finally lead to System Kill due to out of memory.

      We have upgraded our replicated sharded mongo db setup to the latest 3.0.5 i hope of fixing OOM issues we are having after we migrated our Storage Engine from MMAP to WT but our memory usage issues didn't go away. Memory usage increases over time and only restart releases allocated memory.

      We are running 4 shards on Ubuntu Server(primary instance in our 3 member replica set) having 60 GB System Memory with WT as Storage Engine. We set Cache Size to 13 GB for each Shard Server leaving 8 GB memory for System processes and if Mongo requires more memory for Open Cursors, Open Sessions etc. But it uses way more and System kills the process.

      Two of the four shard servers running on our primary instance failed with OOM error due to system kill.

      Please find attached db.serverStatus(

      {tcmalloc:true}

      ) captured for all the four shard servers running on Primary captured from 1 hour before the failure occurred. Also attached syslog which logged the system kill actions of the two shard servers.

        1. ss-s00.log
          4.49 MB
        2. ss-s01.log
          6.43 MB
        3. ss-s10.log
          6.51 MB
        4. ss-s11.log
          4.39 MB
        5. syslog
          31 kB

            Assignee:
            ramon.fernandez@mongodb.com Ramon Fernandez Marina
            Reporter:
            praveenak Praveen Akinapally
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: