Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-48395

Extended stalls during heavy insert workload

    • Type: Icon: Bug Bug
    • Resolution: Fixed
    • Priority: Icon: Major - P3 Major - P3
    • 4.4.0-rc13
    • Affects Version/s: 4.4.0-rc6, 4.4.0-rc7
    • Component/s: Storage
    • Fully Compatible
    • ALL
    • Storage - Ra 2020-06-15, Storage - Ra 2020-06-29, Storage - Ra 2020-07-13, Storage - Ra 2020-07-27
    • 0
    • 5

      While working with the repro for WT-6175 I noticed that there were extended stalls during the insert phase.

      • The stalls seem to end with the start of the next checkpoint
      • With checkpoints disabled the stalls lasted as long as 10 minutes
      • During the stalls the log reports operations that took the entire duration of the stall to complete
      • They appear to have something to do with page splits.

      FTDC, logs, and repro script attached. The repro creates two collections of 5 GB each with a 5 GB cache, using 50 client threads on a machine with 24 cpus.

        1. with-oplog-closeup.png
          with-oplog-closeup.png
          93 kB
        2. with-oplog.png
          with-oplog.png
          110 kB
        3. stalls.png
          stalls.png
          270 kB
        4. stall-in-mem-split.png
          stall-in-mem-split.png
          105 kB
        5. server-resolved.png
          server-resolved.png
          127 kB
        6. repro.sh
          0.9 kB
        7. insert.tgz
          1.34 MB
        8. 4.2-pic.png
          4.2-pic.png
          143 kB

            Assignee:
            sulabh.mahajan@mongodb.com Sulabh Mahajan
            Reporter:
            bruce.lucas@mongodb.com Bruce Lucas (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            18 Start watching this issue

              Created:
              Updated:
              Resolved: