Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-53852

MongoDB hangs randomly

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Fixed
    • Affects Version/s: 4.4.2
    • Fix Version/s: 4.4.6, 5.0.0-rc0
    • Component/s: None
    • Labels:
      None
    • Backwards Compatibility:
      Fully Compatible
    • Operating System:
      ALL
    • Backport Requested:
      v4.4
    • Steps To Reproduce:
      Hide

      Sorry, but I actually don't know how to reproduce it. Like I said, it randomly hangs.

      Show
      Sorry, but I actually don't know how to reproduce it. Like I said, it randomly hangs.
    • Sprint:
      Security 2021-02-08, Security 2021-02-22
    • Case:

      Description

      I am running MongoDB 4.4.2 cluster with one Primary, one Secondary and one hidden Secondary. On the hidden Secondary, sometimes (like once every 2 days or so) MongoDB just hangs (once it also happened on the Primary). By "hangs", I mean:

      • I am not able to connect to mongod via mongoshell
      • Secondary stops replicating, and starts lagging (until I restart it manually)
      • but running `rs.status()` on the Primary server shows that hung Secondary is reachable

      I referred to https://jira.mongodb.org/browse/SERVER-34190 which looked like a similar issue (but it was fixed in 3.6.4). So I have attached the files that were requested in that issue:

      1. Output of the gdb command: gdb p $(pidof mongod) -batch -ex 'thread apply all bt' > gdb_`date +"%Y%m-%d_%H-%M-%S"`.txt
      2. Last 500 lines of mongod.log
      3. I have provided the latest files in diagnostic.data folder

      Please let me know if you need anything else or you want me to try running some commands.

        Attachments

        1. gdb_2021-01-16_13-02-39.txt
          605 kB
        2. mongod_500l.log
          129 kB
        3. metrics.interim
          31 kB
        4. metrics.2021-01-16T01-00-10Z-00000
          7.17 MB
        5. Screen Shot 2021-01-19 at 2.28.30 PM.png
          Screen Shot 2021-01-19 at 2.28.30 PM.png
          223 kB

          Issue Links

            Activity

              People

              Assignee:
              sergey.galtsev Sergey Galtsev
              Reporter:
              ashish@provakil.com Ashish Madeti
              Participants:
              Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

                Dates

                Created:
                Updated:
                Resolved: