Core Server / SERVER-56854

Provide the ability for RSM requests to time out and mark the server as failed

Details

    • New Feature
    • Status: Closed
    • Critical - P2
    • Resolution: Fixed
    • None
    • 4.0.25, 4.2 Required
    • Networking
    • None
    • Fully Compatible
    • v4.2, v4.0
    • Sharding 2021-05-17

    Description

      Currently, a mongos node can send a hello request to a replica set member and wait indefinitely without receiving a response. In this case, the operation does not return until the connection times out on the mongos side, which could take several minutes depending on TCP keepalive settings.

      This ticket is to create an application-level timeout mechanism that allows the RSM (ReplicaSetMonitor) to keep making progress monitoring other nodes in the presence of TCP blackholes or similar network failures. The timeout should be on the order of seconds to ensure cluster availability.
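
      The idea can be illustrated with a minimal sketch. This is not the server's implementation (the RSM lives in the C++ codebase); it is a Python illustration under assumptions. The names send_hello, check_server, monitor_once, and HELLO_TIMEOUT_SECONDS are hypothetical, and the blackhole is simulated with a long sleep. Each hello is wrapped in an application-level deadline, so a member that never answers is marked as failed while the other members are still checked.

```python
import asyncio

# Hypothetical default; the ticket only says "on the order of seconds".
HELLO_TIMEOUT_SECONDS = 5.0


async def send_hello(host, blackholed):
    """Stand-in for a real hello request (assumption, not a driver API)."""
    if host in blackholed:
        # Simulate a TCP blackhole: no reply and no error for a long time.
        await asyncio.sleep(3600)
    return {"host": host, "ok": 1}


async def check_server(host, blackholed, timeout):
    """Run one hello under an application-level deadline."""
    try:
        await asyncio.wait_for(send_hello(host, blackholed), timeout)
        return host, "ok"
    except asyncio.TimeoutError:
        # The deadline expired: mark the server as failed instead of
        # waiting for the TCP layer to give up on the connection.
        return host, "failed"


async def monitor_once(hosts, blackholed, timeout=HELLO_TIMEOUT_SECONDS):
    """Check every member concurrently so one silent host blocks nobody."""
    results = await asyncio.gather(
        *(check_server(h, blackholed, timeout) for h in hosts)
    )
    return dict(results)


def run_demo():
    hosts = ["a:27017", "b:27017", "c:27017"]
    # A short timeout keeps the demo fast; a real value would be seconds.
    return asyncio.run(
        monitor_once(hosts, blackholed={"b:27017"}, timeout=0.05)
    )
```

      Calling run_demo() returns a status per host, with the simulated blackholed member reported as "failed" and the others as "ok". The point of the design is that the deadline is enforced by the application, independent of TCP keepalive settings.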

            People

              lamont.nelson@mongodb.com Lamont Nelson