  1. Core Server
  2. SERVER-49145

Prevent distributed lock timeouts in suites with background migrations


    Details

    • Type: Bug
    • Status: Open
    • Priority: Major - P3
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: Backlog
    • Component/s: Sharding
    • Labels:
      None
    • Operating System:
      ALL
    • Linked BF Score:
      9

      Description

      A migration holds a distributed lock on the migrating namespace for its entire duration. In suites with background migrations, such as multi_stmt_txn_jscore_passthrough_with_migration, sharding metadata commands that take the distributed lock can therefore exhaust the 20-second acquisition timeout. This happens either when a single migration takes longer than 20 seconds, or when the command repeatedly fails to acquire the lock between migrations because the lock has no fairness policy. Using the in-memory NamespaceSerializer lock in the commands does not help on its own: migrations don't use the NamespaceSerializer, so the commands can still time out taking the distributed lock.

      There are two possible fixes. Changing all config server commands that take distributed locks to use the NamespaceSerializer should avoid the timeouts. Alternatively, the distributed lock acquisition timeout can be raised in these suites.
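
      To illustrate the two failure modes and the effect of a longer timeout, here is a minimal sketch in Python. It is a toy model in simulated time, not the server's lock manager: the function name, the interval representation, and the 500 ms retry interval are all assumptions made for the example. Only the 20-second timeout comes from the description above.

```python
# Toy model only: simulated time in milliseconds, no real locks or threads.
ACQUISITION_TIMEOUT_MS = 20_000  # the 20-second timeout from the description
RETRY_INTERVAL_MS = 500          # assumed polling interval (hypothetical)


def acquire_with_timeout(held, start_ms,
                         timeout_ms=ACQUISITION_TIMEOUT_MS,
                         retry_ms=RETRY_INTERVAL_MS):
    """Return the time at which the lock is acquired, or None on timeout.

    `held` is a list of half-open [begin, end) intervals during which a
    migration holds the lock. The caller polls every `retry_ms` and has no
    queue position, so nothing guarantees it gets the lock when it frees up.
    """
    deadline = start_ms + timeout_ms
    now = start_ms
    while now <= deadline:
        if not any(begin <= now < end for begin, end in held):
            return now
        now += retry_ms
    return None


# Case 1: a single migration holds the lock for 25 seconds.
long_migration = [(0, 25_000)]
print(acquire_with_timeout(long_migration, start_ms=1_000))  # None

# Case 2: back-to-back migrations, each under 20 seconds, leave 300 ms gaps
# that happen to fall between polls, so the command never sees the lock free.
back_to_back = [(0, 10_100), (10_400, 20_100), (20_400, 30_000)]
print(acquire_with_timeout(back_to_back, start_ms=500))  # None

# Raising the timeout lets the command outlast the long migration.
print(acquire_with_timeout(long_migration, start_ms=1_000,
                           timeout_ms=60_000))  # 25000
```

      Note that a longer timeout only helps with the first case in this model. The second case depends on polling luck, which is why serializing all lock takers through one fair in-memory queue is the more robust option.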

              People

              Assignee:
              backlog-server-sharding Backlog - Sharding Team
              Reporter:
              jack.mulrow Jack Mulrow