Core Server / SERVER-62282

Migration recovery should be retried until success

Details

    • Type: Task
    • Resolution: Fixed
    • Priority: Major - P3
    • 5.3.0
    • None
    • Sharding
    • None
    • Fully Compatible
    • Sharding EMEA 2022-01-10, Sharding EMEA 2022-01-24

    Description

      Currently, when certain errors occur during a migration (or during migration recovery), the donor shard clears its filtering metadata so that the migration is recovered the next time a query attempts to use that collection. Some code paths trigger a best-effort recovery, while others do not, and even a best-effort attempt can fail. This behavior is correct, but with the new migration protocol (where the recipient takes the critical section) it can lead to long periods during which the recipient holds both the critical section (making the collection unavailable) and the ActiveMigrationRegistry (leaving the recipient shard unable to donate or receive chunks for any other collection).

      This ticket is to evaluate ensuring that migration recovery is retried until it succeeds.
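
      As an illustration of the "retry until success" idea only, here is a minimal Python sketch. It is not the server's implementation: `RecoveryError`, `recover_until_success`, `make_flaky_recovery`, and the backoff parameters are all hypothetical names and values chosen for this example. It assumes a recovery attempt either returns normally or raises a transient error, and that retrying with a capped exponential backoff is acceptable.

```python
import time
from typing import Callable, List


class RecoveryError(Exception):
    """Hypothetical transient failure raised by a single recovery attempt."""


def recover_until_success(
    attempt_recovery: Callable[[], None],
    initial_delay_s: float = 0.1,
    max_delay_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call attempt_recovery until it returns without raising RecoveryError.

    Waits between attempts with a capped exponential backoff and returns
    the total number of attempts made. Any other exception propagates.
    """
    attempts = 0
    delay = initial_delay_s
    while True:
        attempts += 1
        try:
            attempt_recovery()
            return attempts
        except RecoveryError:
            # Transient failure: wait, then try again with a longer delay.
            sleep(delay)
            delay = min(delay * 2, max_delay_s)


def make_flaky_recovery(failures_before_success: int) -> Callable[[], None]:
    """Build a stand-in recovery step that fails a fixed number of times."""
    remaining = [failures_before_success]

    def attempt() -> None:
        if remaining[0] > 0:
            remaining[0] -= 1
            raise RecoveryError("transient failure")

    return attempt


# Usage: record the waits instead of actually sleeping.
waits: List[float] = []
attempts = recover_until_success(make_flaky_recovery(3), sleep=waits.append)
print(attempts, waits)
```

      Under these assumptions, the caller stops relying on a later query to trigger recovery: the loop exits only once an attempt succeeds, and the capped backoff keeps repeated failures from retrying in a tight loop.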

          People

            antonio.fuschetto@mongodb.com Antonio Fuschetto
            jordi.serra-torrens@mongodb.com Jordi Serra Torrens
