Remove shard does not wait for migrations to finish on the drained shard

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: Sharding
    • None
    • Sharding EMEA
    • ALL
    • None
    • 3
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      The removeShard command is only checking locally in the config server if the removed shard does not own any more chunks, however, this check can pass right after the latest migration commits the chunk on the config server, but has not finished the cleanup, meaning that important persistency cleanup tasks like starting the donor shard range deletion, removing the recipient shard's range deletion document document and even removing the coordinator document, might never be executed if a user shuts down the shard immediately after receiving a successful result of a removeShard command.

      Remove shard should check with the draining shard if all migrations are finished and successful.

              Assignee:
              [DO NOT USE] Backlog - Sharding EMEA
              Reporter:
              Marcos José Grillo Ramirez
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: