Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-85859

killop_drop_collection.js does not consider forward progress until succeed property of sharded cluster coordinators

    • Type: Icon: Bug Bug
    • Resolution: Fixed
    • Priority: Icon: Major - P3 Major - P3
    • 8.0.0-rc0
    • Affects Version/s: None
    • Component/s: Sharding
    • Labels:
      None
    • Catalog and Routing
    • Fully Compatible
    • ALL
    • CAR Team 2024-02-05, CAR Team 2024-02-19, CAR Team 2024-03-04
    • 5
    • 1

      killop_drop_collection.js is a core test that checks that killOp might or might not successfully interrupt drop collection (because after some point, interrupting drop collection might leave the server in a inconsistent state).

      Since PM-1965 drop collection has the forward progress until succeed property in sharded clusters, meaning that after the creation of the coordinator, unless there is some unrecoverable error (like for example, the collection cannot be dropped), the DDLCoordinator will continue it's execution even in the presence of interruptions or failures.

      This causes the following scenario:

      1. The test issues a drop collection in a sharding jscore passthrough suite, sending the _shardsvrDropCollection command to the primary shard
      2. The primary shard manages to instantiate a coordinator
      3. The test issues a killOp, successfully interrupting the _shardsvrDropCollection command when it is waiting for the coordinator to finish

      Because of 2, the listCollections command issued by the test will race with the coordinator, making this check non-deterministic.

      One solution could be that in sharded clusters, we should not expect the collection to exist even if the killOp command succeeded.

            Assignee:
            david.dominguez@mongodb.com David Dominguez Sal (Inactive)
            Reporter:
            marcos.grillo@mongodb.com Marcos José Grillo Ramirez
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: