Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-91194

timeseriesBucketsMayHaveMixedSchemaData not properly cloned upon data migration/initial sync/restore

    • Catalog and Routing
    • ALL
    • v8.0, v7.3, v7.0, v6.0
    • CAR Team 2024-06-10, CAR Team 2024-06-24, CAR Team 2024-07-08

      The timeseriesBucketsMayHaveMixedSchemaData collection option is:

      1. Only set on time-series
      2. Serialized as top-level fields (not part of the options sub-object)
      3. Defaulted to false at bucket collection creation time

      Defaulting timeseriesBucketsMayHaveMixedSchemaData to false (bullet 3) is problematic because collections are not only created from scratch but may also be created due to data cloning in the following cases:

      • Initial synchronization - when cloning a collection as part of adding a new node to a replica set
      • Chunk migration - when cloning a collection as part of migrating sharded collection's data
      • MovePrimary - when cloning a collection as part of changing its db primary in a sharded cluster
      • Resharding (and moveCollection) - when cloning a collection as part of redistributing all its data
      • Mongodump/ mongorestore: when restoring a cluster from a backup

      This means that in all clusters starting from v5.2 (SERVER-60574), the value of timeseriesBucketsMayHaveMixedSchemaData may be incorrect.

      The short term solution is to always pretend the option is set to true (not changing the actual value on the catalog but always behave as if it was).

      SERVER-91195 will take care of designing a long-term solution for avoiding hitting the issue in the future.

            Assignee:
            pierlauro.sciarelli@mongodb.com Pierlauro Sciarelli
            Reporter:
            pierlauro.sciarelli@mongodb.com Pierlauro Sciarelli
            Votes:
            0 Vote for this issue
            Watchers:
            14 Start watching this issue

              Created:
              Updated: