  Core Server / SERVER-60565

Ensure newly created time-series buckets do not contain problematic mixed-schema data

    • Type: Task
    • Resolution: Done
    • Priority: Major - P3
    • Fix Version/s: 5.2.0
    • Affects Version/s: None
    • Component/s: None
    • Labels: None
    • Sprint: Execution Team 2021-10-18, Execution Team 2021-11-01
    • 135

      Currently, buckets are allowed to contain measurements with arbitrarily mixed schemas. This can lead to complicated type-comparison semantics (as seen in SERVER-59505). To facilitate indexing, we need to maintain certain invariants about which types of data can coexist within a single bucket.

      This will require changes to the time-series insert path. Specifically, the current MinMax structure that maintains the control summary data will be retrofitted to determine whether an incoming measurement's schema is incompatible with the measurements that have already landed in a given bucket. If so, we will close the bucket and open a new one for the new measurement.
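
      The close-and-reopen behavior described above can be illustrated with a small Python sketch. It is not the server's implementation. The names `Bucket`, `BucketCatalog`, and `type_class` are hypothetical, and the compatibility rule is an assumption made for illustration: a field must keep the same coarse type class (for example "number" or "string") across all measurements in a bucket. The ticket itself does not define which schema mixes count as problematic.

```python
from typing import Any, Dict, List


def type_class(value: Any) -> str:
    """Map a value to a coarse type class (an assumed stand-in for BSON type classes)."""
    if isinstance(value, bool):  # checked before int, since bool is a subclass of int
        return "bool"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    if isinstance(value, dict):
        return "object"
    if isinstance(value, list):
        return "array"
    if value is None:
        return "null"
    return type(value).__name__


class Bucket:
    """Hypothetical bucket that tracks min/max summaries plus a per-field type class."""

    def __init__(self) -> None:
        self.measurements: List[Dict[str, Any]] = []
        self.schema: Dict[str, str] = {}  # field name -> type class seen so far
        self.min: Dict[str, Any] = {}
        self.max: Dict[str, Any] = {}
        self.closed = False

    def is_compatible(self, measurement: Dict[str, Any]) -> bool:
        # A field not yet seen in this bucket is always compatible.
        return all(
            self.schema.get(field, type_class(value)) == type_class(value)
            for field, value in measurement.items()
        )

    def add(self, measurement: Dict[str, Any]) -> None:
        for field, value in measurement.items():
            cls = self.schema.setdefault(field, type_class(value))
            # Only track min/max for values this sketch knows how to order.
            if cls in ("number", "string"):
                if field not in self.min or value < self.min[field]:
                    self.min[field] = value
                if field not in self.max or value > self.max[field]:
                    self.max[field] = value
        self.measurements.append(measurement)


class BucketCatalog:
    """Hypothetical insert path: close the open bucket on a schema conflict."""

    def __init__(self) -> None:
        self.open_bucket = Bucket()
        self.closed_buckets: List[Bucket] = []

    def insert(self, measurement: Dict[str, Any]) -> Bucket:
        if not self.open_bucket.is_compatible(measurement):
            self.open_bucket.closed = True
            self.closed_buckets.append(self.open_bucket)
            self.open_bucket = Bucket()
        self.open_bucket.add(measurement)
        return self.open_bucket
```

      In this sketch, inserting `{"temp": 20}` and then `{"temp": 21.5}` lands both measurements in the same bucket, while a following `{"temp": "hot"}` closes that bucket and opens a new one. Because every value of a field in a bucket shares one type class, the min/max comparisons never cross types.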

            Assignee: Dan Larkin-York (dan.larkin-york@mongodb.com)
            Reporter: Dan Larkin-York (dan.larkin-york@mongodb.com)
            Votes: 0
            Watchers: 3
