Set rawData for resharding sampling-based initial split policy aggregation

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major - P3
    • 8.3.0-rc0
    • Affects Version/s: 8.3.0-rc0
    • Component/s: Sharding
    • None
    • Catalog and Routing
    • Fully Compatible
    • ALL
    • CAR Team 2025-08-18
    • None
    • 3
    • TBD
    • 🟥 DDL
    • None
    • None
    • None
    • None
    • None
    • None

      The initialization phase of resharding samples some documents from the source collection in order to pick the split points for the initial chunks. This is done by sending an aggregation with $sample over the new shard key.

      When the collection being resharded is timeseries, at this point the new shard key is already translated to the bucket fields (i.e. "raw") and the aggregation pipeline refers to those fields. However currently the outgoing aggregation command is not sent with rawData, so the aggregation is being executed over the measurements instead of the buckets as intended.

      Send the aggregation command for that pipeline with rawData.

              Assignee:
              Joan Bruguera Micó
              Reporter:
              Joan Bruguera Micó
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: