Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-7405

Implement better random cursor operations for tiered cursors

    • Type: Icon: Improvement Improvement
    • Resolution: Unresolved
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None

      The implementation of random cursors on tiered tables – i.e., curtiered_next_random() – can be pretty skewed. We should come up with a better way to select random items.

      In the current implementation we select a tier at random and then select a random item from that tier.  This will produce skewed results in two likely usage scenarios:

      • Tiers have substantially different numbers if items
      • There are many items replicated across tiers

      In LSM we choose a random chunk, weighting by the number of items in each chunk. But with tiered cursors we don't know how many items are in each of the tiers.  Perhaps we could use the number of leaf pages in each tree/tier as a proxy.

            Assignee:
            backlog-server-storage-engines [DO NOT USE] Backlog - Storage Engines Team
            Reporter:
            keith.smith@mongodb.com Keith Smith
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: