WiredTiger / WT-902

Improve bloom filter sizing

    • Type: Task
    • Resolution: Done
    • Priority: Major - P3
    • Affects Version/s: None
    • Component/s: None

      In LSM trees, we don't currently account for duplicate items when sizing the Bloom filters created during an LSM merge, so a merge of chunks with many overlapping keys gets a filter sized for more items than it will hold. Cardinality-estimation algorithms can estimate the number of distinct items across the chunks being merged; it's worth investigating.

      See:
      http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation
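
      A rough sketch of the idea, not WiredTiger code: it assumes the filter is sized as item count times a bits-per-item factor, and it uses a K-minimum-values estimator as a simple stand-in for whichever cardinality estimator is eventually chosen (the linked post uses HyperLogLog). All names here (`KMVEstimator`, `estimate_merged_count`, `bloom_bits`) are hypothetical.

```python
import hashlib
import heapq


class KMVEstimator:
    """K-minimum-values distinct-count estimator (hypothetical helper)."""

    def __init__(self, k=1024):
        self.k = k
        self._heap = []        # negated hashes: a max-heap of the k smallest
        self._members = set()  # hashes currently held, to skip duplicates

    @staticmethod
    def _hash(key):
        # Map a bytes key to a uniformly distributed 64-bit integer.
        digest = hashlib.blake2b(key, digest_size=8).digest()
        return int.from_bytes(digest, "big")

    def add(self, key):
        h = self._hash(key)
        if h in self._members:
            return
        if len(self._heap) < self.k:
            heapq.heappush(self._heap, -h)
            self._members.add(h)
        elif h < -self._heap[0]:
            # New hash is smaller than the current k-th smallest: swap it in.
            evicted = -heapq.heappushpop(self._heap, -h)
            self._members.discard(evicted)
            self._members.add(h)

    def estimate(self):
        if len(self._heap) < self.k:
            # Fewer than k distinct hashes seen, so the count is exact.
            return len(self._heap)
        kth_smallest = max(-self._heap[0], 1)
        return int((self.k - 1) * 2**64 / kth_smallest)


def estimate_merged_count(chunks, k=1024):
    """Estimate the number of distinct keys across all chunks in a merge."""
    estimator = KMVEstimator(k)
    for chunk in chunks:
        for key in chunk:
            estimator.add(key)
    return estimator.estimate()


def bloom_bits(item_count, bits_per_item):
    """Total filter size, assuming size = item count * bits per item."""
    return item_count * bits_per_item


# Two chunks of 6000 keys each that share 3000 keys (9000 distinct).
chunk_a = [b"key%06d" % i for i in range(0, 6000)]
chunk_b = [b"key%06d" % i for i in range(3000, 9000)]

naive_count = len(chunk_a) + len(chunk_b)
estimated_count = estimate_merged_count([chunk_a, chunk_b])

naive_bits = bloom_bits(naive_count, bits_per_item=16)
estimated_bits = bloom_bits(estimated_count, bits_per_item=16)
```

      With overlapping chunks, `estimated_bits` should come out smaller than `naive_bits`. In a real merge the estimator would need to be cheap enough to run over the input chunks before the filter is created, and the estimate would probably want some padding, since underestimating the count raises the false-positive rate.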

            Assignee:
            backlog-server-execution [DO NOT USE] Backlog - Storage Execution Team
            Reporter:
            alexander.gorrod@mongodb.com Alexander Gorrod
            Votes:
            0
            Watchers:
            1
