Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-902

Improve bloom filter sizing

    XMLWordPrintable

    Details

    • Type: Task
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Works as Designed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None

      Description

      In LSM trees we don't currently take into account duplicate items when sizing bloom filters that are being created during an LSM merge. There are algorithms available that can help estimate the number of duplicate items - it's worth investigating.

      See:
      http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                backlog-server-storage Backlog - Storage NY Team
                Reporter:
                alexander.gorrod Alexander Gorrod
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: