  1. Core Server
  2. SERVER-93407

Implement partitioning by top-K frequency difference in MaxDiff algorithm

    • Type: Task
    • Resolution: Fixed
    • Priority: Major - P3
    • 8.1.0-rc0
    • Affects Version/s: None
    • Component/s: None
    • None
    • Query Optimization
    • Fully Compatible

      As the experiment results show, kAreaDiff may not be the best partitioning strategy in some cases, especially when the variance in frequencies is low (an extreme case is when all values are unique).

      Currently, our MaxDiff implementation partitions buckets by selecting the top-K area differences (kAreaDiff). Similarly, introduce a variant, kFreqDiff, that selects the top-K frequency differences.
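
      To make the difference between the two strategies concrete, here is a minimal Python sketch of MaxDiff-style boundary selection. It is not the server's implementation: the function name `bucket_boundaries`, the `strategy` strings, the tie-breaking rule, and the spread of 1 assigned to the last distinct value are all assumptions made for illustration. Values are assumed to be numeric.

```python
from collections import Counter


def bucket_boundaries(values, num_buckets, strategy="freq"):
    """Return the distinct values that close each bucket (hypothetical helper).

    strategy="area": metric is frequency * spread (distance to the next value).
    strategy="freq": metric is the frequency alone.
    """
    counts = sorted(Counter(values).items())  # [(value, frequency), ...]
    if len(counts) <= num_buckets:
        # Few enough distinct values: one bucket per value.
        return [v for v, _ in counts]

    metrics = []
    for i, (value, freq) in enumerate(counts):
        if strategy == "area":
            # Assumption: the last distinct value gets a spread of 1.
            spread = counts[i + 1][0] - value if i + 1 < len(counts) else 1
            metrics.append(freq * spread)
        else:
            metrics.append(freq)

    # Difference in metric between each pair of neighbouring distinct values.
    diffs = [(abs(metrics[i + 1] - metrics[i]), i) for i in range(len(counts) - 1)]

    # Keep the top K-1 differences; ties go to the smaller index (assumption).
    top = sorted(diffs, key=lambda d: (-d[0], d[1]))[: num_buckets - 1]
    cut_indexes = sorted(i for _, i in top)

    # A bucket ends at each cut, and the final bucket ends at the max value.
    return [counts[i][0] for i in cut_indexes] + [counts[-1][0]]


sample = [1] * 10 + [2] * 10 + [3, 4] + [100] * 10
print(bucket_boundaries(sample, 3, strategy="freq"))  # [2, 4, 100]
print(bucket_boundaries(sample, 3, strategy="area"))  # [3, 4, 100]
```

      On this sample, the frequency-based variant cuts where the frequency jumps (10 to 1, then 1 to 10), while the area-based variant is dominated by the large gap between 4 and 100.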

            Assignee:
            matt.olma@mongodb.com Matt Olma
            Reporter:
            chii.huang@mongodb.com Chi-I Huang
            Votes:
            0
            Watchers:
            2

              Created:
              Updated:
              Resolved: