- Type: Task
- Resolution: Fixed
- Priority: Major - P3
- Affects Version/s: None
- Component/s: None
- None
- Query Optimization
- Fully Compatible
As the experiment results show, kAreaDiff may not be the best partitioning strategy in some cases, especially when the variance in frequencies is low (the extreme case being when all values are unique).
Currently, our MaxDiff implementation supports partitioning buckets by selecting the top-K area differences (kAreaDiff). Similarly, introduce a variant, kFreqDiff, that selects the top-K frequency differences.
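
To make the difference between the two strategies concrete, here is a minimal sketch in Python. It is not the server's implementation. It assumes the textbook MaxDiff formulation, where a bucket boundary is placed between the adjacent values whose metric differs the most, and where area is frequency times the spread to the next value. The `DiffKind` enum, the `max_diff_boundaries` function, the tie-breaking rule, and the spread of 1 assigned to the last value are all hypothetical choices made for illustration.

```python
from enum import Enum


class DiffKind(Enum):
    # Names mirror the ticket; this enum itself is a hypothetical stand-in.
    AREA = "kAreaDiff"
    FREQ = "kFreqDiff"


def max_diff_boundaries(values, freqs, num_buckets, kind):
    """Return sorted indices i such that a bucket ends at values[i].

    `values` must be sorted and distinct; `freqs[i]` is the count of values[i].
    """
    n = len(values)
    if n == 0 or num_buckets < 1:
        return []

    if kind is DiffKind.FREQ:
        # kFreqDiff: compare raw frequencies of adjacent values.
        metric = list(freqs)
    else:
        # kAreaDiff: area = frequency * spread, where spread is the gap to
        # the next value. Assumption: the last value gets a spread of 1.
        spreads = [values[i + 1] - values[i] for i in range(n - 1)] + [1]
        metric = [f * s for f, s in zip(freqs, spreads)]

    # Difference in the metric between each pair of adjacent values.
    diffs = [(abs(metric[i + 1] - metric[i]), i) for i in range(n - 1)]

    # Keep the (num_buckets - 1) largest differences as bucket boundaries.
    # Assumption: ties are broken in favour of the lower index.
    diffs.sort(key=lambda d: (-d[0], d[1]))
    cuts = sorted(i for _, i in diffs[: num_buckets - 1])

    # The last value always closes the final bucket.
    return cuts + [n - 1]


# Illustrative data: a large gap between 3 and 20, and a frequency jump at 21.
values = [1, 2, 3, 20, 21, 22]
freqs = [2, 2, 2, 2, 9, 9]

area_cuts = max_diff_boundaries(values, freqs, 3, DiffKind.AREA)
freq_cuts = max_diff_boundaries(values, freqs, 3, DiffKind.FREQ)
```

On this input the area-based variant spends its boundaries around the large gap in the value domain, while the frequency-based variant places a boundary at the frequency jump. When all frequencies are equal, every frequency difference is zero, so the frequency-based boundaries in this sketch are decided entirely by the tie-breaking rule.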