Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Optimization

Case1: The following query over ce_data_1000 collection from the CE accuracy tests shows very imprecise estimate

Id: 6066: [ { "$match" : { "mixed_arr_str_70_30" : { "$gt" : "LeG7", "$lt" : "LgG7" } } } ], qtype: medium range, data type: array                                      
cardinality: 126, Histogram estimation: 394.17, errors: {  "absError" : 268.17,  "relError" : 2.13,  "selError" : 26.82 }

The data has only 33 values and is completely represented in the histogram buckets.

If we apply the formula

Card(ArrayMin(a < valHigh)) - Card(ArrayMax(a < valLow)) we get 291 - 165 = 136, which is a much more precise estimate. Investigate why we get the value of 394.17.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

case1
3 kB
Feb 10 2023 03:42:34 PM UTC

Assignee:: [DO NOT USE] Backlog - Query Optimization

Reporter:: Milena Ivanova

Participants:: [DO NOT USE] Backlog - Query Optimization, Milena Ivanova

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: Feb 10 2023 03:36:47 PM UTC

Updated:: Jun 29 2023 02:16:33 PM UTC

Details

Description

Attachments

Attachments

Activity

People

Dates