Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 6.3.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Optimization
Backwards Compatibility:
Fully Compatible
Confidence Status:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

We should have integration and/or unit tests that exercise the following scenarios in histogram generation and in estimation of predicates:

minimum and maximum values for each type (most importantly numeric)
inf/NaN/invalid values- if we can insert these into a collection, we have to make sure we handle them correctly during bucket creation/estimation
a wide range of values including extreme types
extreme date/time values
Decimal128 types that are too large to fit in a double
very large arrays
very large strings

We need to ensure both that histogram creation on these types results in a valid histogram, and that cardinality estimation for these values (both when present and when absent from a histogram) works adequately.

is depended on by

SERVER-72819 Estimate the cardinality of extreme values in histograms

Closed

is related to

SERVER-72850 Allow strings with unicode characters to be added to histograms

Backlog

SERVER-72997 [CQF] Allow histograms with number of buckets equal to number of types

Closed

related to

SERVER-72807 [CQF] Allow NaN to be added to histograms

Closed

Assignee:: Ben Shteinfeld

Reporter:: Alya Berciu

Participants:: Alya Berciu, Ben Shteinfeld, Githook User

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: Nov 30 2022 10:34:23 AM UTC

Updated:: Oct 29 2023 09:29:56 PM UTC

Resolved:: Jan 19 2023 02:25:47 PM UTC

Confidence Status Last Update:: 10/Jan/23 9:58 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates