-
Type: Task
-
Resolution: Unresolved
-
Priority: Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
Cluster Scalability
As a precondition for SERVER-68050, it would be good to perform some testing of the sampling behavior to see if/how much more skewed the data distribution would end up between shards if we were to initially create only 1 chunk per shard. Since sampling is not exactly splitting the shard key space in even parts, the theory is that having more chunks better "spreads the risk" because distributing several heterogeneously sized chunks we may end up having a more balanced distribution.
Some experiments with a lot of chunks have shown >10GB of difference between shards with a global collection size of 500GB.
- depends on
-
WT-11532 Fix session reset RNG by using cursor RNG
- Closed
- is depended on by
-
SERVER-68050 Change resharding split policy to create one chunk per shard by default
- Blocked