- Type: Documentation
- Resolution: Unresolved
- Priority: Major - P3
- Affects Version/s: None
- Component/s: Not Applicable
- Storage Engines
- 5
- StorEng - 2025-03-28
Context
Following from WRITING-26135, there has been a request from the storage engines team to collaborate with the resharding team on a method to improve MongoDB's overall resharding process. Currently, the resharding process depends largely on the random cursor. The random cursor underlies the MongoDB $sample stage, which is intended to provide split points. Split points govern how much data each shard receives.
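To illustrate why sample quality matters for split points, here is a minimal sketch. It is not MongoDB's or WiredTiger's actual resharding code: the function names `split_points_from_sample` and `shard_sizes` are hypothetical, keys are plain integers, and split points are assumed to be taken at evenly spaced quantiles of the sorted sample.

```python
import bisect


def split_points_from_sample(sampled_keys, num_shards):
    """Return num_shards - 1 split points taken at evenly spaced
    quantiles of the sorted sample (hypothetical helper)."""
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    ordered = sorted(sampled_keys)
    if not ordered:
        return []
    n = len(ordered)
    return [ordered[(i * n) // num_shards] for i in range(1, num_shards)]


def shard_sizes(all_keys, split_points):
    """Count how many keys land in each range defined by the split points.
    A key equal to a split point goes to the range on its right."""
    sizes = [0] * (len(split_points) + 1)
    for key in all_keys:
        sizes[bisect.bisect_right(split_points, key)] += 1
    return sizes


keys = list(range(1000))

# A sample that represents the whole key space gives even shards.
even_points = split_points_from_sample(keys, 4)
even_sizes = shard_sizes(keys, even_points)

# A sample drawn only from the low end of the key space gives skewed shards.
skewed_points = split_points_from_sample(keys[:100], 4)
skewed_sizes = shard_sizes(keys, skewed_points)
```

In this toy setup the representative sample produces four shards of equal size, while the skewed sample leaves most keys in the last shard. That is the kind of imbalance a non-uniform or duplicate-heavy sample could cause.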
Problem statement
The aim within WiredTiger is to find a reliable way to sample data evenly, so that the resharding process can distribute the database evenly across shards. This ticket covers the investigation and the creation of a design proposal document describing the possible solutions WiredTiger can offer.
- related to: WT-13310 WT random cursor continues to return duplicate records due to poor interaction with MongoDB layer's query yielding (Open)