Type: Improvement
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Sharding
Labels: None

Assigned Teams: Sharding EMEA
Confidence Status: None
Work Order: 3

Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name: None
Goal Link: None

The so-called "Top Chunk Optimisation" in Sharding checks whether writes are going to the highest (or lowest) chunk of a collection and, if so, pre-splits that chunk and possibly moves it to a different shard. The goal is to move the "top chunks" while they are empty (or as small as possible), rather than when they are full of data.
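The behaviour described above can be illustrated with a minimal sketch. This is a simplified model, not the server's implementation: the `Chunk` class, the function names `extreme_chunk_hit` and `split_and_move_top`, and the use of numeric shard keys are all assumptions made for illustration.

```python
from dataclasses import dataclass
from math import inf
from typing import List, Optional


@dataclass
class Chunk:
    min_key: float  # inclusive lower bound of the shard key range
    max_key: float  # exclusive upper bound of the shard key range
    shard: str      # name of the shard that currently owns the chunk


def extreme_chunk_hit(chunks: List[Chunk], key: float) -> Optional[str]:
    """Return "top" or "bottom" if `key` falls in an extreme chunk, else None.

    `chunks` must be sorted by min_key and cover (-inf, +inf) without gaps.
    """
    if chunks[-1].min_key <= key:
        return "top"
    if key < chunks[0].max_key:
        return "bottom"
    return None


def split_and_move_top(chunks: List[Chunk], split_key: float, target_shard: str) -> None:
    """Split the top chunk at `split_key` and assign the upper part to `target_shard`.

    The upper part is empty as long as `split_key` is above every existing key,
    which is what makes the move cheap in this model.
    """
    top = chunks[-1]
    if not top.min_key < split_key < top.max_key:
        raise ValueError("split_key must lie strictly inside the top chunk")
    chunks[-1] = Chunk(top.min_key, split_key, top.shard)
    chunks.append(Chunk(split_key, top.max_key, target_shard))
```

In this model, a write with a key in the last range reports "top", and the caller may then split that range and hand the new upper part to another shard before it accumulates data.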

Because it looks only at the extreme chunks of the collection, the top chunk optimisation appears to be useful only when the collection is bulk-loaded, on a single thread, using data sorted on the shard key.
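A rough way to see why the load pattern matters is to count how many inserts land in an extreme chunk. The sketch below assumes numeric shard keys and a fixed set of split points (real chunk boundaries change as splits happen, and threading is not modelled); the helper name `extreme_hit_fraction` is hypothetical.

```python
from bisect import bisect_right
from typing import Iterable, Sequence


def extreme_hit_fraction(keys: Iterable[float], boundaries: Sequence[float]) -> float:
    """Fraction of `keys` that fall in the lowest or highest chunk.

    `boundaries` is the sorted list of split points, so N split points
    define N + 1 chunks, indexed 0 (lowest) to N (highest).
    """
    last = len(boundaries)
    total = hits = 0
    for key in keys:
        index = bisect_right(boundaries, key)  # index of the chunk owning `key`
        total += 1
        if index == 0 or index == last:
            hits += 1
    return hits / total if total else 0.0
```

With split points at 100, 200, ..., 900, a load of ascending keys starting at 1000 hits the top chunk on every insert, while keys spread evenly over 0 to 999 hit an extreme chunk only 20% of the time, and that share shrinks as the number of chunks grows.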

Outside this pattern, the top chunk optimisation is largely unused and in certain cases causes issues, because it blocks the threads performing inserts to the extreme chunks. The blocking issue was mostly fixed in 4.2 by moving the auto-splitter to the shard servers and making it asynchronous, but it persists in older versions.

This ticket is to design and introduce a flag to disable the Top Chunk Optimisation (TCO).

depends on: SERVER-44192 Introduce metrics to measure the usefulness of the Top Chunk Optimisation (Closed)

duplicates: SERVER-61557 Get rid of top chunk migration (Closed)

Assignee: [DO NOT USE] Backlog - Sharding EMEA
Reporter: Kaloian Manassiev
Participants: [DO NOT USE] Backlog - Sharding EMEA, Kaloian Manassiev, Pierlauro Sciarelli
Votes: 0
Watchers: 8

Created: Oct 24 2019 04:19:34 PM UTC
Updated: Dec 06 2022 02:43:49 AM UTC
Resolved: Mar 03 2022 04:47:40 PM UTC
