-
Type: Bug
-
Resolution: Fixed
-
Priority: Major - P3
-
Affects Version/s: 6.0.2, 6.1.0-rc4, 6.2.0-rc0
-
Component/s: None
-
Fully Compatible
-
ALL
-
v6.2, v6.0
-
Sharding EMEA 2022-11-14, Sharding EMEA 2022-11-28, Sharding EMEA 2022-12-12, Sharding EMEA 2022-12-26, Sharding EMEA 2023-01-23, Sharding EMEA 2023-02-06
-
120
Collmod coordinator may resumes migrations after hitting a retriable error.
This could lead to wrong execution scenario like the following:
- Collmod starts, stop migrations and enter the kUpdateConfig phase
- Hit a retriable error and unblocks migrations
- Attempt to re-execute the kUpdateConfig but this time with the migrations unblocked
Keep in mind that we can't simply resume migrations on non-retriable error, in fact even after hitting a non-retriable error we can't guarantee that the coordinator won't be recovered and re-executed from a new primary node in case of stepdown.
- is caused by
-
SERVER-61760 The new implementation of CollMod should rely on the AllowMigrations to stop/resume migrations
- Closed
- related to
-
SERVER-89035 collMod coordinator can deadlock on tracked unsharded collection
- Closed