Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Gone away
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Sharding
Labels:
- sharding-wfbf-sprint

Assigned Teams:

Sharding EMEA
Operating System:
ALL
Sprint:
Sharding EMEA 2021-07-26, Sharding EMEA 2021-08-09
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

CollectionShardingState::getCriticalSectionSignal does not take or enforce any locks, and ShardingMigrationCriticalSection::getSignal does not do any synchronization either.

When entering the critical section in migration, we use an X lock, but when exiting (which calls .reset() on the signal), we only use an IX lock.

Furthermore, in setShardVersion, when we call getCriticalSectionSignal we're only holding an IS lock on the collection which does not conflict with the IX lock held in exitCriticalSection, so there's no synchronization on reading/writing the shared_ptr for the critical section signal. The same is true in _flushRoutingTableCacheUpdates.

Fortunately in the normal path for inspecting the critical section we use a shared lock on the CSR which conflicts with the exclusive lock taken on the CSR when we exit the critical section.

Assignee:: [DO NOT USE] Backlog - Sharding EMEA
Reporter:: Matthew Saltz (Inactive)
Participants:: [DO NOT USE] Backlog - Sharding EMEA, Kaloian Manassiev, Matthew Saltz
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Dec 17 2019 04:57:02 PM UTC
Updated:: Oct 27 2023 08:42:28 PM UTC
Resolved:: Nov 03 2021 10:34:38 AM UTC