Type: Bug
Resolution: Fixed
Priority: Critical - P2
Fix Version/s: 5.3.2, 6.0.0-rc5, 4.4.15, 5.0.10, 6.1.0-rc0
Affects Version/s: 6.0.0-rc1, 5.3.0, 4.4.0, 5.0.0
Component/s: Sharding
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:
v6.0, v5.3, v5.0, v4.4
Sprint:
Execution Team 2022-05-02, Execution Team 2022-05-16
Case:
Linked BF Score:
169
Confidence Status:
None
Work Order:
0

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

Here are the steps to reproduce the deadlock:

1. Run a cross-shard transaction with two participant shards, shard0 and shard1, where shard0 is the coordinator shard. Pause the TransactionCoordinator thread right before it writes the commit decision (i.e. after the transaction has entered the "prepared" state).
2. Run a setFCV command against shard0. Wait until the setFCV thread is blocked waiting to acquire the global S lock (i.e. it is waiting for the prepared transactions that existed before the FCV change to commit or abort).
3. Unpause the TransactionCoordinator thread. The transaction cannot commit because the TransactionCoordinator is blocked waiting to acquire the IX lock on the config.transaction_coordinators collection, which it needs in order to write the commit decision.
4. Both the setFCV thread and the TransactionCoordinator thread now hang.
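
The circular wait in the steps above can be sketched as a small wait-for graph. This is only an illustrative model, not server code: the node names and the `find_cycle` helper are hypothetical, and the edge from the TransactionCoordinator to setFCV assumes the coordinator's IX request cannot be granted while the setFCV thread's S request is pending, which is how step 3 reads.

```python
def find_cycle(wait_for):
    """Return one cycle in a wait-for graph as a list of nodes, or None."""
    visiting, done = [], set()

    def visit(node):
        if node in done:
            return None
        if node in visiting:
            # Reached a node already on the current path: a cycle exists.
            return visiting[visiting.index(node):] + [node]
        visiting.append(node)
        for blocker in wait_for.get(node, ()):
            cycle = visit(blocker)
            if cycle:
                return cycle
        visiting.pop()
        done.add(node)
        return None

    for start in wait_for:
        cycle = visit(start)
        if cycle:
            return cycle
    return None


# Hypothetical model of the reproduction steps: "X": ["Y"] means X waits on Y.
wait_for = {
    # setFCV wants the global S lock, so it waits for the prepared transaction.
    "setFCV": ["prepared_txn"],
    # The prepared transaction finishes only once the commit decision is written.
    "prepared_txn": ["TransactionCoordinator"],
    # Assumption: the coordinator's IX request is stuck behind setFCV's S request.
    "TransactionCoordinator": ["setFCV"],
}

cycle = find_cycle(wait_for)
print(" -> ".join(cycle))
```

Removing any one edge breaks the cycle in this model, for example if the coordinator could persist its decision without waiting behind the setFCV lock request.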

causes

SERVER-75205 Deadlock between stepdown and restoring locks after yielding when all read tickets exhausted

Closed

is related to

SERVER-60682 TransactionCoordinator may block acquiring WiredTiger write ticket to persist its decision, prolonging transactions being in the prepared state

Closed

SERVER-57476 Operation may block on prepare conflict while holding oplog slot, stalling replication indefinitely

Closed

SERVER-66340 Improve distributed transaction commit locking behavior

Closed

SERVER-66341 Improve journal flusher locking behavior

Closed

SERVER-66342 Remove resourceIdFeatureCompatibilityVersion

Closed

related to

SERVER-66719 dbCheck FCV lock upgrade causes deadlock with setFCV

Closed

SERVER-66213 setFCV may need to wait for transactionLifetimeLimitSeconds

Open


Assignee: Gregory Noma
Reporter: Cheahuychou Mao
Participants: Cheahuychou Mao, Githook User, Gregory Noma
Votes: 0
Watchers: 22

Created: Apr 20 2022 02:14:31 PM UTC
Updated: Oct 29 2023 09:39:13 PM UTC
Resolved: May 09 2022 07:27:07 PM UTC
Confidence Status Last Update: 21/Apr/22 3:26 PM