Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 5.0.4, 5.1.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v5.0
Sprint:
Execution Team 2021-06-14
Linked BF Score:
23
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

it's possible for the JournalFlusher to miss the killOp interrupt by timing the opCtx reset right: the killOp marks the JournalFlusher's opCtx killed, but then the JournalFlusher resets the opCtx and never throws the expected error.

The opId that the test fetches via currentOp is associated with the JournalFlusher's opCtx at that moment, and then the opCtx has changed by the time that the test tries to kill the journal flusher thread via killOp. It's a small window of time.

The test sets the JournalFlusher interval (how frequently it runs) to 500 ms. We could decrease the frequency (higher interval), but then we also need the run the JournalFlusher to run in order to get that error thrown.

I recommend a new FAILPOINT, to stop the JournalFlusher before the currentOp and then release it after the killOp is sent.

related to

SERVER-79810 make JournalFlusher::waitForJournalFlush() interruptible when waiting for write concern

Closed

Assignee:: Dianna Hohensee (Inactive)
Reporter:: Dianna Hohensee (Inactive)
Participants:: Dianna Hohensee, Githook User, Vivian Ge
Votes:: 0 Vote for this issue
Watchers:: 2 Start watching this issue

Created:: May 26 2021 08:46:27 PM UTC
Updated:: Apr 19 2024 04:44:41 PM UTC
Resolved:: Jun 03 2021 04:55:56 PM UTC
Confidence Status Last Update:: 02/Jun/21 4:44 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates