Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 3.4.4, 3.5.5
Affects Version/s: None
Component/s: Replication
Labels:
- bkp

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:
v3.4
Sprint:
Repl 2017-03-27
Linked BF Score:
0
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

The replication coordinator stops bgsync, which in turn stops the oplog fetcher if one is running. The oplog fetcher needs the current term and the last committed optime to make new requests. Together, these two paths can deadlock:

Replication coordinator, while holding replCoord's mutex, waits on oplog fetcher's mutex to stop it.
Oplog fetcher, while holding its mutex, waits on replCoord's mutex to get the current term and the last committed optime.

To fix this, we need to move the retrieval of the current term and last committed optime out from under the oplog fetcher's mutex.
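The lock ordering behind the proposed fix can be sketched as follows. This is a minimal illustration, not the actual server code: `ReplCoord`, `OplogFetcher`, `Snapshot`, `getSnapshot`, `stopFetcher`, and `makeNextRequest` are hypothetical names, and optimes are reduced to plain integers. The fetcher reads the term and last committed optime before taking its own mutex, so it never holds its mutex while waiting on the coordinator's.

```cpp
#include <cassert>
#include <mutex>

// Hypothetical stand-ins for illustration only; not MongoDB's real classes.
class OplogFetcher;

class ReplCoord {
public:
    struct Snapshot {
        long long term;
        long long lastCommittedOpTime;
    };

    // Takes the coordinator's mutex briefly and returns a copy of the state.
    Snapshot getSnapshot() {
        std::lock_guard<std::mutex> lk(_mutex);
        return Snapshot{_term, _lastCommittedOpTime};
    }

    // Models the stop path from the description: the coordinator's mutex is
    // held while waiting on the fetcher's mutex.
    void stopFetcher(OplogFetcher& fetcher);

private:
    std::mutex _mutex;
    long long _term = 1;
    long long _lastCommittedOpTime = 0;
};

class OplogFetcher {
public:
    explicit OplogFetcher(ReplCoord& replCoord) : _replCoord(replCoord) {}

    // Returns false once the fetcher has been shut down.
    bool makeNextRequest() {
        // Read coordinator state BEFORE locking our own mutex. Doing this
        // under _mutex would recreate the cycle described in the ticket.
        const ReplCoord::Snapshot snap = _replCoord.getSnapshot();

        std::lock_guard<std::mutex> lk(_mutex);
        if (!_active) {
            return false;
        }
        _lastRequestTerm = snap.term;
        _lastRequestCommitted = snap.lastCommittedOpTime;
        ++_requestsMade;
        return true;
    }

    void shutdown() {
        std::lock_guard<std::mutex> lk(_mutex);
        _active = false;
    }

    bool isActive() {
        std::lock_guard<std::mutex> lk(_mutex);
        return _active;
    }

    int requestsMade() {
        std::lock_guard<std::mutex> lk(_mutex);
        return _requestsMade;
    }

private:
    ReplCoord& _replCoord;
    std::mutex _mutex;
    bool _active = true;
    long long _lastRequestTerm = 0;
    long long _lastRequestCommitted = 0;
    int _requestsMade = 0;
};

inline void ReplCoord::stopFetcher(OplogFetcher& fetcher) {
    std::lock_guard<std::mutex> lk(_mutex);  // coordinator mutex held
    fetcher.shutdown();                      // then the fetcher mutex
    assert(!fetcher.isActive());
}
```

In the real scenario `stopFetcher` and `makeNextRequest` run on different threads. With this ordering, the only path that holds both mutexes at once is the stop path (coordinator first, then fetcher), so no cycle can form. One trade-off is that the snapshot may be slightly stale by the time the request is built.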

related to

SERVER-27120 Increase synchronization between producer/applier threads and stepdown/stepup

Closed

Assignee: Siyuan Zhou
Reporter: Siyuan Zhou
Participants: Githook User, Siyuan Zhou
Votes: 0
Watchers: 6

Created: Mar 03 2017 12:03:37 AM UTC
Updated: Sep 07 2017 05:07:59 AM UTC
Resolved: Mar 22 2017 08:03:58 PM UTC
