Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.5.12
Affects Version/s: None
Component/s: Replication
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
Repl 2017-08-21
Linked BF Score:
0
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

When running in master-slave mode, the replication "commit point" never advances, because this mode does not use a consensus protocol. For replication recovery and rollback purposes, the ReplicationCoordinator maintains a list of "stableTimestampCandidates", which receives a new timestamp every time we update our lastApplied optime. The current "stable" timestamp is calculated as the largest timestamp in this list that is less than the commit point, and timestamps are removed from the list once they are less than the current stable timestamp. If we keep adding timestamps to the list but the commit point never advances, nothing is ever removed and the stableTimestampCandidates list grows without bound. This causes performance problems, since every incoming operation inserts into an ever-larger list. It caused a test that inserts a few hundred thousand documents to time out (see linked BF).
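The growth described above can be reproduced with a small toy model. This is only a sketch under stated assumptions: the class and method names (StableTimestampTracker, on_last_applied, advance_commit_point) are hypothetical, timestamps are plain integers, and the comparison follows the wording of this description ("less than the commit point") rather than the actual server code.

```python
import bisect


class StableTimestampTracker:
    """Toy model of the candidate list; names are hypothetical, not the server's API."""

    def __init__(self):
        self.candidates = []   # kept sorted in ascending order
        self.commit_point = 0  # stays at 0 if no consensus protocol advances it
        self.stable = None

    def on_last_applied(self, ts):
        # Every lastApplied update adds a candidate.
        bisect.insort(self.candidates, ts)
        self._recompute_stable()

    def advance_commit_point(self, ts):
        self.commit_point = max(self.commit_point, ts)
        self._recompute_stable()

    def _recompute_stable(self):
        # Index of the first candidate that is >= the commit point.
        idx = bisect.bisect_left(self.candidates, self.commit_point)
        if idx == 0:
            return  # no candidate is below the commit point, so nothing can be pruned
        # Largest candidate strictly less than the commit point.
        self.stable = self.candidates[idx - 1]
        # Drop every candidate that is less than the stable timestamp.
        del self.candidates[: idx - 1]


# With an advancing commit point the list stays small.
replset = StableTimestampTracker()
for ts in range(1, 6):
    replset.on_last_applied(ts)
replset.advance_commit_point(4)

# With a commit point that never moves, every candidate is retained.
master_slave = StableTimestampTracker()
for ts in range(1, 1001):
    master_slave.on_last_applied(ts)
```

In the second case the list holds one entry per applied operation, which matches the unbounded growth reported here.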

To fix this, we should check whether we are running in master-slave mode and, if so, refrain from updating the stable timestamp candidates list.
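A minimal sketch of the proposed guard, assuming a hypothetical Coordinator class with a master_slave flag; it illustrates the idea only and is not the actual patch.

```python
class Coordinator:
    """Hypothetical stand-in for the coordinator; not the real server class."""

    def __init__(self, master_slave):
        self.master_slave = master_slave
        self.last_applied = None
        self.stable_timestamp_candidates = []

    def set_last_applied(self, ts):
        self.last_applied = ts
        if self.master_slave:
            # The commit point never advances in this mode, so a candidate
            # added here would never be pruned. Skip the list entirely.
            return
        self.stable_timestamp_candidates.append(ts)


guarded = Coordinator(master_slave=True)
unguarded = Coordinator(master_slave=False)
for ts in range(1, 1001):
    guarded.set_last_applied(ts)
    unguarded.set_last_applied(ts)
```

lastApplied is still tracked in both modes; only the bookkeeping that depends on the commit point is skipped.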

related to

SERVER-29891 Roll Back to Checkpoint: Call setStableTimestamp() when commit point or last applied changes

Closed

Assignee:: Will Schultz
Reporter:: Will Schultz
Participants:: Eric Milkie, Githook User, Will Schultz
Votes:: 0
Watchers:: 3

Created:: Aug 10 2017 05:19:23 PM UTC
Updated:: Oct 30 2023 11:14:27 PM UTC
Resolved:: Aug 11 2017 11:13:44 AM UTC
Confidence Status Last Update:: 10/Aug/17 5:20 PM
