Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 2.4.6, 2.5.1
Affects Version/s: 2.4.4
Component/s: Replication
Labels:
None
Environment:
Tested on Linux amd64, Ubuntu (10.04 Lucid and 13.04 Raring)

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Steps To Reproduce:

Hide

Run this test script with -vv: https://gist.github.com/nelhage/5777797

Observe the failovers are taking 10-12s, even though the cluster is healthy, unloaded, and running across the local interface.

Look at the logs for a node that has become primary. Observe that after the election completes, the server logs "replSet waiting for replication to finish before becoming primary", and does not actually step up for up to 10 seconds later.

Show
Run this test script with -vv: https://gist.github.com/nelhage/5777797 Observe the failovers are taking 10-12s, even though the cluster is healthy, unloaded, and running across the local interface. Look at the logs for a node that has become primary. Observe that after the election completes, the server logs "replSet waiting for replication to finish before becoming primary", and does not actually step up for up to 10 seconds later.
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

With a healthy replset, failovers via rs.stepDown() take 10+s,
even when the election completes in 1-2s.

After an rs.steDown(), a healthy moderately-loaded replset on a
good network will normally complete an election for an new
primary normally within 1-2s. However, before the new primary can
step up, it has to stop the background sync thread.

Suppose (as is usually/often the case) that the new primary was
syncing from the stepped-down node. When that node stepped down,
it closed all connections, including the connection the new
primary was using to read the oplog. This triggered a DBException
inside of _producerThread, triggering this code:

sethbmsg(str::stream() << "db exception in producer: " << e.toString());
sleepsecs(10);

When the new primary goes to stop the rsBackgroundSync thread,
the thread is caught in the middle of that sleep, and so we end
up literally waiting on a sleep(10) before the cluster has a new
primary.

is duplicated by

SERVER-9944 assumePrimary() waits 10 seconds (for nothing?)

Closed

SERVER-9464 On election, relax priority restriction when another member is fresher

Closed

related to

SERVER-10225 Replica set failover speed improvement

Closed

Assignee:: Eric Milkie
Reporter:: Nelson Elhage
Participants:: Aristarkh Zagorodnikov, auto, Eric Milkie, Matt Dannenberg, Nelson Elhage
Votes:: 1 Vote for this issue
Watchers:: 10 Start watching this issue

Created:: Jun 13 2013 10:23:46 PM UTC
Updated:: Jul 11 2016 05:40:39 PM UTC
Resolved:: Jun 29 2013 01:04:22 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates