-
Type: Bug
-
Resolution: Duplicate
-
Priority: Major - P3
-
None
-
Affects Version/s: 4.4.4
-
Component/s: Networking
-
None
-
Service Arch
-
ALL
-
(copied to CRM)
When ServerDiscoveryMonitor::requestImmediateCheck is called, in some cases, we attempt to cancel the outstanding hello request and reschedule a new one.
Cancelling the previous request uses a CallbackHandle object that is set after scheduling the request, which happens after a delay.
This means the following sequence of events could occur:
- A call to ServerDiscoveryMonitor::requestImmediateCheck occurs, which calls _scheduleNextHello
- The task it schedules to do the exhaust command hangs before actually scheduling the command and setting the callback handle.
- A new call to requestImmediateCheck arrives. It tries to cancel the outstanding request, but the callback handle hasn't actually been set yet.
- The first request continues, setting the callback handle, which may or may not overwrite the existing one. Either way we end up with two concurrent exhaust commands running for the same host, one of which is no longer tracked or cancellable.
- is related to
-
SERVER-54738 Calls to ServerDiscoveryMonitor::requestImmediateCheck should be throttled
- Backlog