Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Internal Code, Networking
Labels:
- sharding-nyc-subteam2

Assigned Teams:

Cluster Scalability
Operating System:
ALL
Case:
Story Points:
6

It's possible that ServerDiscoveryMonitor::requestImmediateCheck can be called so frequently each subsequent request can cancel the previous request before it has a chance to run, leading to none of them ever succeeding.

This flag is supposed to short circuit rescheduling when there's already an outstanding 'hello' request, but that doesn't get set until after the request is actually scheduled, which can happen at a delay from the time requestImmediateCheck is called, so that doesn't help us in this case.

Note that this applies to both 4.4 and master so we should make sure any fix is backportable.

Acceptance criteria:

Unit test to demonstrate the problem and add throttle to fix the test.

related to

SERVER-54739 Race in ServerDiscoveryMonitor::requestImmediateCheck could lead to multiple outstanding exhaust requests

Closed

Assignee:: Unassigned

Reporter:: Matthew Saltz (Inactive)

Participants:: Matthew Saltz

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Created:: Feb 23 2021 10:51:03 PM UTC

Updated:: Nov 08 2024 02:42:38 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates