Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 3.4.0-rc5, 3.5.1
Affects Version/s: None
Component/s: None
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Completed:

3.4.0-rc5
Steps To Reproduce:

Run the jstests/sharding/cursor_timeout.js test with --repeat. I was able to reproduce in under 200 iterations on my Mac.

Sprint:
Query 2016-12-12
Linked BF Score:
0
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

The sharding/cursor_timeout.js test sets cursorTimeoutMillis to the same interval as clientCursorMonitorFrequencySecs. As a result, a cursor can time out the first time the clientcursormon thread wakes up from its clientCursorMonitorFrequencySecs sleep, no matter how recently the cursor was opened. The elapsed time passed to the timeout check is based on a clientcursormon-level timer, not on the cursor's own age, and will never be less than clientCursorMonitorFrequencySecs.

In light of the above, the sequence of events that causes this test failure is:

1. A find() is run that returns a subset of the result set and leaves an open cursor.
2. Almost immediately afterward (on the order of <1ms), the clientcursormon thread wakes up from a 1 second sleep and attempts to kill expired cursors.
3. The clientcursormon thread passes its timer value to the kill method as the elapsed time. In my testing this was roughly 1004ms.
4. The cursor is killed after being open for only a few milliseconds.
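
The sequence above can be sketched as a small model. This is only an illustration of the behavior as described in this ticket, not the server's code: the `monitorPass` function and the `creditedIdleMillis` field are hypothetical names, and the sketch assumes that each monitor pass credits every open cursor with the monitor's own elapsed time.

```javascript
// Simplified model of the race described above. This is NOT the server's code.
// Assumption: on each pass the monitor credits every open cursor with the
// monitor's own elapsed time, and a cursor is killed once that credited idle
// time exceeds cursorTimeoutMillis.
function monitorPass(cursor, monitorElapsedMillis, cursorTimeoutMillis) {
    cursor.creditedIdleMillis += monitorElapsedMillis;
    cursor.killed = cursor.creditedIdleMillis > cursorTimeoutMillis;
    return cursor.killed;
}

// A cursor opened <1ms before the monitor wakes up has not been idle at all yet.
const freshCursor = {creditedIdleMillis: 0, killed: false};

// The monitor slept ~1004ms, and the test sets cursorTimeoutMillis to 1000.
const killedImmediately = monitorPass(freshCursor, 1004, 1000);
console.log(killedImmediately);  // true
```

Under this model the cursor's real age never enters the check, which is why a cursor that is only a few milliseconds old can still be killed.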

A quick fix for the test would be to increase cursorTimeoutMillis to 2000. This should give what I expect was the intended behavior: the cursor is killed only after at least 1 second has passed (in practice, somewhere between 1 and 2 seconds after it was opened).
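
The "between 1 and 2 seconds" window can be checked with some quick arithmetic. The helper below is a hypothetical sketch, under the same assumption that each monitor pass credits the cursor with one full monitor period and that the kill happens once the credited time strictly exceeds the timeout. The 1004ms period is the value observed in testing above.

```javascript
// Hypothetical helper: estimates how long a cursor really lives before being
// killed, assuming each monitor pass credits it with one full monitor period.
function killWindow(cursorTimeoutMillis, monitorPeriodMillis) {
    // Smallest number of passes whose credited time strictly exceeds the timeout.
    const passes = Math.floor(cursorTimeoutMillis / monitorPeriodMillis) + 1;
    return {
        passes,
        // Cursor opened just before a pass: the first pass arrives almost at once.
        minLifetimeMillis: (passes - 1) * monitorPeriodMillis,
        // Cursor opened just after a pass: it waits a full period for each pass.
        maxLifetimeMillis: passes * monitorPeriodMillis,
    };
}

console.log(killWindow(1000, 1004));  // { passes: 1, minLifetimeMillis: 0, maxLifetimeMillis: 1004 }
console.log(killWindow(2000, 1004));  // { passes: 2, minLifetimeMillis: 1004, maxLifetimeMillis: 2008 }
```

With a 1000ms timeout the minimum lifetime is effectively zero, which matches the failure. With 2000ms the cursor survives at least one full monitor period.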

We may also want to consider failing startup when 0 < cursorTimeoutMillis <= (clientCursorMonitorFrequencySecs * 1000), which is the configuration this test uses, or at a minimum perform an audit to make sure no other tests set up cursorTimeoutMillis in this manner.
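
A minimal sketch of such a check is shown below. The function name `validateCursorTimeoutSettings` and the error message are hypothetical and do not correspond to any existing server or test-harness API. The same predicate could also be used when auditing test configurations.

```javascript
// Hypothetical validation helper; not an existing server or shell function.
// Throws when cursorTimeoutMillis is positive but no larger than the monitor
// period, i.e. when 0 < cursorTimeoutMillis <= clientCursorMonitorFrequencySecs * 1000.
function validateCursorTimeoutSettings(cursorTimeoutMillis, clientCursorMonitorFrequencySecs) {
    const monitorPeriodMillis = clientCursorMonitorFrequencySecs * 1000;
    if (cursorTimeoutMillis > 0 && cursorTimeoutMillis <= monitorPeriodMillis) {
        throw new Error(
            "cursorTimeoutMillis (" + cursorTimeoutMillis + ") must be greater than " +
            "clientCursorMonitorFrequencySecs * 1000 (" + monitorPeriodMillis + ")");
    }
}

// The configuration used by the failing test is rejected.
try {
    validateCursorTimeoutSettings(1000, 1);
} catch (e) {
    console.log("rejected: " + e.message);
}

// The proposed quick fix passes validation.
validateCursorTimeoutSettings(2000, 1);
```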

Assignee: James Wahlin
Reporter: James Wahlin
Participants: Githook User, James Wahlin
Votes: 0
Watchers: 2

Created: Nov 17 2016 04:51:46 PM UTC
Updated: Dec 28 2016 04:21:38 PM UTC
Resolved: Nov 22 2016 02:03:30 PM UTC
Confidence Status Last Update: 21/Nov/16 4:46 PM
