Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 5.0.0-rc0
Affects Version/s: None
Component/s: Sharding
Labels:
- PM-234-M3
- PM-234-T-error-flow

Backwards Compatibility: Fully Compatible
Sprint: Sharding 2021-04-05
Story Points: 2
Confidence Status: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name: None
Goal Link: None

Task executors are allowed to refuse work, so the .onCompletion() continuation won't run if the task executor has been shut down. This is especially problematic for the ReshardingCollectionCloner after the changes from SERVER-54959, because the noCursorTimeout cursor will be permanently leaked on stepdown. We should instead use the RecipientStateMachine::getInstanceCleanupExecutor() to run the .onCompletion() continuation.

ReshardingCollectionCloner::run() and ReshardingTxnCloner::run() should be changed to additionally accept the cleanup task executor and to return a SemiFuture<void>, so that the caller must explicitly call .thenRunOn(**executor) to chain any further continuations:

.on(executor, cancelToken)
.thenRunOn(cleanupExecutor)
.onCompletion([chainCtx](Status status) {
    if (chainCtx->pipeline) {
        // Use a separate Client to make a better effort of calling dispose() even when the
        // CancelationToken has been canceled.
        auto serviceContext = cc().getServiceContext();
        auto clientStrand = ClientStrand::make(
            serviceContext->makeClient("ReshardingCollectionClonerCleanup"));
        auto clientGuard = clientStrand->bind();

        auto opCtx = clientGuard->makeOperationContext();
        chainCtx->pipeline->dispose(opCtx.get());
        chainCtx->pipeline.reset();
    }

    return status;
})
.semi();
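
The failure mode and the fix can be modelled with a toy executor. This is a minimal sketch, not the server's code: ToyExecutor, ToyCursor, and cursorLeakedAfterCleanupOn are hypothetical names, tasks run inline rather than asynchronously, and the sketch assumes the cleanup executor is still accepting work after stepdown.

```cpp
#include <cassert>
#include <functional>

// Toy stand-in for a task executor (hypothetical; NOT MongoDB's TaskExecutor).
// Tasks run inline, and work is refused once shutdown() has been called.
class ToyExecutor {
public:
    // Returns false when the executor refuses the work; the task is then dropped.
    bool schedule(const std::function<void()>& task) {
        if (_shutDown) {
            return false;
        }
        task();
        return true;
    }

    void shutdown() {
        _shutDown = true;
    }

private:
    bool _shutDown = false;
};

// Toy stand-in for a resource that must be disposed explicitly, such as the
// pipeline holding the noCursorTimeout cursor.
struct ToyCursor {
    bool open = true;

    void dispose() {
        open = false;
    }
};

// Schedules the cleanup continuation on `executor` and reports whether the
// cursor is still open afterwards, i.e. whether it was leaked.
bool cursorLeakedAfterCleanupOn(ToyExecutor& executor) {
    ToyCursor cursor;
    assert(cursor.open);

    // If the executor refuses the work, dispose() is never called.
    executor.schedule([&cursor] { cursor.dispose(); });
    return cursor.open;
}
```

Calling cursorLeakedAfterCleanupOn() with an executor that has already been shut down reports a leak, while calling it with an executor that is still accepting work does not. That is the reason for routing the cleanup continuation through a separate executor that outlives stepdown.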

Assignee: Janna Golden
Reporter: Max Hirschhorn
Participants: Githook User, Janna Golden, Max Hirschhorn
Votes: 0
Watchers: 2
Created: Mar 19 2021 03:02:06 AM UTC
Updated: Oct 29 2023 09:56:05 PM UTC
Resolved: Mar 30 2021 07:17:57 PM UTC
Confidence Status Last Update: 25/Mar/21 2:10 PM
GA Target Date: None
Public Preview Target Date: None
Private Preview Target Date: None
Experiment Target Date: None
