Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Component/s: Sessions
Labels:

Driver Changes:
Needed
Quarter:
- FY25Q3
Downstream Changes Summary:
Hide

Summary of necessary driver changes

Drivers MUST stop gossiping $clusterTime on SDAM commands. This means we no longer send $clusterTime with heartbeat commands and we no longer parse $clusterTime from heartbeat responses.

Implement the Sessions spec prose test titled "20. Drivers do not gossip $clusterTime on SDAM commands." added here: https://github.com/mongodb/specifications/tree/master/source/sessions/tests#20-drivers-do-not-gossip-clustertime-on-sdam-commands

Commits for syncing spec/prose tests

Spec commit:
Message: DRIVERS-2798 Drivers do not gossip $clusterTime on SDAM commands (#1735)
Branch: master
https://github.com/mongodb/specifications/commit/b76cbf96807d96aa8ae4337a3f2b3c9be39d3487

Python implementation:
Message: ~~PYTHON-4579~~ Stop gossiping $clusterTime on SDAM connections (#1925)
Branch: master
https://github.com/mongodb/mongo-python-driver/commit/85ca6f1d9fa71badeeee2b80db7ec89dc4bef0f4
Show
Summary of necessary driver changes Drivers MUST stop gossiping $clusterTime on SDAM commands. This means we no longer send $clusterTime with heartbeat commands and we no longer parse $clusterTime from heartbeat responses. Implement the Sessions spec prose test titled "20. Drivers do not gossip $clusterTime on SDAM commands." added here: https://github.com/mongodb/specifications/tree/master/source/sessions/tests#20-drivers-do-not-gossip-clustertime-on-sdam-commands Commits for syncing spec/prose tests Spec commit: Message: DRIVERS-2798 Drivers do not gossip $clusterTime on SDAM commands (#1735) Branch: master https://github.com/mongodb/specifications/commit/b76cbf96807d96aa8ae4337a3f2b3c9be39d3487 Python implementation: Message: PYTHON-4579 Stop gossiping $clusterTime on SDAM connections (#1925) Branch: master https://github.com/mongodb/mongo-python-driver/commit/85ca6f1d9fa71badeeee2b80db7ec89dc4bef0f4

Driver Compliance:

$i18n.getText("admin.common.words.hide")

Key	Status/Resolution	FixVersion
CDRIVER-5643	Backlog
CXX-3079	Backlog
CSHARP-5204	Done	3.4.0
GODRIVER-3288	Backlog
JAVA-5546	Backlog
NODE-6293	Backlog
MOTOR-1347	Duplicate
PYTHON-4579	Fixed	4.12
PHPC-2529	Blocked
RUBY-3523	Backlog
RUST-2005	Fixed	3.3.0

$i18n.getText("admin.common.words.show")

#scriptField, #scriptField *{ border: 1px solid black; } #scriptField{ border-collapse: collapse; } #scriptField td { text-align: center; /* Center-align text in table cells */ } #scriptField td.key { text-align: left; /* Left-align text in the Key column */ } #scriptField a { text-decoration: none; /* Remove underlines from links */ border: none; /* Remove border from links */ } /* Add green background color to cells with FixVersion */ #scriptField td.hasFixVersion { background-color: #00FF00; /* Green color code */ } #scriptField td.willNotDo { background-color: #FF0000; /* Red color code */ } /* Center-align the first row headers */ #scriptField th { text-align: center; } Key Status/Resolution FixVersion CDRIVER-5643 Backlog CXX-3079 Backlog CSHARP-5204 Done 3.4.0 GODRIVER-3288 Backlog JAVA-5546 Backlog NODE-6293 Backlog MOTOR-1347 Duplicate PYTHON-4579 Fixed 4.12 PHPC-2529 Blocked RUBY-3523 Backlog RUST-2005 Fixed 3.3.0

Summary

In unusual situations, gossiping the cluster time received on monitoring connections results in complete loss of availability and requires an application restart. The problem was traced to a temporary state during which the driver attempts to connect to a member of the wrong replica set running on the same pod. Since cluster times between deployments are not compatible, it results in all operations failing until the application is restarted.

Motivation

Who is the affected end user?

We only have one report of this, in ~~JAVA-5256~~. Please see that ticket for details, as they are quite involved.

How does this affect the end user?

Availability is completely compromised and an application restart is required.

How likely is it that this problem or use case will occur?

It's certainly unusual, as we have not heard other reports of this from people using our Kubernetes operator. On the other hand, the fix is likely simple for most drivers, though testing is an issue (there are probably no tests of the existing behavior)

If the problem does occur, what are the consequences and how severe are they?

Complete loss of availability to the desired cluster.

Is this issue urgent?

The user has no simple workaround, but it is possible to work around

Is this ticket required by a downstream team?

Is this ticket only for tests?

Acceptance Criteria

The requirement is for a clarification to the sessions specification, saying that cluster time gossiping should be limited to pooled connections and should not include monitoring connections. It's unclear though how a test could be written. In a POC of this in the Java driver, it was achieved by a simple design change that made it impossible to gossip the cluster time for monitoring connections, but it's certainly possible that a future design change could reverse that and the issue could be re-introduced.

Additional Notes

Gossiping of cluster time has been a bit of a mystery to many driver engineers, as the specification contains no rationale for it. Discussions with server engineers recently have revealed the following justification:

In a sharded cluster, each shard has an independent monotonically increasing logical clock
Every write on the shard includes the current logical clock time
The gossiping pushes the logical clock forward to just past the gossiped time
This means that a client thread that does a write that targets shard A, then a subsequent write to shard B, will result in the second write having a later time than the first write
This in turn means that the first write will precede the second write in various operations which create a total ordering of write operations. A change stream is the primary example.

Since monitoring connections are never used for writes, there is no benefit to gossiping cluster times from those connections

is related to

JAVA-5256 Switching replicas IP with a replica from a different replicaset can result in java driver obtaining HMAC keyId from the different replicaset

Closed

DRIVERS-3118 Add tests to ensure drivers advance $clusterTime from command error responses

Needs Triage

split to

CDRIVER-5643 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

CXX-3079 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

GODRIVER-3288 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

JAVA-5546 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

NODE-6293 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

RUBY-3523 Gossiping the cluster time from monitoring connections can result in loss of availability

Backlog

PHPC-2529 Gossiping the cluster time from monitoring connections can result in loss of availability

Blocked

CSHARP-5204 Gossiping the cluster time from monitoring connections can result in loss of availability

Closed

MOTOR-1347 Gossiping the cluster time from monitoring connections can result in loss of availability

Closed

PYTHON-4579 Gossiping the cluster time from monitoring connections can result in loss of availability

Closed

RUST-2005 Gossiping the cluster time from monitoring connections can result in loss of availability

Closed

(8 split to)

Assignee:: Shane Harvey
Reporter:: Jeffrey Yemin
Engineering Lead:: Jeffrey Yemin
Program Manager:: KeAna Moutra (Inactive)
Votes:: 0 Vote for this issue
Watchers:: 7 Start watching this issue

Created:: Dec 18 2023 11:43:03 PM UTC
Updated:: Feb 28 2025 10:33:06 PM UTC
Start date:: 11/Oct/24

Details

Description

Summary

Motivation

Who is the affected end user?

How does this affect the end user?

If the problem does occur, what are the consequences and how severe are they?

Is this issue urgent?

Is this ticket required by a downstream team?

Acceptance Criteria

Additional Notes

Attachments

Issue Links

Activity

People

Dates