Loading...

XML

Word

Printable

JSON

Type: Epic
Resolution: Won't Do
Priority: Major - P3
Fix Version/s: None
Component/s: None
Labels:
- init-53-m1.0

Epic Name:
Clustered indexes for Time-series collections
Documentation Changes:
None

Quarter:
- FY22Q2
Detailed Project Statuses:
None

Server Compat:
- 5.0
Driver Changes:
Needed

Downstream Change Summary

There may be downstream impacts for this project. We will update with potential impacts after we move to design.

Description of Linked Ticket

Epic Summary

Make the RecordStore for a collection a mapping from _id key to BSON document, instead of a mapping from RecordId to BSON document. This will then allow us to remove the separate _id index.

Motivation

Queries that currently use the _id index will only need to do one read instead of two.
Inserts and deletes also have one less index to update.
Range queries on _id will be able to use a collection scan rather than index scan with fetch stage. This allows for far more efficient sequential storage access where random order access is required now.
Range removes on _id allow for efficient truncations that avoid reading data to be removed if there are no other indexes.
Significantly speed up chunk migrations on sharded clusters.
Improve usability of MongoDB for timeseries data.

Doing a smaller internal-only project first, that leaves full support for sharded collections with shard keys other than _id as future work, helps evaluating performance gains before committing to implementing the more expensive future work, and allows answering design questions for that work.

depends on

CDRIVER-3898 Clustered indexes for Time-series collections

Development Complete

CSHARP-3401 Clustered indexes for Time-series collections

Development Complete

CXX-2189 Clustered indexes for Time-series collections

Development Complete

GODRIVER-1872 Clustered indexes for Time-series collections

Development Complete

JAVA-3998 Clustered indexes for Time-series collections

Development Complete

MOTOR-670 Clustered indexes for Time-series collections

Development Complete

NODE-3096 Clustered indexes for Time-series collections

Development Complete

PHPC-1767 Clustered indexes for Time-series collections

Development Complete

PYTHON-2558 Clustered indexes for Time-series collections

Development Complete

RUBY-2530 Clustered indexes for Time-series collections

Development Complete

RUST-667 Clustered indexes for Time-series collections

Development Complete

(6 depends on)

Assignee:: Unassigned
Reporter:: Backlog - Core Eng Program Management Team
Votes:: 0 Vote for this issue
Watchers:: 2 Start watching this issue

Created:: Jan 08 2021 04:55:46 PM UTC
Updated:: Jul 19 2024 02:45:32 AM UTC
Resolved:: Feb 16 2021 05:20:18 PM UTC
Target start:: None
Target end:: None
Start date:: None
End date:: None

Details

Description

Description of Linked Ticket

Attachments

Issue Links

Activity

People

Dates