Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 2.2.6, 3.0.6, 3.1.8
Component/s: Querying, Sharding
Labels:
None

Assigned Teams:

Query Optimization
Operating System:
ALL
Steps To Reproduce:
Hide

Start a ShardingTest as:

// sharding test MUST be started with noAutoSplit var st = new ShardingTest({ mongos : 2, shards : 3, rs : { nodes : 3 }, other : { mongosOptions : { noAutoSplit: "" } } })

Then connect to a mongos in the cluster and run the following:

var db = db.getSiblingDB("test"); // insert some data for (var i=0; i<1000; i++) { db.foo.insert({ _id : i }); } // enable sharding db.adminCommand({ enableSharding: db.getName() }); db.adminCommand({ shardCollection: "test.foo", key: { _id: 1 } }); // create two chunks db.adminCommand({ split : "test.foo", middle: { _id : 1000 } }); // put the two chunks on separate shards db.adminCommand({ moveChunk : "test.foo", find : { _id : 0 }, to : "test-rs0" }); db.adminCommand({ moveChunk : "test.foo", find : { _id : 1001 }, to : "test-rs1" }); // shows that both test-rs0 and test-rs1 are targeted // even though the chunks are [-inf, 1000) and [1000, +inf) printjson(db.foo.find({ _id : { $lt : 1000 } }).explain());
Show
Start a ShardingTest as: // sharding test MUST be started with noAutoSplit var st = new ShardingTest({ mongos : 2, shards : 3, rs : { nodes : 3 }, other : { mongosOptions : { noAutoSplit: "" } } }) Then connect to a mongos in the cluster and run the following: var db = db.getSiblingDB( "test" ); // insert some data for ( var i=0; i<1000; i++) { db.foo.insert({ _id : i }); } // enable sharding db.adminCommand({ enableSharding: db.getName() }); db.adminCommand({ shardCollection: "test.foo" , key: { _id: 1 } }); // create two chunks db.adminCommand({ split : "test.foo" , middle: { _id : 1000 } }); // put the two chunks on separate shards db.adminCommand({ moveChunk : "test.foo" , find : { _id : 0 }, to : "test-rs0" }); db.adminCommand({ moveChunk : "test.foo" , find : { _id : 1001 }, to : "test-rs1" }); // shows that both test-rs0 and test-rs1 are targeted // even though the chunks are [-inf, 1000) and [1000, +inf) printjson(db.foo.find({ _id : { $lt : 1000 } }).explain());

Chunks are inclusive at the lower bound and exclusive at the upper bound.

However, find queries over a range of the form

{ $lt : X }

where X is the upper bound of a chunk also targets the shard containing the chunk whose lower bound is X (at least according to find().explain()).

Note that a point query for X will only (and correctly) target the shard with the chunk whose lower bound is X.
Similarly, a query of the form

{ $lte : X }

will (correctly) target the shard for both chunks.

This is undesirable both from a performance perspective, since an additional shard is unnecessarily targeted in this situation, and a testing perspective, since .explain() cannot be used to verify that all documents within a chunk's range lie only on the shard the chunk is expected to be on.

is duplicated by

SERVER-5365 range query on shard key incorrectly sends query to extra shard(s)

Closed

SERVER-38971 Single shard query hits multiple shard if it includes end of chunk range

Closed

SERVER-51731 ChunkManager::getShardIdsForRange should not hardcode isMaxInclusive to true

Closed

SERVER-4791 shard selection code ignores bound inclusivity

Closed

Assignee:: [DO NOT USE] Backlog - Query Optimization

Reporter:: Esha Maharishi (Inactive)

Participants:: [DO NOT USE] Backlog - Query Optimization, David Storch, Esha Maharishi, James Hartig, Randolph Tan, Spencer Brody

Votes:: 1 Vote for this issue

Watchers:: 18 Start watching this issue

Created:: Oct 05 2015 09:48:57 PM UTC

Updated:: Dec 06 2022 04:43:01 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates