- Type: Bug
- Resolution: Duplicate
- Priority: Major - P3
- Fix Version/s: None
- Affects Version/s: 3.0.5
- Component/s: Querying
- Labels: None
- Operating System: ALL
Querying one hour's worth of data (about 10,000 documents) takes more than 20 minutes, and these queries can use up the server's 100 GB of memory.
Details:
I set up a server with the WiredTiger engine, configured like this:

wiredTiger:
  engineConfig:
    cacheSizeGB: 100
    statisticsLogDelaySecs: 0
    journalCompressor: snappy
    directoryForIndexes: true
  collectionConfig:
    blockCompressor: snappy
  indexConfig:
    prefixCompression: true
I have a collection named 'order'. The documents look like:

{
    order_id: 485548,
    order_status: [
        { update_time: ISODate("2015-10-28T02:45:00Z"), status: 1 },
        { update_time: ISODate("2015-10-28T03:18:00Z"), status: 2 }
    ]
}
I built an index:
"key" : {"order_status.update_time" : -1, "order_status.status" : -1}
Then I run this query:

db.order.find({
    "order_status": {
        "$elemMatch": {
            "update_time": {
                "$gte": ISODate("2015-10-28T02:00:00Z"),
                "$lt": ISODate("2015-10-28T03:00:00Z")
            },
            "status": 2
        }
    }
}).explain();
I get:

"indexBounds" : {
    "order_status.update_time" : [ "[new Date(1446001200000), true)" ],
    "order_status.status" : [ "[2.0, 2.0]" ]
}
I guess this means it scans all data before '2015-10-28T03:00:00Z', which is more than 40 GB.
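One way to confirm how much is actually scanned (a sketch using the standard explain output fields) is to rerun the query with executionStats verbosity:

var pred = { "order_status": { "$elemMatch": {
    "update_time": { "$gte": ISODate("2015-10-28T02:00:00Z"),
                     "$lt": ISODate("2015-10-28T03:00:00Z") },
    "status": 2 } } };
db.order.find(pred).explain("executionStats")
// A large gap between executionStats.totalKeysExamined /
// totalDocsExamined and executionStats.nReturned would confirm
// that far more than the matching hour of data is being scanned.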
After I change the query to:

db.order.find({
    "order_status": {
        "$elemMatch": {
            "update_time": { "$gte": ISODate("2015-10-28T02:00:00Z") },
            "status": 2
        }
    }
}).explain();
I get:

"indexBounds" : {
    "order_status.update_time" : [ "[new Date(9223372036854775807), new Date(1445997600000)]" ],
    "order_status.status" : [ "[2.0, 2.0]" ]
}
This looks better and runs much faster (it finishes within a second), but how can I get my original query to work?
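If the slowness comes from plan selection (per the SERVER-20616 duplicate below) rather than from the bounds themselves, pinning the query to the index with hint() might be worth trying; a sketch, not a confirmed fix:

db.order.find({
    "order_status": {
        "$elemMatch": {
            "update_time": {
                "$gte": ISODate("2015-10-28T02:00:00Z"),
                "$lt": ISODate("2015-10-28T03:00:00Z")
            },
            "status": 2
        }
    }
}).hint({ "order_status.update_time": -1, "order_status.status": -1 })
// hint() only forces which index is used; it does not tighten the
// index bounds reported in the first explain() output above.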
Issue Links:
- duplicates: SERVER-20616 Plan ranker sampling from the beginning of a query's execution can result in poor plan selection (Backlog)
- related to: SERVER-15086 Allow for efficient range queries over non-array fields in multikey indices (Closed)