Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.2.5
Component/s: None
Labels:
None

Operating System:
ALL
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

We've set up a MongoDB 3.2.5 cluster on Microsoft Azure with 3 shards (each a replica sets with 3 servers) and a 3-server config set. We access the cluster via a router.

Filling and sharding smaller collections (~400 million documents) worked with sh.shardCollection(). However, after filling a new collection with ~1.8 billion documents we tried to shard this new collection but keep getting the following error:

{ "code" : 50, "ok" : 0, "errmsg" : "Operation timed out" }

Note: At first we also had timeouts with count() and chunk balancing. After decreasing the TCP keepalive on all servers in the cluster below 240 seconds (the Microsoft Azure timeout) - based on the info at https://docs.mongodb.org/manual/faq/diagnostics/#does-tcp-keepalive-time-affect-mongodb-deployments - counting and balancing worked again. However, sharding our big collection still does not work.

Might be connected to Issue ~~SERVER-22392~~

duplicates

SERVER-23784 Don't use 30 second network timeout on commands sent to shards through the ShardRegistry

Closed

Assignee:: Unassigned
Reporter:: Jörg Rech
Participants:: Daniel Pasette, Jörg Rech, Kelsey Schubert, Ramon Fernandez Marina
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Apr 20 2016 07:00:38 AM UTC
Updated:: Apr 22 2016 08:54:34 PM UTC
Resolved:: Apr 20 2016 01:41:33 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates