  1. Core Server
  2. SERVER-32692

Make zbigMapReduce.js, sharding_balance4.js, and bulk_shard_insert.js more resilient under slow machines

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major - P3
    • Fix Version/s: 4.3.1
    • Affects Version/s: None
    • Component/s: Sharding
    • Backwards Compatibility: Fully Compatible
    • Operating System: ALL
    • Backport Requested: v4.0, v3.6
    • Sprint: Sharding 2019-09-09, Sharding 2019-09-23, Sharding 2019-10-07
    • 37

      zbigMapReduce.js occasionally fails because more than 5 migrations manage to finish after the start of either of the two bulk writes it executes, so the write never establishes a shard version and the test fails. As was done for sharding_balance4.js in SERVER-28697, we should ignore a certain number of NoProgressMade errors so that the test fails less frequently.
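      A minimal sketch of what tolerating a bounded number of NoProgressMade errors could look like. The helper name `checkBulkWriteErrors`, the shape of the error objects, and the numeric error code are assumptions for illustration; inside the mongo shell the test would compare against `ErrorCodes.NoProgressMade` and read the errors from the bulk write result.

```javascript
// Assumed numeric value of ErrorCodes.NoProgressMade; a shell test would use
// the ErrorCodes.NoProgressMade constant instead of this literal.
const NO_PROGRESS_MADE = 82;

// Hypothetical helper: accepts an array of write errors ({code, errmsg}) and
// returns how many NoProgressMade errors were tolerated. Throws on any other
// error code, or when the number of NoProgressMade errors exceeds the limit.
function checkBulkWriteErrors(writeErrors, maxNoProgressMade) {
    let noProgressCount = 0;
    for (const err of writeErrors) {
        if (err.code !== NO_PROGRESS_MADE) {
            throw new Error("unexpected write error: " + JSON.stringify(err));
        }
        noProgressCount++;
    }
    if (noProgressCount > maxNoProgressMade) {
        throw new Error("too many NoProgressMade errors: " + noProgressCount);
    }
    return noProgressCount;
}
```

      The limit is a tuning knob: it trades a lower spurious failure rate against the chance of hiding a write path that genuinely never makes progress.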

      sharding_balance4.js and bulk_shard_insert.js occasionally fail because more than 10 migrations complete during the course of a find command, which exhausts mongos's retry attempts and fails the test. Modifying the tests to retry a couple of times on StaleShardVersion should make them fail less often.
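      The test-side retry described above might be sketched as follows. The function name `retryOnStaleShardVersion` and the numeric error code are assumptions, and the sketch presumes the failing read throws an error object carrying a `code` property.

```javascript
// Assumed numeric value of ErrorCodes.StaleShardVersion; a shell test would
// use the ErrorCodes.StaleShardVersion constant instead of this literal.
const STALE_SHARD_VERSION = 63;

// Hypothetical helper: runs `operation` up to `maxAttempts` times, retrying
// only when it throws an error whose code is StaleShardVersion. Any other
// error is rethrown immediately; the last stale error is rethrown once the
// attempts are exhausted.
function retryOnStaleShardVersion(operation, maxAttempts) {
    if (maxAttempts < 1) {
        throw new Error("maxAttempts must be at least 1");
    }
    let lastError;
    for (let attempt = 1; attempt <= maxAttempts; attempt++) {
        try {
            return operation();
        } catch (e) {
            if (e.code !== STALE_SHARD_VERSION) {
                throw e;
            }
            lastError = e;
        }
    }
    throw lastError;
}

// In a shell test this could wrap the read, for example:
//   retryOnStaleShardVersion(() => coll.find().itcount(), 3);
```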

      We can also consider making a generic override for read commands that retries on StaleShardVersion errors, so that it can be loaded (via load()) into tests that involve frequent migrations.
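      One possible shape for such an override, sketched as a wrapper around a command-running function. The names `withStaleShardVersionRetries` and `kRetryableReadCommands`, the list of read commands, and the numeric error code are all assumptions; in the shell the wrapper would presumably be installed over the connection's runCommand method, which this sketch does not do.

```javascript
// Assumed numeric value of ErrorCodes.StaleShardVersion.
const STALE_SHARD_VERSION_CODE = 63;

// Hypothetical list of read commands that are safe to retry.
const kRetryableReadCommands = new Set(["find", "aggregate", "count", "distinct"]);

// Wraps `runCommand(dbName, cmdObj)` so that read commands whose response is
// {ok: 0, code: StaleShardVersion} are re-run up to `maxRetries` more times.
// Other commands and other failures are returned unchanged.
function withStaleShardVersionRetries(runCommand, maxRetries) {
    return function(dbName, cmdObj) {
        // By convention the command name is the first key of the command object.
        const cmdName = Object.keys(cmdObj)[0];
        let res = runCommand(dbName, cmdObj);
        if (!kRetryableReadCommands.has(cmdName)) {
            return res;
        }
        for (let retry = 0;
             retry < maxRetries && res.ok === 0 && res.code === STALE_SHARD_VERSION_CODE;
             retry++) {
            res = runCommand(dbName, cmdObj);
        }
        return res;
    };
}
```

      Restricting the wrapper to read commands keeps it from silently re-running writes, whose retry semantics differ.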

            Assignee: matthew.saltz@mongodb.com Matthew Saltz (Inactive)
            Reporter: jack.mulrow@mongodb.com Jack Mulrow
            Votes: 1
            Watchers: 7

              Created:
              Updated:
              Resolved: