Concurrent mapReduces with replace are not supported. As part of the cluster's map reduce logic, we will always drop the output collection then shard it again. Since these two operations are not atomic, it can leads to a race condition where two mapReduces can simultaneously output to the same collection with the same UUID.
With mapReduce A and B, in order:
- mapReduce A drops the output collection
- mapReduce B drops the output collection
- mapReduce A creates the sharded output collection with a generated UUID
- mapReduce B attempts to shard the output collection, but when doing is told that the collection is already sharded and receives the UUID that mapReduce A generated.
In order to avoid this undefined behavior, no sharded fuzzer suites (specifically: jstestfuzz_sharded_causal_consistency) should run mapReduce with replace.
- is duplicated by
-
SERVER-38471 Concurrent mapReduce to same output sharded collection can trigger invariant
- Closed
- related to
-
SERVER-38471 Concurrent mapReduce to same output sharded collection can trigger invariant
- Closed