Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 5.0.2, 5.1.0-rc0
Affects Version/s: None
Component/s: Sharding
Labels:
- sharding-causes-bfs-hard

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:
v5.0
Steps To Reproduce:
see attached test.js
Sprint:
Sharding EMEA 2021-05-31, Sharding EMEA 2021-06-14, Sharding EMEA 2021-06-28
Case:
Linked BF Score:
20
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

A collection drop that races with a shard primary stepdown can lead to secondaries believing that the collection is still sharded.

Setup:

The collection test.user is sharded.
There is 1 shard, and its current primary is nodeA.
The shard's nodeB has never heard about test.user, so it has never had any catalog cache entries for it.

1. _configsvrDrop deletes all config.chunks and config.collections entries for test.user.
2. nodeA steps down, and nodeB becomes the new primary.
3. _configsvrDrop sends setShardVersion (0,0) to all shards. Since nodeB never had any entries for test.user, the setShardVersion is a no-op.
4. If a secondary read with a shard version arrives at nodeA (now a secondary), nodeA asks nodeB (the primary) to refresh via _flushRoutingTableCacheUpdates.
5. nodeB ends up calling getDatabase and loading all sharded collections under that database, but since test.user has already been dropped, it is skipped.
6. nodeB therefore returns early without asking the CatalogCacheLoader to reload. Because the catalog cache loader did not perform the reload, the config.cache collections for test.user remain untouched.
7. nodeA gets an ok response from _flushRoutingTableCacheUpdates and then checks the version by reading config.cache.chunks. It finds that documents still exist there and erroneously concludes that the collection is still sharded.
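
The sequence above can be sketched as a small toy model. This is not MongoDB code: the dictionaries, the function name simulate_drop, and the stepdown flag are all hypothetical stand-ins for config.chunks, the shard's persisted config.cache.chunks data, and the per-node in-memory catalog cache. In particular, the assumption that a non-no-op setShardVersion (0,0) would clear the persisted cache is ours, made only to contrast the two paths.

```python
def simulate_drop(stepdown: bool) -> bool:
    """Toy model of the race; returns True if the secondary still
    believes test.user is sharded after the drop."""
    ns = "test.user"

    # Hypothetical stand-ins for the real metadata stores.
    config_chunks = {ns: [{"min": 0, "max": 100}]}       # config.chunks on the config server
    shard_cache_chunks = {ns: [{"min": 0, "max": 100}]}  # config.cache.chunks data on the shard
    in_memory = {"nodeA": {ns}, "nodeB": set()}          # per-node catalog cache entries

    # Step 1: the drop removes the authoritative metadata.
    del config_chunks[ns]

    # Step 2: optionally, nodeA steps down and nodeB becomes primary.
    primary = "nodeB" if stepdown else "nodeA"

    # Step 3: setShardVersion (0,0) is a no-op on a primary with no entries.
    # ASSUMPTION: when it is not a no-op, it clears the cached metadata.
    if ns in in_memory[primary]:
        in_memory[primary].discard(ns)
        shard_cache_chunks.pop(ns, None)

    # Steps 4-6: the primary handles the refresh request. The dropped
    # collection is no longer listed, so it returns early and never
    # touches the persisted cache.
    def flush_routing_table_cache_updates(namespace: str) -> dict:
        db = namespace.split(".", 1)[0]
        sharded = [n for n in config_chunks if n.startswith(db + ".")]
        if namespace not in sharded:
            return {"ok": 1}  # early return, no reload
        shard_cache_chunks[namespace] = list(config_chunks[namespace])
        return {"ok": 1}

    reply = flush_routing_table_cache_updates(ns)

    # Step 7: the secondary sees ok, reads the persisted cache, and treats
    # any leftover chunk documents as proof the collection is sharded.
    return reply["ok"] == 1 and bool(shard_cache_chunks.get(ns))


if __name__ == "__main__":
    print("with stepdown:", simulate_drop(stepdown=True))
    print("without stepdown:", simulate_drop(stepdown=False))
```

In this model, the stale result appears only when the stepdown happens between the metadata deletion and the setShardVersion, which matches the ordering of steps 1 to 3.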


test.js
1 kB
Apr 06 2020 09:39:09 PM UTC

is depended on by:
SERVER-34632 config.chunks change to config.cache.chunks creates a collection long name after upgrade (Backlog)

related to:
SERVER-17397 Dropping a Database or Collection in a Sharded Cluster may not fully succeed (Closed)

Assignee: Antonio Fuschetto
Reporter: Randolph Tan
Participants: Antonio Fuschetto, Githook User, Randolph Tan
Votes: 0
Watchers: 8

Created: Apr 06 2020 09:37:13 PM UTC
Updated: Oct 29 2023 10:09:51 PM UTC
Resolved: Jul 01 2021 07:08:51 AM UTC
Confidence Status Last Update: 28/Jun/21 7:28 AM
