Investigate whether we can perform a fake disk write in a dry-run election

XMLWordPrintableJSON

    • Type: Task
    • Resolution: Duplicate
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • Replication
    • None
    • 3
    • None
    • None
    • None
    • None
    • None
    • None

      During an election, if disk operations are failing on a voting node, the real election can never succeed because we attempt to store the last vote document on disk. However, we don't perform a disk operation during dry-run elections, so they can still succeed. We can get into a state where a node is repeatedly voting 'yes' in the dry-run election, but timing out (due to the disk operation) in the real election. This can cause the term to escalate quickly to a very high number. 

       

      As suggested, we should try to investigate and see if we can perform a 'fake' disk write in the dry-run election, to ensure that if disk operations are failing, we fail the dry-run election.

            Assignee:
            [DO NOT USE] Backlog - Replication Team
            Reporter:
            Ali Mir
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: