Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-92235

Add a function to scrub BSON for Utf-8 validity and use it in validation

    • Type: Icon: Improvement Improvement
    • Resolution: Unresolved
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None
    • Storage Execution
    • Execution Team 2024-08-05, Execution Team 2024-08-19, Execution Team 2024-09-02, Execution Team 2024-09-16, Execution Team 2024-09-30, Execution Team 2024-10-14, Execution Team 2024-10-28, Execution Team 2024-11-11, Execution Team 2024-11-25

      Validation returned a result that contained invalid UTF-8 encoding.  This was because it passed through keys from an index that had been corrupted and one of those keys was invalid.  As a result, the Python script parsing those validation results failed out.  We should add a function to the BSON api to scrub data and ensure all string values have valid Utf-8 encoding and use it to safeguard results returned from validation.

            Assignee:
            stephanie.eristoff@mongodb.com Stephanie Eristoff
            Reporter:
            binh.vo@mongodb.com Binh Vo
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

              Created:
              Updated: