Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-86384

Duplicate records in queryshapes key subdocument

    • Type: Icon: Bug Bug
    • Resolution: Unresolved
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • Query Integration
    • ALL

      Attached is a list of 50 instances where the values in the key subdocument repeat for the same host and server timestamp. 

      The metrics are different, i.e., the server thinks the observations belong to different query shapes. Unfortunately, the key subdocument doesn't capture this difference.

      https://docs.google.com/spreadsheets/d/1iYBVBSWg4rDeDdDIfzYISvnAfQblo322iYMOlQ2FDkY/edit#gid=1775716346

       

      Updated 6/13/2024
      I've queried for instance where this has happened in the last 2 weeks, and my query came up with ~60 instances. I tried to look for commonalities between them and observed all the 60 instances contain array types (but not all array types have this problem).

      Since this happens about 60 instances among hundreds of millions of query shapes this is not impacting the query shape count in a meaningful way. 

      Here's the query I used with an updated spreadsheet containing the duplicate query shape instances:
      https://docs.google.com/spreadsheets/d/1gW_PZz_xXD4G4HGp0pVeY9B4t8uCPeYqsDC1oXST25g/edit#gid=1496153475

       

       

            Assignee:
            Unassigned Unassigned
            Reporter:
            balazs.zombory@mongodb.com Balazs Zombory
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated: