Uploaded image for project: 'Python Driver'
  1. Python Driver
  2. PYTHON-4665

Create small reproducible issue demonstrating slow search indexing

    • Type: Icon: Task Task
    • Resolution: Unresolved
    • Priority: Icon: Unknown Unknown
    • None
    • Affects Version/s: None
    • Component/s: AI/ML
    • None
    • Python Drivers

      Context

      Our integration testing pipelines, to run green, have to add retries with significant sleeps. The upper bounds on the sleep required for these test appears to be higher than the mongot team expects. Beyond our own testing, our impression is that they will not give a good  developer experience.

      https://spruce.mongodb.com/commits/ai-ml-pipeline-testing?view=FAILED

      Definition of done

      The task here is to create a test that mimics the apparatus that we use everywhere (see for example, in langchain [here|https://github.com/langchain-ai/langchain/blob/master/libs/partners/mongodb/tests/utils.py|https://github.com/langchain-ai/langchain/blob/master/libs/partners/mongodb/tests/utils.py])]

      We do not want any external dependency, except perhaps openai, though it would be better not to share that key.. We CAN include programmatic index creation, for simplicity, although that isn't the cause of the problem.

      One need just add N documents with an indexed vector-search field, and then run a $vectorSearch on it until N documents are returned. 

      All the pieces can be quickly gathered from langchain-mongodb.

      Pitfalls

            Assignee:
            casey.clements@mongodb.com Casey Clements
            Reporter:
            casey.clements@mongodb.com Casey Clements
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: