-
Type: Task
-
Resolution: Unresolved
-
Priority: Unknown
-
None
-
Affects Version/s: None
-
Component/s: AI/ML
-
None
-
Python Drivers
Context
Our integration testing pipelines, to run green, have to add retries with significant sleeps. The upper bounds on the sleep required for these test appears to be higher than the mongot team expects. Beyond our own testing, our impression is that they will not give a good developer experience.
https://spruce.mongodb.com/commits/ai-ml-pipeline-testing?view=FAILED
Definition of done
The task here is to create a test that mimics the apparatus that we use everywhere (see for example, in langchain [here|https://github.com/langchain-ai/langchain/blob/master/libs/partners/mongodb/tests/utils.py|https://github.com/langchain-ai/langchain/blob/master/libs/partners/mongodb/tests/utils.py])]
We do not want any external dependency, except perhaps openai, though it would be better not to share that key.. We CAN include programmatic index creation, for simplicity, although that isn't the cause of the problem.
One need just add N documents with an indexed vector-search field, and then run a $vectorSearch on it until N documents are returned.
All the pieces can be quickly gathered from langchain-mongodb.
Pitfalls
- In its first draft, this isn't well-defined. Please ask casey.clements@mongodb.com to refine.