Engineer(s): Jib Adegunloye, Noah Stapp, Casey Clements
2024-05-15
Final Project Update
Summary:
- What was accomplished since the last update?
- All pull requests have been merged and the langchain-mongodb version has been bumped to reflect these new changes.
- Is there any follow-up work that we'll need to address in the future?
- Nothing scoped as result of this work. We will continue to foster our relationship with these libraries, ergo demanding new work.
- What, if anything, did we de-scope from the project?
- De-scoped additional performance improvements as that is an ongoing effort, similarly, that requires non-blocking performance tests. The performance improved scoped here empirically improve performance for large datasets.
2024-05-13
Summary:
- All dev is complete. 2 pull-requests to LangChain are imminent.
- What was completed over the last two weeks?
- Llama-Index integration testing was merged by the Maintainer.
- The integrations that we run in CI now point to upstream:main instead of Casey's feature branch.
- Intermittent failures in Integration tests no longer show up for tests that point to Atlas Cloud.
- DocArray integration was merged (1 of 2)
- What's the focus over the next two weeks?
- Langchain: Merge remaining two PRs
2024-04-26
Summary: AI/ML Test Pipeline improvements and merged ChatGPT-Retrieval!
- What was completed over the last two weeks?
- ChatGpt-Retrieval code has successfully been merged
- Testing pipeline has gotten more expansive
- Llama-Index testing introduced
- Now supports multiple index creations
- Identified a bug in the Atlas CLI that would cause intermittent failures on evergreen runs
- Reached out an provided a mitigation by using podman directly
- Task filed to track the work:
PYTHON-4391
- What's the focus over the next two weeks?
- Continue to make changes to Llama-Index and provide a notebook
- Introduce the experimental code to allow developers to create vector search indexes via pymongo code: https://github.com/langchain-ai/langchain/pull/19359
- Performance enhancement through batch size increase in Langchain code.
2024-04-12
Summary: AI/ML Pipeline Testing: additions and documentation
- What was completed over the last two weeks?
- DBX Tech Talk!
- Specifically addresses Quality of Life. Fair to say that was the main theme.
- Also helps pave the way for other drivers to onboard their integrations.
- Intermittent failures have been fixed for the following, all of which had separate issues
- chatgpt-retrieval-plugin
- llama-index
- semantic-kernel-python
- What's the focus over the next two weeks?
- Rationalize epics so that we are not tracking everything here.
- Impediments encountered over the last two weeks
- We have a failing dotnet/csharp driver. We should open a ticket and speak to Boris.
- Still waiting on upstream maintainers.
2024-03-29
Summary: AI/ML Pipeline Testing: additions and documentation
- What was completed over the last two weeks?
- Added LlamaIndex to AI/ML Pipeline Testing
- Added Documentation of Git Patch Files technique
- Set Casey up to use Atlas Local deployments
- Investigated intermittent failures in CI
- What's the focus over the next two weeks?
- DBX Tech Talk on AI-ML Integrations
- Jib on vacation
- Create MongoDB Llama Pack template
- Impediments encountered over the last two weeks
- No response from chatgpt-retrieval-plugin maintainer. Has been one month.
Summary: Continuing work on LangChain and LlamaIndex implementations
2024-03-13:
- What was completed over the last two weeks?
- All Langchain-MongoDB packages successfully included
- All Langchain-MongoDB added Example guides of library usage
- What's the focus over the next two weeks?
- Adding LLamaIndex to the AI/ML test Pipeline
- Updating AI/ML Testing Pipeline Documentation
- Impediments encountered over the last two weeks
- Reviewer timelines
- Local Atlas in evergreen doesn't support vectorSearch type. Needs to be updated.
Engineer(s): Jib Adegunloye, Noah Stapp, Casey Clements
Summary: Follow-up improvements to the Langchain and LlamaIndex python library integrations.
2024-03-01: Target date set to to account of review feedback loop
• What was completed over the last two weeks?
• Iteration on this project starts this week
• What's the focus over the next two weeks?
• Integrating the LangChain LLM Caching Layer
• Any tasks related to the LangChain library
• Adding LLamaIndex to the AI/ML Pipeline
• Impediments encountered over the last two weeks
• N/A
• Open Dependencies
• Maintainers of the LangChain & LlamaIndex code may push back timelines.
Engineer(s): Jib Adegunloye, Noah Stapp, Casey Clements
Summary: Continuing work on LangChain and LlamaIndex implementations
2024-02-24: Target date set to
• What was completed over the last two weeks?
• MongoDB is officially a LangChain partner package
• Getting review on MongoDB LLM Cache
• What's the focus over the next two weeks?
• Updating the test suite run for LangChain
• Addressing review comments
• Adding LLamaIndex to the AI/ML Pipeline
• Impediments encountered over the last two weeks
• N/A
• Open Dependencies
• Maintainers of the LangChain & LlamaIndex code may push back timelines.
Engineer(s): Jib Adegunloye, Noah Stapp, Casey Clements
Summary: Continuing work on LangChain and LlamaIndex implementations
2024-03-13: Target date set to
• What was completed over the last two weeks?
• Merged MongoDB LangChain partner package
• Uploaded PR to LangChain/MongoDB LLM Cache
• What's the focus over the next two weeks?
• Updating the test suite of the MongoDB Cache Layer
• Addressing review comments
• Adding LLamaIndex to the AI/ML Pipeline
• Impediments encountered over the last two weeks
• Reviewer timelines
• Open Dependencies
• Maintainers of the LangChain & LlamaIndex code may push back timelines.
-----------------------------------------------------------------------------------
Engineer(s): Jib Adegunloye, Noah Stapp, Casey Clements
Summary: Continuing work on LangChain and LlamaIndex implementations
2024-03-13: Target date set to
• What was completed over the last two weeks?
• Merged MongoDB LangChain partner package
• Uploaded PR to LangChain/MongoDB LLM Cache
• What's the focus over the next two weeks?
• Updating the test suite of the MongoDB Cache Layer
• Addressing review comments
• Adding LLamaIndex to the AI/ML Pipeline
• Impediments encountered over the last two weeks
• Reviewer timelines
• Open Dependencies
• Maintainers of the LangChain & LlamaIndex code may push back timelines.