[Feature Request]: Enhance Docling Query Engine: Add PGVector, MongoDB, and Qdrant Support via VectorDBFactory Wrapper #950

sitloboi2012 · 2025-02-13T04:51:59Z

Is your feature request related to a problem? Please describe.

The current query engine implementation (see docling_query_engine.py) leverages ChromaDB by wrapping its collection into a LlamaIndex ChromaVectorStore for indexing. Meanwhile, the VectorDBFactory class provides a mechanism to create vector database storage with various backends. To improve flexibility and meet our RAG objectives outlined in [Feature Request]: Docling data ingestion to RAG (#688), we need to extend this functionality.

Describe the solution you'd like

Review Existing Implementation:

Examine the current ChromaDB-based query engine implementation.
Understand how the VectorDBFactory maps the user-selected vector DB to a corresponding LlamaIndex VectorStore.

Implement Additional Support:

Develop wrappers or integration logic for alternative vector databases, specifically PGVector, MongoDB, and Qdrant.
Ensure that these new wrappers map configuration options correctly to the LlamaIndex-supported VectorStore interfaces.
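
To make the mapping idea concrete, here is a rough sketch of a name-to-VectorStore registry. It is not the existing VectorDBFactory code: `BackendSpec`, `VECTOR_STORE_BACKENDS`, `resolve_backend`, and `load_vector_store_class` are hypothetical names used only for illustration, and the module paths assume the separately installed LlamaIndex integration packages (chroma, postgres, mongodb, qdrant).

```python
from dataclasses import dataclass
from importlib import import_module


@dataclass(frozen=True)
class BackendSpec:
    """Where to find the LlamaIndex vector store class for one backend."""

    module: str
    class_name: str


# Hypothetical registry: user-facing backend name -> LlamaIndex integration.
# Module paths assume the matching llama-index-vector-stores-* package is installed.
VECTOR_STORE_BACKENDS = {
    "chroma": BackendSpec("llama_index.vector_stores.chroma", "ChromaVectorStore"),
    "pgvector": BackendSpec("llama_index.vector_stores.postgres", "PGVectorStore"),
    "mongodb": BackendSpec("llama_index.vector_stores.mongodb", "MongoDBAtlasVectorSearch"),
    "qdrant": BackendSpec("llama_index.vector_stores.qdrant", "QdrantVectorStore"),
}


def resolve_backend(name: str) -> BackendSpec:
    """Look up a backend by name, ignoring case and surrounding whitespace."""
    key = name.strip().lower()
    if key not in VECTOR_STORE_BACKENDS:
        supported = ", ".join(sorted(VECTOR_STORE_BACKENDS))
        raise ValueError(f"Unsupported vector DB {name!r}; supported: {supported}")
    return VECTOR_STORE_BACKENDS[key]


def load_vector_store_class(name: str) -> type:
    """Import the vector store class lazily, so unused backends need no install."""
    spec = resolve_backend(name)
    # Raises ImportError if the integration package for this backend is missing.
    module = import_module(spec.module)
    return getattr(module, spec.class_name)
```

The lazy import keeps the optional dependencies optional: only the backend the user selects has to be installed. Each backend's constructor arguments (connection string, collection name, and so on) would still need their own mapping from the factory's configuration.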

Integration & Testing:

Integrate the new wrappers with the existing query engine interface.
Test functionality within the context of DocumentAgent (Phase 1) (#438) and ensure compatibility with RAG capabilities.
Update documentation and examples to reflect the extended support.

Additional context

This enhancement is part of our ongoing effort to make the agent more versatile and not limited to a single vector DB. It builds on recent work (e.g., the merged ChromaDB implementation) and aligns with upcoming changes in retrieve_user_proxy_agent.py to support multiple query engines.

sitloboi2012 · 2025-02-13T04:53:49Z

@AgentGenie @Eric-Shang please help me review this issue and assign it to me 😃 This is a separate sub-issue that builds on and extends the previous work in #688 and #941.

sitloboi2012 added the enhancement label Feb 13, 2025

AgentGenie assigned sitloboi2012 Feb 13, 2025

sitloboi2012 mentioned this issue Feb 15, 2025

[Feat] Enhance Docling Query Engine with MongoDB Atlas Vector Search #983

Closed

3 tasks

davorrunje added this to ag2 Feb 18, 2025

davorrunje moved this to Waiting for merge in ag2 Feb 18, 2025
