Added support for auto-detecting tables and sending cropped table images for inference
Added support for multi-page PDF documents through CMD and API
Removed dependencies on LlamaIndex, Haystack, Unstructured, and other libraries, as the main focus of Sparrow is on Sparrow Parse
This release starts a new phase in Sparrow development: Vision LLM support for document data processing.
- Sparrow Parse library supports Vision LLM
- Sparrow Parse provides a factory class implementation to run inference locally or on a cloud GPU
- Sparrow supports JSON as the input query
- JSON query validation and LLM response JSON validation are performed
- Sparrow Parse integration
- Sparrow Parse is integrated into the Instructor agent. README updated with an example for the Instructor agent
- New Instructor agent
- Added the Instructor agent for better JSON response generation
- New agents with Unstructured
- Added unstructured-light and unstructured agents for better data pre-processing
- Virtual Environments support
- Fixes in the LlamaIndex agent so that it runs with the latest LlamaIndex versions
- LLM function calling agent
- OCR + LLM support, new vprocessor agent
- Improved FastAPI endpoints
- Added Haystack agent for structured data
- Changed plugins to agents
- Added support for plugin architecture. This allows various toolkits, such as LlamaIndex or Haystack, to be used within Sparrow
- Significant code refactoring
- Minor improvements related to data ingestion
- Fixed a bug in cleaning the Vector DB when a new document is inserted
- Tested with Notus and Openhermes LLMs
- Tested with longer and more realistic documents
- Upgraded LlamaIndex and LangChain
- Lemming LLM RAG
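
The Vision LLM release notes above mention JSON query validation and LLM response JSON validation. As a rough illustration of what such a check can look like, here is a minimal sketch using only the Python standard library. It is not Sparrow's actual implementation: the function names `parse_json_object` and `validate_response`, the query shape (a flat JSON object mapping field names to example values), and the field names in the usage lines are all assumptions made for this example.

```python
import json


def parse_json_object(text: str, label: str) -> dict:
    """Parse text as JSON and require a top-level object (hypothetical helper)."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"{label} is not valid JSON: {exc.msg}") from exc
    if not isinstance(data, dict):
        raise ValueError(f"{label} must be a JSON object")
    return data


def validate_response(query_text: str, response_text: str) -> dict:
    """Check that both inputs are JSON objects and that the response
    contains every field named in the query (hypothetical helper)."""
    query = parse_json_object(query_text, "query")
    response = parse_json_object(response_text, "response")
    missing = sorted(set(query) - set(response))
    if missing:
        raise ValueError(f"response is missing fields: {', '.join(missing)}")
    return response


# Usage with made-up field names: the query lists the fields to extract,
# and the response is the raw text returned by the model.
query = '{"invoice_number": "str", "total": 0}'
response = '{"invoice_number": "INV-001", "total": 125.5}'
print(validate_response(query, response))
```

Validating the query before inference and the response after it keeps malformed input from reaching the model and malformed output from reaching the caller. A stricter version could also compare value types or use a schema library.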