v2.0.0
Breaking Changes📦 graphragView on GitHub →
⚠ 5 breaking✨ 11 features🐛 7 fixes🔧 9 symbols
Summary
Version 2.0.0 introduces major architectural changes, including reworking the API to accept callbacks, reorganizing workflows, and refactoring configuration management into type-safe dataclasses. This release also adds features like LLM provider registration and multi-index querying support.
⚠️ Breaking Changes
- Rework API to accept callbacks: Existing API calls that relied on the old structure may need updates to pass callbacks.
- Remove config inheritance, hydration, and automatic env var overlays: Configuration loading is now stricter; users must explicitly define configurations instead of relying on implicit inheritance or environment variable overlays.
- Rework the update output storage structure: Code relying on the previous structure for storing workflow outputs will need updates.
- Refactor config defaults from constants to type-safe, hierarchical dataclass: Configuration access patterns might have changed due to the move from constants to dataclasses.
- Remove unused columns and change property document_attribute_columns to metadata: Any code referencing the old property name 'document_attribute_columns' must be updated to use 'metadata'.
Migration Steps
- Review API usage for any changes related to accepting callbacks.
- Update configuration loading logic to explicitly define settings, as automatic inheritance and environment variable overlays have been removed.
- Update code referencing output storage structures based on the rework.
- Update code referencing configuration defaults to use the new dataclass structure.
- Replace usage of 'document_attribute_columns' with 'metadata' in relevant configuration or access points.
✨ New Features
- Add children to communities to avoid re-compute.
- Add LLM Manager and Factory, to support provider registration
- Add NLP graph extraction.
- Add pipeline_start and pipeline_end callbacks.
- Add caching to NLP extractor.
- Add vector store id reference to embeddings config.
- Export NLP community reports prompt.
- Add dynamic retry logic.
- Add option to prepend metadata into chunks
- Implemented multi-index querying for api layer
- Multi index query cli support
🐛 Bug Fixes
- Fix DRIFT search on Azure AI Search.
- Fix StopAsyncIteration catch.
- Fix missing embeddings workflow in FastGraphRAG.
- Fix proper use of n_depth for drift search
- Fix report generation recursion.
- Fix summarization over large datasets for inc indexing. Fix relationship summarization
- Patch json mode for community reports
🔧 Affected Symbols
APILLM ManagerLLM Factoryworkflow runnerNLP extractorFastGraphRAGconfig defaultsquery modulefnllm