v2.0.0

Breaking Changes

📅 Feb 25, 2025📦 graphragView on GitHub →

⚠ 5 breaking✨ 11 features🐛 7 fixes🔧 9 symbols

Summary

Version 2.0.0 introduces major architectural changes, including reworking the API to accept callbacks, reorganizing workflows, and refactoring configuration management into type-safe dataclasses. This release also adds features like LLM provider registration and multi-index querying support.

⚠️ Breaking Changes

Rework API to accept callbacks: Existing API calls that relied on the old structure may need updates to pass callbacks.
Remove config inheritance, hydration, and automatic env var overlays: Configuration loading is now stricter; users must explicitly define configurations instead of relying on implicit inheritance or environment variable overlays.
Rework the update output storage structure: Code relying on the previous structure for storing workflow outputs will need updates.
Refactor config defaults from constants to type-safe, hierarchical dataclass: Configuration access patterns might have changed due to the move from constants to dataclasses.
Remove unused columns and change property document_attribute_columns to metadata: Any code referencing the old property name 'document_attribute_columns' must be updated to use 'metadata'.

Migration Steps

Review API usage for any changes related to accepting callbacks.
Update configuration loading logic to explicitly define settings, as automatic inheritance and environment variable overlays have been removed.
Update code referencing output storage structures based on the rework.
Update code referencing configuration defaults to use the new dataclass structure.
Replace usage of 'document_attribute_columns' with 'metadata' in relevant configuration or access points.

✨ New Features

Add children to communities to avoid re-compute.
Add LLM Manager and Factory, to support provider registration
Add NLP graph extraction.
Add pipeline_start and pipeline_end callbacks.
Add caching to NLP extractor.
Add vector store id reference to embeddings config.
Export NLP community reports prompt.
Add dynamic retry logic.
Add option to prepend metadata into chunks
Implemented multi-index querying for api layer
Multi index query cli support

🐛 Bug Fixes

Fix DRIFT search on Azure AI Search.
Fix StopAsyncIteration catch.
Fix missing embeddings workflow in FastGraphRAG.
Fix proper use of n_depth for drift search
Fix report generation recursion.
Fix summarization over large datasets for inc indexing. Fix relationship summarization
Patch json mode for community reports

🔧 Affected Symbols

APILLM ManagerLLM Factoryworkflow runnerNLP extractorFastGraphRAGconfig defaultsquery modulefnllm