arize-phoenix-evals-v2.9.0
📦 arize-phoenixView on GitHub →
✨ 5 features🐛 1 fixes⚡ 1 deprecations🔧 5 symbols
Summary
This release introduces the new FaithfulnessEvaluator, several new evaluation metrics, and improves trace ID handling. A minor bug fix addresses the use of deprecated pandas NA checking.
Migration Steps
- Replace usage of HallucinationEvaluator with FaithfulnessEvaluator.
✨ New Features
- Added FaithfulnessEvaluator.
- Added tool invocation accuracy metric.
- Added missing tool_selection evaluator to both libraries.
- Added neutral optimization direction option to evals.
- Return trace_id in Scores and improved the trace decorator.
🐛 Bug Fixes
- Used non-deprecated pandas na checking.
Affected Symbols
⚡ Deprecations
- HallucinationEvaluator is deprecated in favor of FaithfulnessEvaluator.