Change8

arize-phoenix-evals-v2.9.0

📦 arize-phoenixView on GitHub →
5 features🐛 1 fixes1 deprecations🔧 5 symbols

Summary

This release introduces the new FaithfulnessEvaluator, several new evaluation metrics, and improves trace ID handling. A minor bug fix addresses the use of deprecated pandas NA checking.

Migration Steps

  1. Replace usage of HallucinationEvaluator with FaithfulnessEvaluator.

✨ New Features

  • Added FaithfulnessEvaluator.
  • Added tool invocation accuracy metric.
  • Added missing tool_selection evaluator to both libraries.
  • Added neutral optimization direction option to evals.
  • Return trace_id in Scores and improved the trace decorator.

🐛 Bug Fixes

  • Used non-deprecated pandas na checking.

Affected Symbols

⚡ Deprecations

  • HallucinationEvaluator is deprecated in favor of FaithfulnessEvaluator.