Arize Phoenix
AI & LLMsAI Observability & Evaluation
Release History
arize-phoenix-client-v1.29.13 fixesThis patch release (1.29.1) focuses on minor bug fixes, including updating documentation snippets and fixing an issue related to project deletion after experiment deletion.
arize-phoenix-v13.3.01 fix2 featuresVersion 13.3.0 introduces a new conciseness classification evaluator and removes several UI components related to model inferences and embeddings. Bug fixes include improved error handling for JSONDistanceEvaluator.
arize-phoenix-v13.2.03 fixes1 featureRelease 13.2.0 introduces support for AWS Bedrock cross-region inference preferences and includes several bug fixes, notably reverting a dependency that caused build issues.
arize-phoenix-v13.1.11 fixVersion 13.1.1 primarily addresses a bug fix related to the ModelMenu query fetch policy.
arize-phoenix-v13.1.05 fixes9 featuresVersion 13.1.0 introduces significant enhancements to table functionality, adds new session skills, and merges the phoenix-sdk-python skill into phoenix-evals. This release also includes various bug fixes related to model pricing and UI theming.
arize-phoenix-v13.0.31 fixPatch release 13.0.3 primarily addresses a bug fix related to the playground interface.
arize-phoenix-evals-v2.10.01 fix4 featuresVersion 2.10.0 introduces dataset evaluators, full mustache prompt template support in the server, and enhancements to classification evaluators. A bug fix simplified template formatters.
arize-phoenix-v13.0.21 fixThis patch release primarily addresses dependency updates, specifically upgrading the arize-phoenix-client to version 1.29.0.
arize-phoenix-client-v1.29.01 featureThis release introduces dataset evaluators, enhancing the capabilities for evaluating datasets within the library.
arize-phoenix-v13.0.11 fixVersion 13.0.1 is a patch release primarily focused on fixing an issue related to file inclusion during package building.
arize-phoenix-v13.0.0Breaking39 featuresVersion 13.0.0 introduces extensive enhancements to LLM evaluation capabilities, including new built-in evaluators, improved playground features, and UI updates. This release contains breaking changes related to dataset evaluators.
arize-phoenix-v12.35.08 fixes1 featureThis release introduces support for the Claude Opus 4.6 model and includes several bug fixes related to data validation, dependency updates, and improved UX for trace management.
arize-phoenix-client-v1.28.11 fixVersion 1.28.1 primarily addresses a bug by introducing timezone validation for the log_spans_dataframe function.
arize-phoenix-v12.34.03 fixes1 featureThis release introduces the tool_selection evaluator to the evals library and includes several bug fixes, notably updating built-in model token prices and ensuring correct rendering of message content text.
arize-phoenix-evals-v2.9.01 fix5 featuresThis release introduces the new FaithfulnessEvaluator, several new evaluation metrics, and improves trace ID handling. A minor bug fix addresses the use of deprecated pandas NA checking.
arize-phoenix-v12.33.11 fixPatch release 12.33.1 primarily addresses a bug fix related to GraphQL attribute access.
arize-phoenix-v12.33.01 fix1 featureVersion 12.33.0 introduces configurable email extraction via EMAIL_ATTRIBUTE_PATH for OAuth2 and resolves an import ordering issue related to namespace package discovery.
arize-phoenix-v12.32.03 fixes4 featuresThis release introduces several new skills, including CI/SKILL, evals, and tracing, alongside a new tool invocation accuracy metric. Bug fixes include updated token prices and improved dataset creation submission.
arize-phoenix-v12.31.22 fixesThis patch release focuses on bug fixes, notably enabling multipart GraphQL subscription cancellation and removing the unnecessary pytz dependency.
arize-phoenix-v12.31.12 fixesThis patch release updates the internal arize-phoenix-client dependency and fixes a KeyError issue related to pandas 3.0 compatibility in span queries.
arize-phoenix-client-v1.28.02 featuresThis release introduces the new FaithfulnessEvaluator, deprecates HallucinationEvaluator, and adds support for linking dataset examples to traces via span_id_key.
arize-phoenix-v12.31.02 fixes5 featuresThis release introduces a new CLI, adds the FaithfulnessEvaluator, and includes updates to token prices and trace linking capabilities.
arize-phoenix-v12.30.03 fixes2 featuresThis release introduces new UI components for displaying JSON attributes and fixes several bugs related to preserving invocation parameters and response formats during model updates.
arize-phoenix-v12.29.06 fixes4 featuresThis release introduces connection timeout error handling, a correctness evaluator, and updates to span attribute display in the UI. It also includes dependency updates and documentation improvements.
arize-phoenix-evals-v2.8.01 fix3 featuresVersion 2.8.0 introduces a new correctness evaluator and enhances the LLM constructor by enabling sync/async client kwargs. It also fixes an issue related to newlines in built-in prompts.
arize-phoenix-client-v1.27.21 fixVersion 1.27.2 primarily addresses a bug related to using the context.span_id column when DataFrames utilize an integer index.
arize-phoenix-v12.28.11 fixPatch release 12.28.1 primarily addresses a bug related to handling non-array values in NumDocuments to avoid PostgreSQL errors.
arize-phoenix-v12.28.03 fixes3 featuresVersion 12.28.0 introduces JSONL dataset upload support and a new tool selection correctness metric for evals, alongside several UI and layout bug fixes.
arize-phoenix-v12.27.02 fixes7 featuresVersion 12.27.0 introduces several UI/UX enhancements, including analytics, theme selection, and layout consistency, alongside a fix for built-in prompt formatting.
arize-phoenix-v12.26.04 fixes1 featureVersion 12.26.0 introduces organizational improvements by moving the dataset example to the examples tab and includes several dependency updates and bug fixes, notably updating token prices and ensuring proper database engine disposal.
arize-phoenix-client-v1.27.11 fixThis patch release primarily addresses a bug fix related to updating the available options for the `reasoning_effort` setting when interacting with OpenAI services.
arize-phoenix-v12.25.12 fixesThis patch release focuses on updating internal cost data and refining OpenAI reasoning effort options, alongside general documentation improvements.
arize-phoenix-v12.25.02 fixes3 featuresVersion 12.25.0 introduces lazy loading for embeddings/dimensions, Gemini tool call support, and updates key dependencies. LDAP configuration for email attributes has also been adjusted to use null as a sentinel value.
arize-phoenix-evals-v2.7.11 fixThis patch release (2.7.1) primarily addresses bug fixes related to the rate limiter within the evals functionality.
arize-phoenix-client-v1.27.03 featuresVersion 1.27.0 introduces new tracing capabilities with span notes functionality and adds support for Lightweight Directory Access Protocol (LDAP).
arize-phoenix-v12.24.02 featuresThis release introduces optional scarf analytics and readme analytics features, alongside documentation updates to reorganize the sidebar navigation.
arize-phoenix-v12.23.01 featureThis release introduces a feature allowing the email field to be optional for LDAP configurations and includes documentation updates.
arize-phoenix-v12.22.01 featureThis release introduces a new span note function for the Python client and includes documentation updates.
arize-phoenix-v12.21.01 fix1 featureVersion 12.21.0 introduces a new endpoint for span notes and updates the built-in model token prices for cost tracking.
arize-phoenix-v12.20.01 fix2 featuresVersion 12.20.0 introduces support for existing secrets and LDAP, alongside a dependency update for arize-phoenix-evals.
arize-phoenix-evals-v2.7.01 featureThis release introduces support for prompt/template messages within the evals module.
arize-phoenix-v12.19.02 fixes1 featureVersion 12.19.0 introduces a new feature for splitting on the experiments table and includes bug fixes related to token prices and Bedrock invocation parameters.
arize-phoenix-v12.18.06 fixes3 featuresVersion 12.18.0 introduces support for Claude Opus 4-5, enables split assignment during dataset upload, and includes several dependency updates and bug fixes.
arize-phoenix-client-v1.26.01 featureVersion 1.26.0 introduces the ability to assign splits directly during dataset uploads.
arize-phoenix-client-v1.25.01 featureVersion 1.25.0 introduces new evaluation helpers specifically designed to simplify pulling RAG spans.
arize-phoenix-v12.17.09 fixes3 featuresVersion 12.17.0 introduces new features like enabling repetitions in the playground and improving the retention policy UX, alongside several dependency updates and bug fixes across the platform.
arize-phoenix-evals-v2.6.11 fixVersion 2.6.1 primarily addresses a bug related to handling None values for top_p and temperature settings within the AnthropicModel integration.
arize-phoenix-client-v1.24.01 featureVersion 1.24.0 primarily updates the client license to Apache 2.0.
arize-phoenix-otel-v0.14.01 featureThis release primarily focuses on licensing, switching the client to the Apache 2.0 license.
arize-phoenix-v12.16.0Breaking2 fixes3 featuresThis release introduces significant feature enhancements, including support for OpenAI 5.1 and expanded reasoning capabilities for Gemini models, while simultaneously dropping support for Python 3.9.
arize-phoenix-v12.15.13 fixesThis patch release (12.15.1) focuses on maintenance, including updating built-in model token prices and fixing caching and integer handling issues.
arize-phoenix-v12.15.01 featureVersion 12.15.0 introduces an update to the supported Anthropic model list.
arize-phoenix-v12.14.22 fixesThis patch release primarily focuses on dependency updates, specifically bumping the playwright version, and includes a fix related to prompt handling by using loadQuery.
arize-phoenix-v12.14.11 fixThis patch release focuses on improving internal stability by adding type safety to the loader data handling.
arize-phoenix-v12.14.01 featureThis release introduces the React compiler feature, accessible via configuration, enhancing the platform's capabilities.
arize-phoenix-v12.13.12 fixesThis patch release fixes an issue where backslashes were not properly escaped during Python string sanitization and updates a key dependency.
arize-phoenix-evals-v2.6.02 fixes3 featuresVersion 2.6.0 introduces new adapters for Anthropic and Google GenAI, adds GPT-5 support for Azure OpenAI, and includes various bug fixes and ergonomic improvements to the evaluation module.
arize-phoenix-client-v1.23.01 featureVersion 1.23.0 introduces experiment retries to enhance robustness. This release focuses on stability and new feature implementation.
arize-phoenix-v12.13.05 fixes2 featuresPhoenix v12.13.0 introduces pure functions for rendering and removes reliance on "@arizeai/components", alongside several bug fixes including restored charting colors and updated token prices.
arize-phoenix-v12.12.01 featureVersion 12.12.0 introduces a feature where page titles are derived from navigation data. No breaking changes or bug fixes were noted in this release.
arize-phoenix-v12.11.11 fixThis patch release addresses a specific UI bug related to the timezone selector compatibility across different browsers.
arize-phoenix-v12.11.03 fixes1 featureVersion 12.11.0 introduces user timezone preference settings and includes several bug fixes related to client versioning, form validation, and metric time ranges.
arize-phoenix-client-v1.22.02 fixes4 featuresVersion 1.22.0 introduces several new features including binding examples to evaluators, prompt metadata, and resuming experiments. It also includes bug fixes related to error messaging and input softening for dataset creation.
arize-phoenix-v12.10.03 fixes13 featuresPhoenix version 12.10.0 introduces several new features including ES2022 target support, system themes, and enhanced authentication controls. This release also includes bug fixes related to Helm deployments and playground rendering.
arize-phoenix-v12.9.01 fix5 featuresPhoenix version 12.9.0 introduces several new features, including enhanced menu abstractions, AWS IAM auth support, and improvements to experiment comparison, alongside a bug fix for Bedrock model IDs.
arize-phoenix-v12.8.01 fix4 featuresVersion 12.8.0 introduces several UI and backend enhancements, including a split edit menu and preloading GraphQL queries, alongside an update to built-in model token prices.
arize-phoenix-v12.7.11 fixThis patch release primarily focuses on a bug fix related to fetching user API keys on the profile page.
arize-phoenix-v12.7.07 fixes14 featuresThis release focuses heavily on enhancing dataset and experiment management with new filtering, editing, and comparison features, alongside removing the splits feature flag for simplification. Several bug fixes address cost calculations and UI interactions.
arize-phoenix-v12.6.11 fixVersion 12.6.1 primarily addresses a bug related to session table row clicking functionality.
arize-phoenix-v12.6.01 fix1 featureThis release primarily focuses on upgrading the frontend framework by moving to React 19.2 and includes a fix for sorting issues on the experiment compare list page.
arize-phoenix-v12.5.01 fix7 featuresPhoenix version 12.5.0 introduces several UI and feature enhancements, including filtering for examples and split management, a new viewer role, and improvements to experiment comparison views. A bug fix addresses incorrect Python code for dataset retrieval.
arize-phoenix-v12.4.04 fixes1 featureVersion 12.4.0 introduces PKCE support for OAuth 2.0 OIDC and includes several bug fixes related to data ingestion and UI elements.
arize-phoenix-evals-v2.5.03 featuresVersion 2.5.0 introduces a new regex evaluator, improves binding ergonomics, and expands LLM provider support.
arize-phoenix-v12.3.03 fixes4 featuresPhoenix version 12.3.0 introduces several UI enhancements for datasets and experiments, alongside bug fixes related to token prices, query parameters, and OIDC error handling.
arize-phoenix-v12.2.04 fixes9 featuresVersion 12.2.0 introduces several new UI and dataset management features, including prompt version editing and bulk label operations, alongside various bug fixes for stability and performance.
arize-phoenix-evals-v2.4.01 featureVersion 2.4.0 introduces support for asynchronous functions within the evaluator creation utility and includes documentation enhancements for the evals module.
arize-phoenix-evals-v2.3.01 fix1 featureVersion 2.3.0 introduces support for dot-delimited fstring keys in evaluation templates and improves key extraction logic within evals.
arize-phoenix-v12.1.02 fixes4 featuresPhoenix version 12.1.0 introduces support for Claude Sonnet 4.5, adds GraphQL operations for dataset labels, and includes several bug fixes for component switching and playground loading.
arize-phoenix-client-v1.21.01 featureVersion 1.21.0 introduces new client methods for adding trace and session annotations, along with updated session API documentation.
arize-phoenix-v12.0.0Breaking8 fixes19 featuresVersion 12.0.0 introduces significant database schema updates, including support for session annotations, dataset splits, and experiment snapshots. This major release includes numerous backend improvements and updates to dependency handling.
arize-phoenix-evals-v2.2.02 featuresThis release introduces a new document relevance evaluator and a utility for formatting dataframe evaluations as annotations for logging to Phoenix.
arize-phoenix-client-v1.20.01 fix1 featureVersion 1.20.0 introduces support for tracking repetitions and fixes an issue related to Experiment tracing respecting OITracer configurations.
arize-phoenix-v11.38.02 fixes1 featureVersion 11.38.0 introduces the ability to configure repetitions in the playground and resolves several minor bugs related to link generation and playground error handling.
arize-phoenix-v11.37.04 fixes2 featuresVersion 11.37.0 introduces new UI features like a checkbox and custom HTTP headers for the playground, alongside several bug fixes related to Helm, pagination, and float formatting.
arize-phoenix-evals-v2.1.02 fixes1 featureVersion 2.1.0 introduces support for the Azure provider and includes bug fixes related to OpenAI SDK retries and tracer implementation.
arize-phoenix-v11.36.01 fix2 featuresThis release introduces improvements to experiment comparison visualization and updates the notification system to use react-aria toasts, alongside a minor fix in the TypeScript client snippet.
arize-phoenix-v11.35.06 fixes2 featuresVersion 11.35.0 introduces IPv6 support and image registry configuration for Helm charts, alongside several bug fixes primarily focused on SQLite and dependency updates.
arize-phoenix-client-v1.19.11 fixThis patch release primarily addresses a bug in the client related to handling printed URLs for proxied connections.
arize-phoenix-client-v1.19.01 featureVersion 1.19.0 introduces compatibility improvements for the Experiments<->Evals 2.0 feature set.
arize-phoenix-evals-v2.0.11 fixThis patch release primarily addresses a documentation issue by fixing incorrect import paths in the preview documentation.
arize-phoenix-evals-v2.0.0Breaking2 fixes3 featuresVersion 2.0.0 stabilizes Evals 2.0, introduces asynchronous evaluation capabilities, and adds rate limiting to LLM interactions. This release includes significant updates to the evals module.
arize-phoenix-v11.34.01 fix2 featuresVersion 11.34.0 introduces new features for experiment comparison and span queue management, alongside a fix to the insertion time histogram buckets.
arize-phoenix-v11.33.07 fixes7 featuresVersion 11.33.0 introduces several enhancements to experiment tracking, including new UI elements and paging, alongside prompt labeling features and updated token prices.
arize-phoenix-client-v1.18.21 fixVersion 1.18.2 primarily focuses on a bug fix within the experiments module to validate the 'repetitions' parameter.
arize-phoenix-client-v1.18.12 fixesVersion 1.18.1 primarily addresses bug fixes, including improved handling for multi-index dataframes in document annotations and correcting documentation errors.
arize-phoenix-otel-v0.13.1This patch release focuses primarily on documentation improvements, including adding docs links to readmes, fixing python client documentation, ensuring consistent docstrings, and improving OpenTelemetry documentation.
arize-phoenix-v11.32.11 fixVersion 11.32.1 is a patch release focused on restoring compatibility with older versions of Pydantic (<2.6).
arize-phoenix-v11.32.02 featuresThis release focuses on frontend improvements for the experiments list page and introduces pagination when fetching experiment runs inside run_experiment.
arize-phoenix-client-v1.18.01 featureVersion 1.18.0 introduces pagination support for fetching experiment runs inside the run_experiment function, improving handling of large result sets.
arize-phoenix-client-v1.17.11 fixVersion 1.17.1 primarily addresses a bug fix related to backward compatibility for the 'example' argument within the experiment task function.
Common Errors
AssertionError2 reportsAssertionErrors in Arize Phoenix Anthropic streaming usually stem from unhandled event types within the `ParsedMessage` structure, specifically `ParsedMessageStopEvent` or `ParsedContentBlockStopEvent`. To resolve this, ensure your code includes explicit handling for these stop events, likely by adding new cases to your event-processing logic that gracefully exits or modifies the data stream. This might involve updating a switch statement or adding new conditional branches to properly manage these specific stop conditions.
DatatypeMismatchError1 reportDatatypeMismatchError in Phoenix often arises when filtering or comparing columns with incompatible data types (e.g., comparing a string to a number). To fix this, ensure the data types are consistent before applying filters or comparisons, using explicit type conversions if necessary (e.g., converting a string representation of a number to an integer/float using `astype`). Validate your data schema to confirm the correct data types are assigned to each column.
NotImplementedError1 reportThe "NotImplementedError" in arize-phoenix usually means a requested function or method hasn't been defined for a specific class or integration, like a new model type. To fix it, either implement the missing method within the relevant class (e.g., for the Google ADK and Gemini Models, implement the Playground tool logic) or use a supported method if an alternative exists. Sometimes, upgrading to the newest version of the library will resolve this.
Related AI & LLMs Packages
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
🦜🔗 The platform for reliable agents.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
LLM inference in C/C++
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Subscribe to Updates
Get notified when new versions are released