v3.9.0
📦 mlflowView on GitHub →
✨ 17 features🐛 3 fixes⚡ 1 deprecations🔧 6 symbols
Summary
MLflow 3.9.0 introduces significant enhancements to GenAI capabilities, including the MLflow Assistant chatbot, Trace Overview Dashboard, and a revamped AI Gateway integrated into the tracking server. This release also focuses heavily on evaluation tooling with new LLM judges, judge builders, and distributed tracing support.
Migration Steps
- If you were relying on Unity Catalog function integration within the AI Gateway, you will need to update your setup as this integration is deprecated.
✨ New Features
- Introduction of MLflow Assistant, an in-product chatbot backed by Claude Code for debugging ML apps and agents.
- Trace Overview Dashboard added to GenAI experiments providing pre-built statistics like latency, request count, and quality metrics.
- Revamped AI Gateway with the server now integrated directly into the tracking server, supporting passthrough endpoints, traffic splits, and fallback models.
- Online Monitoring with LLM Judges allows automatic scoring of traces without custom code, supporting pre-defined or custom judges.
- Judge Builder UI introduced for defining and iterating on custom LLM judge prompts directly within the UI.
- Support for Distributed Tracing, propagating trace context across different services and processes via APIs in `mlflow.tracing.distributed`.
- Introduction of `MemAlignOptimizer`, a new algorithm for judge optimization that learns from past feedback.
- Addition of `GePaAlignmentOptimizer` for judge instruction optimization.
- Addition of `Fluency` scorer for evaluating text quality.
- Addition of `KnowledgeRetention` built-in scorer.
- Integration of Phoenix (Arize) third-party scorer.
- Support for conversational datasets (with persona, goal, context) in evaluation.
- Introduction of conversational guidelines scorer.
- Support for Jinja2 template in prompt registry.
- Support for metaprompting in `mlflow.genai.optimize_prompts()`.
- Addition of `log_stream` API for logging binary streams as artifacts.
- Addition of `import_checkpoints` API for Databricks SGC Checkpointing with MLflow.
🐛 Bug Fixes
- Job backend updated to use static names instead of function full names.
- Support for shift+select for Traces in the UI.
- Auto-navigation to Events tab when clicking error spans in Traces.
Affected Symbols
⚡ Deprecations
- Unity Catalog function integration in AI Gateway is deprecated.