Accelerate
Data & ML🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Release History
v1.12.02 fixes2 featuresThis release introduces major integration with Deepspeed Ulysses/ALST for sequence parallelism, enabling efficient long sequence training. It also includes several minor fixes and documentation updates.
v1.11.0Breaking15 fixes5 featuresThis release introduces support for TransformerEngine MXFP8 and enables FP16/BF16 training on MPS devices. It also drops support for Python 3.9 and brings numerous stability and feature updates across FSDP and nd-parallelism.
v1.10.11 fix1 featureVersion 1.10.1 introduces a new `to_json` utility and improves import safety for device mesh functionality.
v1.10.0Breaking6 fixes4 featuresThis release introduces comprehensive N-D Parallelism support via `ParallelismConfig` integrated with `Accelerator`, alongside significant FSDP improvements, particularly for MoE models.
v1.9.0Breaking6 fixes5 featuresThis release introduces native support for the trackio experiment tracking library and includes significant speedups for model loading, alongside various minor improvements and fixes for FSDP and DeepSpeed configurations.
v1.8.12 featuresThis minor release introduces support for the e5e2 model type and sets the default strategy to hybrid when using a launcher.
v1.8.0Breaking11 fixes11 featuresThis release introduces major refactoring for FSDPv2 setup, adding FP8 support, and significantly enhancing performance and stability for Intel CPU/XPU users. It also deprecates `ipex.optimize` and integrates SwanLab as a new experiment tracker.
v1.7.0Breaking8 fixes5 featuresThis release introduces significant performance improvements through regional compilation for torch.compile and adds layerwise casting hooks for memory optimization. It also brings substantial enhancements to FSDP2 support, including enabling `FULL_STATE_DICT` and fixing memory issues.
v1.6.010 fixes5 featuresThis release introduces major features including FSDPv2 support and initial DeepSpeed Tensor Parallelism support, alongside adding the XCCL distributed backend for XPU devices.
v1.5.22 fixesThis patch release (v1.5.2) focuses on resolving specific bugs related to device detection and production imports.
v1.5.04 fixes2 featuresThis release introduces HPU accelerator support and fixes several bugs related to device indexing, CLI argument precedence, and generator initialization.
v1.4.04 fixes2 featuresThis release introduces initial support for FP8 training via the `torchao` backend and adds initial Tensor Parallelism support for dataloaders, alongside several bug fixes including a critical memory leak resolution.
v1.3.0Breaking10 fixes2 featuresThis release enforces PyTorch 2.0 as the minimum required version and introduces improvements for handling compiled models, TPU execution, and various bug fixes across device support and offloading.