4.5.0
📦 datasetsView on GitHub →
✨ 1 features🐛 3 fixes🔧 1 symbols
Summary
This release introduces native support for the Lance dataset format and includes several bug fixes, notably improving error handling for invalid revisions in `load_dataset`.
✨ New Features
- Added support for the Lance dataset format, including metadata/manifests and standalone .lance files (e.g., using streaming=True with datasets like "lance-format/fineweb-edu").
🐛 Bug Fixes
- Raised an early error for invalid `revision` argument passed to `load_dataset`.
- Fixed an IndexError occurring with low but large example indices.
- Fixed the method used to retrieve attributes from a file object.