b8248
📦 llama-cpp
✨ 1 feature
Summary
This release displays total and free VRAM capacity during CUDA device initialization, making GPU memory usage easier to inspect. Pre-compiled binaries are also provided for a range of platforms.
✨ New Features
- Display total and free VRAM capacity during CUDA device initialization.
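As a rough illustration of what such a report involves, the sketch below formats a per-device line from a free and a total byte count. In a CUDA program these two values are typically obtained from the CUDA runtime call `cudaMemGetInfo(&free_bytes, &total_bytes)` after selecting the device with `cudaSetDevice`. The helper name `format_vram_line` and the line layout are hypothetical and are not taken from the llama.cpp source; the actual log format in this release may differ.

```cpp
#include <cstddef>
#include <cstdio>
#include <string>

// Hypothetical helper (not llama.cpp's implementation): builds a one-line
// summary of a device's VRAM. In a real CUDA program, free_bytes and
// total_bytes would come from cudaMemGetInfo(&free_bytes, &total_bytes).
std::string format_vram_line(int device_id, const std::string & name,
                             std::size_t free_bytes, std::size_t total_bytes) {
    const std::size_t MiB = 1024u * 1024u;  // report sizes in mebibytes
    char buf[256];
    std::snprintf(buf, sizeof(buf),
                  "Device %d: %s, VRAM: %zu MiB total, %zu MiB free",
                  device_id, name.c_str(), total_bytes / MiB, free_bytes / MiB);
    return std::string(buf);
}
```

Calling `format_vram_line(0, "Example GPU", free_bytes, total_bytes)` once per device during initialization would give a quick view of how much memory is available before any model is loaded.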