b8978
📦 llama-cppView on GitHub →
🐛 1 fixes
Summary
This release includes a fix to discard the last drafted token if it has a low probability. It also provides numerous pre-compiled binaries for macOS, Linux, Android, Windows, and openEuler across various architectures and acceleration methods.
🐛 Bug Fixes
- Discard last drafted token with low probability.