Change8

b8978

📦 llama-cppView on GitHub →
🐛 1 fixes

Summary

This release includes a fix to discard the last drafted token if it has a low probability. It also provides numerous pre-compiled binaries for macOS, Linux, Android, Windows, and openEuler across various architectures and acceleration methods.

🐛 Bug Fixes

  • Discard last drafted token with low probability.