Change8

v0.11.8

📦 ollama
2 features🔧 1 symbols

Summary

This release enables flash attention by default for gpt-oss models and improves their overall loading performance.

✨ New Features

  • Flash attention is now enabled by default for gpt-oss on supported systems.
  • Improved load times for gpt-oss models.

🔧 Affected Symbols

gpt-oss