Ollama – v0.11.5

πŸ“¦ Ollama – v0.11.5

πŸŽ‰ Hey AI Homebrew Enthusiasts! πŸŽ‰

Ollama version 0.11.5-rc2 is here with some fantastic improvements! Here’s what’s new:

  • Faster Models: Performance boost for gpt-oss models! πŸš€
  • GPU Power: Better multi-GPU support and reduced VRAM usage. πŸ’ͺ
  • Bug Fixes: Issues with Harmony tool calls & OpenAI API reasoning effort are now solved! βœ…
  • CPU Boost: `OLLAMA_FLASH_ATTENTION=1` now enables flash attention even on CPU! 🧠
  • Smaller Footprint: Installation size reduced for Windows and Linux users. πŸ’Ύ
  • Improved Memory Management: Thanks to @jessegross!

Plus, a big welcome to our new contributors: @vorburger, @dan-and, and @youzichuan! πŸ™Œ

You can find the full changelog here: https://github.com/jmorganca/ollama/compare/v0.11.4…v0.11.5-rc2

Happy Ollama-ing! πŸ¦™βœ¨

πŸ”— https://github.com/ollama/ollama/releases/tag/v0.11.5-rc2

August 15, 2025 (0)