π¦ Ollama – v0.11.5
π Hey AI Homebrew Enthusiasts! π
Ollama version 0.11.5-rc2 is here with some fantastic improvements! Hereβs what’s new:
- Faster Models: Performance boost for gpt-oss models! π
- GPU Power: Better multi-GPU support and reduced VRAM usage. πͺ
- Bug Fixes: Issues with Harmony tool calls & OpenAI API reasoning effort are now solved! β
- CPU Boost: `OLLAMA_FLASH_ATTENTION=1` now enables flash attention even on CPU! π§
- Smaller Footprint: Installation size reduced for Windows and Linux users. πΎ
- Improved Memory Management: Thanks to @jessegross!
Plus, a big welcome to our new contributors: @vorburger, @dan-and, and @youzichuan! π
You can find the full changelog here: https://github.com/jmorganca/ollama/compare/v0.11.4…v0.11.5-rc2
Happy Ollama-ing! π¦β¨
π https://github.com/ollama/ollama/releases/tag/v0.11.5-rc2