Tater News - Lemonade

📦 Lemonade – v8.1.1

Hey AI Enthusiasts! 📢

Lemonade has a fresh update – version 8.1.1 is here! This release is packed with exciting improvements for running LLMs locally with maximum performance. Check it out: https://github.com/lemonade-sdk/lemonade/releases/tag/v8.1.1

Here’s a rundown of what’s new:

ROCm7 Support: Radeon GPU users rejoice! Now you can leverage ROCm7 as a llamacpp backend.
NPU & Hybrid LLM Support: Updates to model builds for custom NPUs and hybrid LLMs using RAI SW 1.5.
New GGUF Models: gpt-oss-120b-GGUF and gpt-oss-20b-GGUF are now supported in the Lemonade Server. 🍋
Context Length Adjustment: Fine-tune your LLM performance with the new `–ctx-length` option in `lemonade-server serve`.
Image Input: The LLM Chat web app now supports image input! 🖼️
Automated Website Publishing: Say hello to a smoother workflow with automated website publishing!
Hot New Models: qwen3-coder and cogito-v2-109B are now available as hot GGUF models. 🔥
Improved Model Documentation: The server\_models.md file has been updated, including information on NPU models.
Resolved Download Issues: Large model downloads from Hugging Face are back!

Happy experimenting! 🕹️🍻

August 6, 2025 (0)