Improved performance and model support with GGUF
Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.
Ollama 0.30 introduces improved performance and expanded GGUF model compatibility, offering up to 20% faster performance on NVIDIA hardware and broader GPU acceleration support. The update enables seamless integration with GGUF files and enhances tool-calling capabilities for coding agents and assistants. Vulkan is now enabled by default, extending GPU acceleration to AMD and Intel devices without requiring vendor-specific libraries.
Ollama 0.30 has been released with improved performance and broader GGUF model compatibility through llama.cpp, complementing its MLX engine on Apple silicon. The update delivers up to 20% faster performance on NVIDIA hardware, enables Vulkan by default to extend GPU acceleration to AMD and Intel devices, and expands support for more model families and fine-tuned models. Models with tool-calling capabilities can also be used directly with coding agents and assistants via a single command.