llama.cpp releases
-
llama.cpp releases dev-tools 1mo ago
b9123
ggml-webgpu: Enables running gpt-oss-20b (#22906). Enable running gpt-oss-20b and refactor mulmat-q; disable test-backend-ops in ubuntu-24-webgpu. macOS/iOS: macOS Apple Silicon (arm64), macOS Apple Silicon (arm64, KleidiAI enabled), macOS Intel (x64), iOS XCFramework. Linux: Ubuntu…
13 -
llama.cpp releases dev-tools 1mo ago
b9122
ggml-webgpu: address precision issues for multimodal (#22808). fix(mixed-types): use f32 for precision and update the shared memory calculation logic for f32; fix(unary): correct the gelu, gelu quick and gelu erf functions; fix(flash-attn-tile): fix the hardcode v type…
9 -
llama.cpp releases dev-tools 1mo ago
b9119
vulkan: Fix Windows performance regression on Intel GPU BF16 workloads for Xe2 and newer (#22461). refactor; Use l_warptile only when coopmat is available for BF16. macOS/iOS: macOS Apple Silicon (arm64), macOS Apple Silicon (arm64, KleidiAI enabled), macOS Intel (x64), iOS…
23 -
llama.cpp releases dev-tools 1mo ago
b9118
vulkan: Check shared memory size for mmq shaders (#22693). macOS/iOS: macOS Apple Silicon (arm64), macOS Apple Silicon (arm64, KleidiAI enabled), macOS Intel (x64), iOS XCFramework. Linux: Ubuntu x64 (CPU), Ubuntu arm64 (CPU), Ubuntu s390x (CPU), Ubuntu x64 (Vulkan), Ubuntu arm64…
31 -
llama.cpp releases dev-tools 1mo ago
b9116
mtmd: add MiMo v2.5 vision (#22883). mimo-v2.5: vision support; mimo-v2.5: use fused qkv for vision; mimo-v2.5: fix f16 vision overflow; mimo-v2.5: comment cleanups; mimo-v2.5: Flash doesn't have mmproj; more cleanup; remember to use filter_tensors; mimo-v2.5: fix trailing whitespace…
25 -
llama.cpp releases dev-tools 1mo ago
b9115: convert : add split() to LoraTorchTensor in LoRA converter (#22832)
convert : add split() method to LoraTorchTensor; Fix python type-check; Fix flake8 Lint; fix: handle positional dim arg in torch.split dispatch; Fix type-check again; Fix type-checks; Remove unit test per reviewers feedback; work around ty deficiency. Co-authored-by: Sigbjørn Skjæret…
10