llama.cpp releases · May 15, 2026 · 1 min read

b9169

#model-release #music

Mirrored from llama.cpp releases for archival readability. Support the source by reading on the original site.

Like Read original ↗

mtmd: add chunks and fix preproc for qwen3a (#23073)

mtmd: add chunks and fix preproc for qwen3a
add attn_mask
limit mtmd_chunk size (avoid blow up memory)
correct audio tokens
re-order the set_input case
remove attn_mask

macOS/iOS:

Linux:

Android:

Android arm64 (CPU)

Windows:

openEuler:

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

No comments yet. Sign in and be the first to say something.

More from llama.cpp releases