llama.cpp releases · · 1 min read

b9453

Mirrored from llama.cpp releases for archival readability. Support the source by reading on the original site.

model: Add EXAONE 4.5 implementations (#21733)

  • Add EXAONE 4.5 and Add GQA for MMproj

  • mtmd: EXAONE 4.5 vision markers and projector path

EXAONE 4.5 uses and for image boundaries; Qwen keeps
<|vision_start|> and <|vision_end|>.

Route EXAONE 4.5 through the Qwen2.5-VL-style encode path (window attention
pattern, optional mmproj input norm). Update exaone4_5 projector weights and
convert_hf_to_gguf for mmproj export.

  • mtmd: load EXAONE4 nextn tensors correctly

Align EXAONE4 tensor registration with EXAONE_MOE for NextN/MTP slots and avoid skip-flag propagation on duplicated rope_freqs so model loading succeeds for EXAONE 4.5 GGUF.

  • Minor fixes

  • Address PR feedback

  • Address PR feedback

  • Fix EXAONE after merge

  • Fix EXAONE 4.5 conversion

  • Address PR feedback

  • Refactor EXAONE 4.5 conversion

  • Address PR feedback

  • Fix unintended deletion

  • Minor fix


Co-authored-by: LG-AI-EXAONE [email protected]

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from llama.cpp releases