llama.cpp releases · · 1 min read

b9128

Mirrored from llama.cpp releases for archival readability. Support the source by reading on the original site.

hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)

  • hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase

  • hmx-mm: optimize per-group scale handling

  • hmx-fa: optimize slope load from vtcm

  • hmx-fa: use aligned access where possible in hmx-utils

  • hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers


Co-authored-by: Max Krasnyansky [email protected]

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from llama.cpp releases