r/LocalLLaMA · · 1 min read

How to use audio and vision modalities in llama.cpp?

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

How to use audio and vision modalities in llama.cpp with Gemma4 12B it?

I’m on release b9494, but when I run llama-cli it shows “modalities: text” only, and crashes if I try to add an image.

submitted by /u/No-Leave-4512
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA