llama.cpp now supports model management (downloading etc) via API
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
#23976 got merged a couple hours ago, which means llama.cpp can now not only load/unload models on demand from a directory, but also download them on demand. No UI yet, but that's coming pretty soon.
This means you can now deploy llama.cpp, expose the API, and manage the complete lifecycle using it and nothing else.
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.