LLM Phone Home: Reliable apps that can deliver inference from a local backend
Mirrored from r/LocalLLaMA.
Hello all,
I'm wondering what suggestions there are for an iOS app that can connect to an OpenAI-compatible endpoint. I am using 3 Sparks, which works GREAT for that specific use, BUT there is no MCP, no web search, etc. I want to show people that a local model with web search on your phone is very impressive, but I can't find an app that can mimic OWUI/LMS/etc.
Texting Hermes works, but I was hoping to find a solution that doesn't rely on a slow agent, just direct requests to the local server.
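For context, what such an app would need to send is just a standard OpenAI-style chat completion request with a `tools` array; anything beyond that (actually executing the search, feeding results back) is on the client. A minimal sketch below, where the server address, model name, and `web_search` tool definition are all placeholders for whatever your local server (LM Studio, etc.) actually exposes:

```python
import json
from urllib import request

# Hypothetical local server address; adjust host/port to your setup
# (LM Studio's local server defaults to port 1234, for example).
BASE_URL = "http://192.168.1.10:1234/v1/chat/completions"

def build_payload(user_message: str) -> dict:
    """Build an OpenAI-compatible chat request that advertises a
    web-search tool for the model to call."""
    return {
        "model": "local-model",  # placeholder; many local servers ignore this
        "messages": [{"role": "user", "content": user_message}],
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "web_search",  # hypothetical tool name
                    "description": "Search the web and return result snippets.",
                    "parameters": {
                        "type": "object",
                        "properties": {"query": {"type": "string"}},
                        "required": ["query"],
                    },
                },
            }
        ],
    }

def send(payload: dict) -> bytes:
    """POST the payload to the local endpoint (requires the server running)."""
    req = request.Request(
        BASE_URL,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return resp.read()
```

The point is that there's no agent framework needed on the phone: the app only has to POST this JSON, notice a `tool_calls` entry in the response, run the search, and append the result as a `tool` message in a follow-up request.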
So far, I tried:
Apollo, Locally AI, Noema, and 3 Sparks. Previously I went through other apps that run models in situ (on the iPhone itself), but they don't support remote endpoints. Noema seemed promising, but DeepSeek V4 Flash served from my Mac Studio never makes it through a request (it works great with 3 Sparks, but that app has no web search or MCP capability).