made a local voice AI for windows you can talk to in any language. open source, bring your own key
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| been building this on and off for a while and finally got it to a point where i'm not embarrassed to share it, so here goes. it's called Shadow AI. basically a voice-first AI companion that runs on your own windows machine. you just talk to it and it talks back, no typing required. i mostly made it because i was tired of opening a chat window and pecking at a keyboard every time i wanted to ask something or get a quick task done. stuff it can do: - actual voice conversations, not the awkward push-to-talk kind. you just talk. - you can talk to it in literally any language. switch mid-sentence if you want, it keeps up. this honestly turned out way better than i expected. - web search built in. it runs its own local SearXNG instance so you're not piping your searches through some third party, and no docker setup or any of that nonsense, it just works after install. - it remembers stuff. it builds up a memory of things you've told it across sessions so you're not reintroducing yourself every time. - reminders / scheduled tasks. tell it to remind you about something later and it actually does. - optional google stuff if you want it. gmail, calendar, drive, docs, sheets, contacts. it's completely optional and the app works fine without ever touching it. there's a little in-app guide that walks you through connecting your own google account so nothing goes through me or any server. - it can learn little skills over time based on how you use it. a few honest notes: it's windows only. that's not a "for now" thing, it's just what i built it for and where i use it, so don't expect a mac/linux build. it's bring-your-own-key. you paste in your own free gemini api key during setup. costs me nothing and costs you nothing (the free tier is plenty for normal use), and your key stays local on your machine. it's open source under AGPL-3.0 and the whole thing runs locally on 127.0.0.1, nothing phones home. if you want to read the code or check what it's doing, it's all there. closest comparison i can give is "think OpenClaw but voice-only," just so you have a mental picture. for the record this is an independent project, not affiliated with or endorsed by OpenClaw in any way, "OpenClaw" is the trademark of its respective owner. i'm just using it as a reference point. it's a solo project so issues are welcome but i can't promise i'll get to all of them, and i'm not really taking pull requests, i'd rather keep it built the way i want it. just being upfront about that. happy to answer questions. would genuinely love to hear what people think. https://github.com/shadowdoggie/shadow-ai EDIT: I am sorry, i messed up big time, i realize that now. My post title says "local", and as of right now there is no local model available yet, and i am truly sorry for that, and you folks rightfully downvoted my comments into oblivion. I promise i will make it up to you folks, and i already put it on the roadmap on the github repo, and even if i violated any rules for posting and the post gets removed, i will still make sure the application WILL get local model options for both voice aswell as the subagents feature. Again my sincere apologies. [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.