[browser-use-wasm] I made a browser-use agent that runs in WASM at zero cost
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
The only cost is electricity! I built this in a few weeks since I couldn't find anything else like it.

Demo: https://pdufour.github.io/browser-use-wasm/

One thing I've wanted to do for a while is add a widget to my page that lets me control the complete webpage, just like any of the browser-use agents can. The key distinction is that I wanted it to be fully self-contained, with no server involved. After a few weeks of tinkering, I have a fairly good browser-use model running entirely via Snapdom / WASM / WebGPU / Wllama / ShowUi-2b, plus a little JS to tie it all together.

The browser-use library I developed can handle all this:
Some lessons I learned making things others might find helpful:
This code is super super alpha and a lot of stuff is probably broken, but I thought I would share it with Reddit to ask for feedback and see if people have any ideas on how to develop this further. I'm open to any ideas!
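For readers wondering how a fully client-side setup like this might fit together, here is a minimal sketch of an observe, predict, act loop. It is not the project's actual code: the `AgentAction` shape, the `AgentDeps` interface, and the `toPixels` and `runAgent` names are all hypothetical, and the assumption that the model returns positions normalized to the 0..1 range is mine. In the stack described above, `capture` would presumably be backed by Snapdom and `predict` by ShowUi-2b running through Wllama, but no real library calls are made here.

```typescript
// Hypothetical action shape (an assumption for illustration, not the project's schema).
// Positions are assumed to be normalized to the 0..1 range.
type AgentAction =
  | { kind: "click"; position: [number, number] }
  | { kind: "type"; position: [number, number]; text: string }
  | { kind: "done" };

// Dependencies are injected so the loop does not depend on any specific library.
interface AgentDeps {
  capture: () => Promise<string>; // e.g. a screenshot encoded as a data URL
  predict: (goal: string, image: string) => Promise<AgentAction>; // local model call
  perform: (action: AgentAction, x: number, y: number) => Promise<void>; // page side effect
  viewport: () => { width: number; height: number };
}

// Convert a normalized position into pixel coordinates, clamped to the viewport.
function toPixels(
  position: [number, number],
  width: number,
  height: number
): [number, number] {
  const clamp = (v: number): number => Math.min(1, Math.max(0, v));
  return [
    Math.round(clamp(position[0]) * width),
    Math.round(clamp(position[1]) * height),
  ];
}

// Observe -> predict -> act, until the model reports "done" or the step budget runs out.
async function runAgent(
  goal: string,
  deps: AgentDeps,
  maxSteps = 10
): Promise<AgentAction[]> {
  const history: AgentAction[] = [];
  for (let step = 0; step < maxSteps; step++) {
    const image = await deps.capture();
    const action = await deps.predict(goal, image);
    history.push(action);
    if (action.kind === "done") break;
    const { width, height } = deps.viewport();
    const [x, y] = toPixels(action.position, width, height);
    await deps.perform(action, x, y);
  }
  return history;
}
```

Keeping capture, inference, and DOM actions behind an interface means the loop can be exercised with stubs before wiring in the real screenshot and model code, and the step budget keeps a confused model from looping forever.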