Where are we with computer-control harnesses?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Seems like local vision language models models are getting smart enough so that it would be useful to hand them the cursor in a secure sandbox. What harnesses are available that can do this?
edit: oh my fucking God something about this post triggered all of the bots to come out and post their sloppy LinkedIn style bullshit. Fuck off.
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.