-
UI-TARS-desktop
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
-
SimpleMem
SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal
-
claude-video-vision
Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis