r/LocalLLaMA · · 1 min read

bytedance released an open source model that attempts to do just about anything with only 3b parameters

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

bytedance released an open source model that attempts to do just about anything with only 3b parameters

Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework.

  • Efficient at 3B scale. With only 3B active parameters, Lance delivers strong performance across image generation, image editing, and video generation benchmarks.
  • Trained from scratch. Lance is built with a staged multi-task recipe and trained entirely from scratch within a 128-A100-GPU budget.
submitted by /u/uxl
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA