r/LocalLLaMA · May 13, 2026 · 1 min read

sensenova/SenseNova-U1-A3B-MoT · Hugging Face

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

sensenova/SenseNova-U1-A3B-MoT · Hugging Face

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

🚀 SenseNova U1 is a new series of native multimodal models that unifies multimodal understanding, reasoning, and generation within a monolithic architecture. It marks a fundamental paradigm shift in multimodal AI: from modality integration to true unification. Rather than relying on adapters to translate between modalities, SenseNova U1 models think-and-act across language and vision natively.

Unifying visual understanding and generation in an end-to-end architecture from pixel to word opens tremendous possibilities, enabling highly efficient and strong understanding, generation, and interleaved reasoning in a natively multimodal manner.

Model	Params	HF Weights
SenseNova-U1-8B-MoT-SFT	8B MoT	🤗 link
SenseNova-U1-8B-MoT	8B MoT	🤗 link
SenseNova-U1-8B-MoT-LoRA-8step-V1.0	0.4B	🤗 link
SenseNova-U1-A3B-MoT-SFT	A3B MoT	🤗 link
SenseNova-U1-A3B-MoT	A3B MoT	🤗 link

2 weeks ago, they released 8B model mentioned in above table.

submitted by /u/pmttyji
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/LocalLLaMA