r/LocalLLaMA · · 1 min read

clark-labs/clark-air-sana-1.6b-1.58bit · Hugging Face

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

clark-labs/clark-air-sana-1.6b-1.58bit · Hugging Face

A Sana 1.6B text-to-image transformer compressed to ternary (~1.85 bits/weight): 8.6× smaller than FP16, near-FP16 quality.

Footprint (measured)

Artifact Size vs FP16 What it is
FP16 transformer 3.21 GB 1× (100%) reference
Clark Air (packed) 374 MB 8.6× (≈12%) packed ternary (clark-air-sana-1.6b-packed.safetensors)
Clark Air (unpacked) 3.21 GB compatibility this repo's transformer/, dequantized bf16, drop-in diffusers

Measured ~1.85 bits/weight → 8.6× smaller (374 MB packed ÷ 3.21 GB FP16).

About

The transformer weights are quantized to ternary with group-wise scales; a small high-precision tail (~5% of parameters, the conditioning and projection layers) is kept at higher precision.

  • Base: Sana 1.6B, 512px

License

Apache-2.0 © Clark Labs, Inc.

submitted by /u/pmttyji
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA