clark-labs/clark-air-sana-1.6b-1.58bit · Hugging Face
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| A Sana 1.6B text-to-image transformer compressed to ternary (~1.85 bits/weight): 8.6× smaller than FP16, near-FP16 quality. Footprint (measured)
Measured ~1.85 bits/weight → 8.6× smaller (374 MB packed ÷ 3.21 GB FP16). AboutThe transformer weights are quantized to ternary with group-wise scales; a small high-precision tail (~5% of parameters, the conditioning and projection layers) is kept at higher precision.
LicenseApache-2.0 © Clark Labs, Inc. [link] [comments] |
More from r/LocalLLaMA
-
What's in your RAG?
Jul 2
-
Palantir CEO rages against closed models
Jul 2
-
A cheap trick for reliable structured output: feed the validation error back into the retry
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.