Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
| If you've tried to study modern diffusion models by digging through the official diffusers library, you know it can be overwhelming with its complexity and abstractions. I wanted to simplify FLUX diffusion models, so I built minFLUX: a PyTorch implementation focused on its core architecture and math. Here is the project: https://github.com/purohit10saurabh/minFLUX What’s inside: - Minimal FLUX.1 + FLUX.2 implementation with VAE and transformer model. - Line-by-line mappings to the source HuggingFace diffusers. - Training loop (VAE encode → flow matching → velocity MSE) - Inference loop (noise → Euler ODE → VAE decode) - Shared utilities (RoPE, timestep embeddings) The most interesting part for me was seeing that FLUX.2 is not just a scaled-up FLUX.1. It improves the transformer blocks, modulation, FFN, VAE normalization, position IDs, etc. The architecture overview of FLUX.2 is attached. Let me know if you find this interesting! 🙂 [link] [comments] |
More from r/MachineLearning
-
Loss functions in Instance Representation Learning [R]
Jun 29
-
Price elasticity model [R]
Jun 29
-
Rejected MICCAI paper: workshop -> journal/conference or directly journal/conference [R]
Jun 29
-
I built a demo agricultural planning system with an AI advisor for small-scale farmers in Nicaragua using NASA data [p]
Jun 29
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.