Surflo: Consistent 3D Surface Flow Model with Global State
Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.
Surflo: Consistent 3D Surface Flow Model with Global State
Abstract
Surflo compresses unposed RGB views into latent tokens and decodes 3D surface points through flow matching, enabling flexible resolution output and efficient processing compared to existing methods.
Geometry is invariant to viewpoint, which makes any collection of images a redundant encoding of a single 3D state. Existing feed-forward reconstruction models fail to exploit this: per-view methods emit overlapping, unaligned pointmaps that grow linearly with input count, while global-latent methods commit to a fixed, low-resolution output. We introduce Surflo, which compresses a variable number of unposed RGB views into K latent tokens-one global state-and decodes oriented 3D surface points by independently transporting them from noise onto the surface via flow matching. This frees the output from any fixed grid or token budget: the same latent yields from a few thousand to a million points in a single forward pass. To suppress the local inconsistencies inherent to independent per-point decoding, an inference-time guidance term correlates nearby points by injecting a photometric gradient during ODE integration. Surflo matches or surpasses feed-forward baselines on surface metrics, runs an order of magnitude faster than optimization-based methods that require hundreds of views, and is the only feed-forward approach to combine a global latent with arbitrary-resolution decoding.
Get this paper in your agent:
hf papers read 2606.13644 curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper
More from Hugging Face Daily Papers
-
On the Limits of LLM Adaptability: Impact of Model-Internalized Priors on Annotation Task Performance
Jun 12
-
Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior
Jun 12
-
See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents
Jun 12
-
Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents
Jun 12
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.