Hugging Face Daily Papers · 4 min read

TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Project page: https://lhmd.top/trisplat/#interactive
Code: https://github.com/ziplab/TriSplat
arxiv:2605.26115

Published on May 25 · Submitted by Weijie Wang on May 26
Authors: Weijie Wang, Zimu Li, Jinchuan Shi, Zeyu Zhang, Botao Ye, Marc Pollefeys, Donny Y. Chen, Bohan Zhuang

AI-generated summary

TriSplat is a feed-forward 3D reconstruction network that uses oriented triangle primitives to generate simulation-ready meshes directly from sparse input images in a single forward pass, bypassing expensive post-hoc mesh extraction.

Abstract

Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most existing methods remain centered on Gaussian primitives and expose surfaces only indirectly: extracting a usable mesh for downstream simulation, physics reasoning, or embodied interaction still requires expensive post-hoc steps that break the feed-forward promise. This limitation is especially pronounced in pose-free settings, where scene structure and camera parameters must be estimated jointly from sparse observations. We present TriSplat, a feed-forward reconstruction network that represents scenes with oriented triangle primitives and directly exports simulation-ready mesh scenes from a single forward pass. Given input images, the network predicts local 3D point maps, triangle attributes, camera poses, and optional intrinsics. Rather than regressing triangle orientation as an unconstrained latent variable, our approach constructs geometry normals from the predicted point maps, refines them with an image-conditioned normal head, and converts them into stable local frames for triangle parameterization. A mono-normal bootstrap schedule further stabilizes early training, while opacity and blur scheduling progressively sharpens the learned surface representation for direct mesh extraction. Experiments on RealEstate10K and DL3DV show that this representation produces more geometry-faithful reconstructions than Gaussian feed-forward baselines while maintaining competitive novel-view rendering quality. Because the rendering primitives are themselves surface triangles, the output can be directly ingested by physics engines, collision detectors, and standard rendering pipelines without any conversion, making it a practical simulation-ready solution for feed-forward 3D scene reconstruction.
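
The abstract describes building geometry normals from the predicted point maps and turning them into stable local frames for the triangles. The following is a minimal NumPy sketch of that general idea only, not the authors' implementation: the function names `normals_from_point_map` and `tangent_frames` are hypothetical, the finite-difference cross product and the helper-axis frame construction are common techniques assumed here for illustration, and the image-conditioned normal head that refines the normals is omitted.

```python
import numpy as np


def normals_from_point_map(points):
    """Estimate per-pixel unit normals from an (H, W, 3) point map.

    Assumption for illustration: the normal is the cross product of the
    finite-difference tangents along the two image axes.
    """
    du = np.gradient(points, axis=1)  # change of the 3D point along image columns
    dv = np.gradient(points, axis=0)  # change of the 3D point along image rows
    n = np.cross(du, dv)
    length = np.linalg.norm(n, axis=-1, keepdims=True)
    return n / np.clip(length, 1e-8, None)


def tangent_frames(normals):
    """Build a right-handed orthonormal frame (t, b, n) for each unit normal.

    Returns an array of shape (..., 3, 3) whose columns are t, b, n.
    The helper axis is switched from x to y when the normal is close to
    the x-axis, so the cross product never degenerates.
    """
    near_x = np.abs(normals[..., :1]) >= 0.9
    helper = np.where(near_x, [0.0, 1.0, 0.0], [1.0, 0.0, 0.0])
    t = np.cross(helper, normals)
    t /= np.linalg.norm(t, axis=-1, keepdims=True)
    b = np.cross(normals, t)
    return np.stack([t, b, normals], axis=-1)


if __name__ == "__main__":
    # Toy point map: a tilted plane z = 0.5 * x sampled on a 4 x 5 pixel grid.
    ys, xs = np.mgrid[0:4, 0:5].astype(float)
    point_map = np.stack([xs, ys, 0.5 * xs], axis=-1)
    normals = normals_from_point_map(point_map)
    frames = tangent_frames(normals)
    print(normals[0, 0], frames.shape)
```

Deriving the frame from the normal, rather than regressing a free rotation, leaves only an in-plane degree of freedom for the triangle; that is one plausible reading of why the abstract calls the resulting frames "stable".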

Get this paper in your agent:

hf papers read 2605.26115
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 1

Datasets citing this paper 1
