Hugging Face Daily Papers · May 18, 2026 · 4 min read

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Like Read original ↗

We propose COVER (Coverage-Oriented Viewpoint curation with ERP Range-depth warping), a training-free ERP viewpoint curator that projects geometry observed from selected views into candidate ERP probes, scores incremental coverage, and penalizes depth conflicts. Under bounded proxy error, its greedy coverage proxy preserves the standard coverage-style approximation behavior up to an additive error term. Using COVER, we build CM-EVS (Coverage-curated Metric ERP View Set), a panoramic RGB-D-pose dataset with 36,373 curated ERP frames from 1,275 indoor scenes across Blender indoor, HM3D, and ScanNet++, complemented by outdoor panoramas from TartanGround and OB3D re-encoded into the same schema.</p>\n","updatedAt":"2026-05-18T10:38:37.294Z","author":{"_id":"64b76528fdb702b3d8641514","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64b76528fdb702b3d8641514/Ho-uWcQCAEIURM1lhWEWJ.jpeg","fullname":"Jungang Li","name":"Jungang","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8676348924636841},"editors":["Jungang"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64b76528fdb702b3d8641514/Ho-uWcQCAEIURM1lhWEWJ.jpeg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2605.15597","authors":[{"_id":"6a0aebc03049bece374a85f8","name":"Jiale Liu","hidden":false},{"_id":"6a0aebc03049bece374a85f9","name":"Jungang Li","hidden":false},{"_id":"6a0aebc03049bece374a85fa","name":"Jieming Yu","hidden":false},{"_id":"6a0aebc03049bece374a85fb","name":"Xinglin Yu","hidden":false},{"_id":"6a0aebc03049bece374a85fc","name":"Zihao Dongfang","hidden":false},{"_id":"6a0aebc03049bece374a85fd","name":"Zongjian Ding","hidden":false},{"_id":"6a0aebc03049bece374a85fe","name":"Kaifeng Ding","hidden":false},{"_id":"6a0aebc03049bece374a85ff","name":"Yi Yang","hidden":false},{"_id":"6a0aebc03049bece374a8600","name":"Lidong Chen","hidden":false},{"_id":"6a0aebc03049bece374a8601","name":"Yang Zou","hidden":false},{"_id":"6a0aebc03049bece374a8602","name":"Shunwen Bai","hidden":false},{"_id":"6a0aebc03049bece374a8603","name":"Jiahuan Zhang","hidden":false},{"_id":"6a0aebc03049bece374a8604","name":"Haoran Huang","hidden":false},{"_id":"6a0aebc03049bece374a8605","name":"Shan Huang","hidden":false},{"_id":"6a0aebc03049bece374a8606","name":"Yudong Gao","hidden":false},{"_id":"6a0aebc03049bece374a8607","name":"Mingjun Cheng","hidden":false}],"publishedAt":"2026-05-15T00:00:00.000Z","submittedOnDailyAt":"2026-05-18T00:00:00.000Z","title":"CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage","submittedOnDailyBy":{"_id":"64b76528fdb702b3d8641514","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64b76528fdb702b3d8641514/Ho-uWcQCAEIURM1lhWEWJ.jpeg","isPro":false,"fullname":"Jungang Li","user":"Jungang","type":"user","name":"Jungang"},"summary":"Modern 3D visual learning relies on observations sampled from metric 3D assets, yet existing scans, meshes, point clouds, simulations, and reconstructions do not directly provide a sparse, comparable, and geometry-consistent panoramic training interface. Dense trajectories duplicate nearby views, source-specific rendering policies yield heterogeneous annotations, and sparse heuristics may miss important regions or introduce depth-inconsistent observations. We study how to convert 3D assets into sparse panoramic RGB-D-pose data that preserves complete scene coverage with low redundancy and auditable provenance. We propose COVER (Coverage-Oriented Viewpoint curation with ERP Range-depth warping), a training-free ERP viewpoint curator that projects geometry observed from selected views into candidate ERP probes, scores incremental coverage, and penalizes depth conflicts. Under bounded proxy error, its greedy coverage proxy preserves the standard coverage-style approximation behavior up to an additive error term. Using COVER, we build CM-EVS (Coverage-curated Metric ERP View Set), a panoramic RGB-D-pose dataset with 36,373 curated ERP frames from 1,275 indoor scenes across Blender indoor, HM3D, and ScanNet++, complemented by outdoor panoramas from TartanGround and OB3D re-encoded into the same schema. Each frame provides full-sphere RGB, metric range depth, calibrated pose; COVER-produced indoor frames include per-step provenance logs. With a median of only 25 frames per indoor scene, CM-EVS covers all 13 unified room types while maintaining compact scene-level coverage. Experiments show that COVER improves the coverage-conflict trade-off, making CM-EVS a sparse, compact, and auditable RGB-D-pose resource for geometry-consistent panoramic 3D learning.","upvotes":8,"discussionId":"6a0aebc03049bece374a8608","githubRepo":"https://github.com/Strange-animalss/CM-EVS","githubRepoAddedBy":"user","githubStars":2},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"64b76528fdb702b3d8641514","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64b76528fdb702b3d8641514/Ho-uWcQCAEIURM1lhWEWJ.jpeg","isPro":false,"fullname":"Jungang Li","user":"Jungang","type":"user"},{"_id":"646279f22538819c729e9e96","avatarUrl":"/avatars/457905d7c84df52723ce3be163139dcb.svg","isPro":false,"fullname":"Arc","user":"ManfredC","type":"user"},{"_id":"6895f2997d5e8a6c34bc2155","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Q3a043oiVPA85dmwsr5Jb.png","isPro":false,"fullname":"Xinglin Yu","user":"rainbow180121920","type":"user"},{"_id":"68d36ca698cf2f221b944019","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/nBayLsE3uxUf78Hv8Jgft.png","isPro":false,"fullname":"yudonggao","user":"yudong0504","type":"user"},{"_id":"65db5f578c1745678f0ed708","avatarUrl":"/avatars/4e2de6f5f3a936447b7e391cb14c5346.svg","isPro":false,"fullname":"DONGFANG ZIHAO","user":"UUUserna","type":"user"},{"_id":"654666bd8767484a051bc32a","avatarUrl":"/avatars/b5dad7e01eb62032817b7177c5cb50c0.svg","isPro":false,"fullname":"Irene Yu","user":"iry","type":"user"},{"_id":"6a0af5e4fb524f5f65cdf440","avatarUrl":"/avatars/90441d5544612938fd18d2cb4d26d112.svg","isPro":false,"fullname":"Jane.J","user":"jjzky","type":"user"},{"_id":"647ccbfed2da33779cbabad5","avatarUrl":"/avatars/c2d3c2157feaaf8b4dccda03f3dbc26b.svg","isPro":false,"fullname":"WANG","user":"slarkprime","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2605/2605.15597.md"}">

Papers

arxiv:2605.15597

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

Published on May 15

· Submitted by

Jungang Li on May 18

Upvote

Authors:

Abstract

Modern 3D visual learning relies on observations sampled from metric 3D assets, yet existing scans, meshes, point clouds, simulations, and reconstructions do not directly provide a sparse, comparable, and geometry-consistent panoramic training interface. Dense trajectories duplicate nearby views, source-specific rendering policies yield heterogeneous annotations, and sparse heuristics may miss important regions or introduce depth-inconsistent observations. We study how to convert 3D assets into sparse panoramic RGB-D-pose data that preserves complete scene coverage with low redundancy and auditable provenance. We propose COVER (Coverage-Oriented Viewpoint curation with ERP Range-depth warping), a training-free ERP viewpoint curator that projects geometry observed from selected views into candidate ERP probes, scores incremental coverage, and penalizes depth conflicts. Under bounded proxy error, its greedy coverage proxy preserves the standard coverage-style approximation behavior up to an additive error term. Using COVER, we build CM-EVS (Coverage-curated Metric ERP View Set), a panoramic RGB-D-pose dataset with 36,373 curated ERP frames from 1,275 indoor scenes across Blender indoor, HM3D, and ScanNet++, complemented by outdoor panoramas from TartanGround and OB3D re-encoded into the same schema. Each frame provides full-sphere RGB, metric range depth, calibrated pose; COVER-produced indoor frames include per-step provenance logs. With a median of only 25 frames per indoor scene, CM-EVS covers all 13 unified room types while maintaining compact scene-level coverage. Experiments show that COVER improves the coverage-conflict trade-off, making CM-EVS a sparse, compact, and auditable RGB-D-pose resource for geometry-consistent panoramic 3D learning.

View arXiv page View PDF GitHub 2 Add to collection

Community

Jungang

Paper submitter about 15 hours ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2605.15597

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.15597 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.15597 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.15597 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

No comments yet. Sign in and be the first to say something.

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

Abstract

Community

Models citing this paper 0

Datasets citing this paper 0

Spaces citing this paper 0

Collections including this paper 0

Discussion (0)

More from Hugging Face Daily Papers