Hugging Face Daily Papers · · 10 min read

The Road Ahead in Autonomous Driving: The KITScenes Multimodal Dataset

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

<strong>We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.</strong></p>\n<h2 class=\"relative group flex items-baseline\">\n\t<a id=\"highlights\" class=\"block pr-1.5 text-lg md:absolute md:p-1.5 md:opacity-0 md:group-hover:opacity-100 md:right-full\" href=\"#highlights\" rel=\"nofollow\">\n\t\t<span class=\"header-link\"><svg class=\"text-gray-500 hover:text-black dark:hover:text-gray-200 w-4\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\" aria-hidden=\"true\" role=\"img\" width=\"1em\" height=\"1em\" preserveAspectRatio=\"xMidYMid meet\" viewBox=\"0 0 256 256\"><path d=\"M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z\" fill=\"currentColor\"></path></svg></span>\n\t</a>\n\t<span>\n\t\tHighlights\n\t</span>\n</h2>\n<ul>\n<li><strong>European urban focus</strong> — recordings from Karlsruhe, Frankfurt, and Sindelfingen</li>\n<li><strong>High-fidelity sensor suite</strong> — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS</li>\n<li><strong>Long-range sensing</strong> — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets</li>\n<li><strong>Production-grade HD maps</strong> — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles</li>\n<li><strong>Research benchmarks</strong> — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research</li>\n</ul>\n<p><video src=\"https://cdn-uploads.huggingface.co/production/uploads/67d308b6bf75e7929a799d40/IWs7AX_7mxRdwe31NaXXb.mp4\" controls=\"\" class=\"max-w-full!\"></video></p>\n","updatedAt":"2026-06-05T19:36:34.148Z","author":{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","fullname":"Fabian Immel","name":"immel-f","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.7117511630058289},"editors":["immel-f"],"editorAvatarUrls":["/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg"],"reactions":[],"isReport":false}},{"id":"6a2325c2c8f54b0666d94317","author":{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","fullname":"Fabian Immel","name":"immel-f","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"createdAt":"2026-06-05T19:38:42.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"**We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.**\n\n## Highlights\n\n- **European urban focus** — recordings from Karlsruhe, Frankfurt, and Sindelfingen\n- **High-fidelity sensor suite** — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS\n- **Long-range sensing** — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets\n- **Production-grade HD maps** — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles\n- **Research benchmarks** — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research","html":"<p><strong>We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.</strong></p>\n<h2 class=\"relative group flex items-baseline\">\n\t<a id=\"highlights\" class=\"block pr-1.5 text-lg md:absolute md:p-1.5 md:opacity-0 md:group-hover:opacity-100 md:right-full\" href=\"#highlights\" rel=\"nofollow\">\n\t\t<span class=\"header-link\"><svg class=\"text-gray-500 hover:text-black dark:hover:text-gray-200 w-4\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\" aria-hidden=\"true\" role=\"img\" width=\"1em\" height=\"1em\" preserveAspectRatio=\"xMidYMid meet\" viewBox=\"0 0 256 256\"><path d=\"M167.594 88.393a8.001 8.001 0 0 1 0 11.314l-67.882 67.882a8 8 0 1 1-11.314-11.315l67.882-67.881a8.003 8.003 0 0 1 11.314 0zm-28.287 84.86l-28.284 28.284a40 40 0 0 1-56.567-56.567l28.284-28.284a8 8 0 0 0-11.315-11.315l-28.284 28.284a56 56 0 0 0 79.196 79.197l28.285-28.285a8 8 0 1 0-11.315-11.314zM212.852 43.14a56.002 56.002 0 0 0-79.196 0l-28.284 28.284a8 8 0 1 0 11.314 11.314l28.284-28.284a40 40 0 0 1 56.568 56.567l-28.285 28.285a8 8 0 0 0 11.315 11.314l28.284-28.284a56.065 56.065 0 0 0 0-79.196z\" fill=\"currentColor\"></path></svg></span>\n\t</a>\n\t<span>\n\t\tHighlights\n\t</span>\n</h2>\n<ul>\n<li><strong>European urban focus</strong> — recordings from Karlsruhe, Frankfurt, and Sindelfingen</li>\n<li><strong>High-fidelity sensor suite</strong> — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS</li>\n<li><strong>Long-range sensing</strong> — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets</li>\n<li><strong>Production-grade HD maps</strong> — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles</li>\n<li><strong>Research benchmarks</strong> — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research</li>\n</ul>\n","updatedAt":"2026-06-05T19:38:42.419Z","author":{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","fullname":"Fabian Immel","name":"immel-f","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7375171184539795},"editors":["immel-f"],"editorAvatarUrls":["/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg"],"reactions":[],"isReport":false}},{"id":"6a237c0d3da15f2b696638da","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":362,"isUserFollowing":false},"createdAt":"2026-06-06T01:46:53.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is an automated message from the [Librarian Bot](https://huggingface.co/librarian-bots). I found the following papers similar to this paper. \n\nThe following papers were recommended by the Semantic Scholar API \n\n* [CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography](https://huggingface.co/papers/2605.05014) (2026)\n* [4DLidarOpen: An Open 4D FMCW Lidar Dataset for Motion-Aware Autonomous Driving](https://huggingface.co/papers/2605.18074) (2026)\n* [123D: Unifying Multi-Modal Autonomous Driving Data at Scale](https://huggingface.co/papers/2605.08084) (2026)\n* [MapATM: Enhancing HD Map Construction through Actor Trajectory Modeling](https://huggingface.co/papers/2604.11081) (2026)\n* [Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving](https://huggingface.co/papers/2605.22809) (2026)\n* [nuReasoning: A Reasoning-Centric Dataset and Benchmark for Long-Tail Autonomous Driving](https://huggingface.co/papers/2605.31572) (2026)\n* [EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing](https://huggingface.co/papers/2604.11400) (2026)\n\n\n Please give a thumbs up to this comment if you found it helpful!\n\n If you want recommendations for any Paper on Hugging Face checkout [this](https://huggingface.co/spaces/librarian-bots/recommend_similar_papers) Space\n\n You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: `@librarian-bot recommend`","html":"<p>This is an automated message from the <a href=\"https://huggingface.co/librarian-bots\">Librarian Bot</a>. I found the following papers similar to this paper. </p>\n<p>The following papers were recommended by the Semantic Scholar API </p>\n<ul>\n<li><a href=\"https://huggingface.co/papers/2605.05014\">CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2605.18074\">4DLidarOpen: An Open 4D FMCW Lidar Dataset for Motion-Aware Autonomous Driving</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2605.08084\">123D: Unifying Multi-Modal Autonomous Driving Data at Scale</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2604.11081\">MapATM: Enhancing HD Map Construction through Actor Trajectory Modeling</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2605.22809\">Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2605.31572\">nuReasoning: A Reasoning-Centric Dataset and Benchmark for Long-Tail Autonomous Driving</a> (2026)</li>\n<li><a href=\"https://huggingface.co/papers/2604.11400\">EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing</a> (2026)</li>\n</ul>\n<p> Please give a thumbs up to this comment if you found it helpful!</p>\n<p> If you want recommendations for any Paper on Hugging Face checkout <a href=\"https://huggingface.co/spaces/librarian-bots/recommend_similar_papers\">this</a> Space</p>\n<p> You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: <code><span class=\"SVELTE_PARTIAL_HYDRATER contents\" data-target=\"UserMention\" data-props=\"{&quot;user&quot;:&quot;librarian-bot&quot;}\"><span class=\"inline-block\"><span class=\"contents\"><a href=\"/librarian-bot\">@<span class=\"underline\">librarian-bot</span></a></span> </span></span> recommend</code></p>\n","updatedAt":"2026-06-06T01:46:53.857Z","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":362,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7075566649436951},"editors":["librarian-bot"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2606.02956","authors":[{"_id":"6a20279c15100c5272a83fef","user":{"_id":"63722445e8891da06b2a2e6e","avatarUrl":"/avatars/8e1fddd6c012e58867332b972d6d783c.svg","isPro":false,"fullname":"Richard Schwarzkopf","user":"R-Fehler","type":"user","name":"R-Fehler"},"name":"Richard Schwarzkopf","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:57.129Z","hidden":false},{"_id":"6a20279c15100c5272a83ff0","user":{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","isPro":false,"fullname":"Fabian Immel","user":"immel-f","type":"user","name":"immel-f"},"name":"Fabian Immel","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:59.627Z","hidden":false},{"_id":"6a20279c15100c5272a83ff1","user":{"_id":"682e02a18d23217f0a697a08","avatarUrl":"/avatars/f23532f5340cafff4ce575a0cb55d780.svg","isPro":false,"fullname":"Alexander Blumberg","user":"ABlumberg","type":"user","name":"ABlumberg"},"name":"Alexander Blumberg","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:55.245Z","hidden":false},{"_id":"6a20279c15100c5272a83ff2","name":"Jonas Merkert","hidden":false},{"_id":"6a20279c15100c5272a83ff3","name":"Nils Rack","hidden":false},{"_id":"6a20279c15100c5272a83ff4","user":{"_id":"6797fb9f7162b6652e64b7e5","avatarUrl":"/avatars/5966e697354be18dc43240c9de9be05e.svg","isPro":false,"fullname":"Kaiwen Wang","user":"kaiwen-wang","type":"user","name":"kaiwen-wang"},"name":"Kaiwen Wang","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:53.223Z","hidden":false},{"_id":"6a20279c15100c5272a83ff5","user":{"_id":"69ef930e71beb7293d016cfc","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/69ef930e71beb7293d016cfc/Tj1bRtydXyBQrXoAnc9z5.jpeg","isPro":false,"fullname":"Fabian Konstantinidis","user":"fabikonsti","type":"user","name":"fabikonsti"},"name":"Fabian Konstantinidis","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:50.919Z","hidden":false},{"_id":"6a20279c15100c5272a83ff6","user":{"_id":"689cbd627694d1d7a741e7d2","avatarUrl":"/avatars/a8d69f5a7ce07c574d209e52dc132ce6.svg","isPro":false,"fullname":"Julian Truetsch","user":"jtruetsch","type":"user","name":"jtruetsch"},"name":"Julian Truetsch","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:48.651Z","hidden":false},{"_id":"6a20279c15100c5272a83ff7","user":{"_id":"67efd1bdd652a85830fba699","avatarUrl":"/avatars/f23048d88a822df3d47fbe16c4045123.svg","isPro":false,"fullname":"Fernandez","user":"c-fernandez","type":"user","name":"c-fernandez"},"name":"Carlos Fernandez","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:46.547Z","hidden":false},{"_id":"6a20279c15100c5272a83ff8","user":{"_id":"67ea8a7a8d223bab85df863c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/QYVjn5LrqoeSyOujMBuF7.png","isPro":false,"fullname":"Annika Baetz","user":"AnnikaJulia","type":"user","name":"AnnikaJulia"},"name":"Annika Bätz","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:44.766Z","hidden":false},{"_id":"6a20279c15100c5272a83ff9","user":{"_id":"635c21e5dc371b8f9105d985","avatarUrl":"/avatars/65b2117319ad3e55ba558cb247b15380.svg","isPro":false,"fullname":"Kevin Rösch","user":"keroe","type":"user","name":"keroe"},"name":"Kevin Rösch","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:43.016Z","hidden":false},{"_id":"6a20279c15100c5272a83ffa","user":{"_id":"69c65436faed05be5d8b0ae6","avatarUrl":"/avatars/95c79f24fc234cb10ad149f816f57e30.svg","isPro":false,"fullname":"Marlon Steiner","user":"marlon-steiner","type":"user","name":"marlon-steiner"},"name":"Marlon Steiner","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:41.170Z","hidden":false},{"_id":"6a20279c15100c5272a83ffb","user":{"_id":"69f1a348f0694aff7501d07a","avatarUrl":"/avatars/fdb23e9ac5b1bdf2076518e571c1264d.svg","isPro":false,"fullname":"Willi Poh","user":"w-poh","type":"user","name":"w-poh"},"name":"Willi Poh","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:39.182Z","hidden":false},{"_id":"6a20279c15100c5272a83ffc","user":{"_id":"67f381fa0dad3893b2e384d8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/I19MvdskpjmOA2kfJlGdc.png","isPro":false,"fullname":"Yinzhe Shen","user":"shenyinzhe","type":"user","name":"shenyinzhe"},"name":"Yinzhe Shen","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:36.752Z","hidden":false},{"_id":"6a20279c15100c5272a83ffd","user":{"_id":"665b56460e92f92b0e78e36f","avatarUrl":"/avatars/4d9b5178a032b72e7a63ac77c45094c5.svg","isPro":true,"fullname":"Royden Wagner","user":"roydenwa","type":"user","name":"roydenwa"},"name":"Royden Wagner","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:34.805Z","hidden":false},{"_id":"6a20279c15100c5272a83ffe","user":{"_id":"676d385f75e350e464ad5a38","avatarUrl":"/avatars/ead5fae98b33a5f20968cbe98c5d168e.svg","isPro":false,"fullname":"Felix Hauser","user":"FelixHauser","type":"user","name":"FelixHauser"},"name":"Felix Hauser","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:32.889Z","hidden":false},{"_id":"6a20279c15100c5272a83fff","user":{"_id":"6458a16f6fa580137af493c6","avatarUrl":"/avatars/7bf9fc3f6ec9cfcb8109bd72dabef39d.svg","isPro":false,"fullname":"Dominik Strutz","user":"dstrutz","type":"user","name":"dstrutz"},"name":"Dominik Strutz","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:30.890Z","hidden":false},{"_id":"6a20279c15100c5272a84000","user":{"_id":"635eb4a6d1c91c2ee1eb0718","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/635eb4a6d1c91c2ee1eb0718/NJlebbRQ2AJZyjTi1UX4H.jpeg","isPro":true,"fullname":"Jaime Villa","user":"jvpc0d3r","type":"user","name":"jvpc0d3r"},"name":"Jaime Villa","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:29.113Z","hidden":false},{"_id":"6a20279c15100c5272a84001","user":{"_id":"643ffa1f21ec7c165d837a7e","avatarUrl":"/avatars/743124a8143b215fe25fc791837cb2a9.svg","isPro":false,"fullname":"Gleb Stepanov","user":"glstep","type":"user","name":"glstep"},"name":"Gleb Stepanov","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:27.319Z","hidden":false},{"_id":"6a20279c15100c5272a84002","user":{"_id":"69ca7b13a347c653e8152436","avatarUrl":"/avatars/26a3a1e67a6c2c0013efdbf8e181884b.svg","isPro":false,"fullname":"Holger Caesar","user":"nightrome","type":"user","name":"nightrome"},"name":"Holger Caesar","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:25.363Z","hidden":false},{"_id":"6a20279c15100c5272a84003","user":{"_id":"64e7a39f7acd8971f2ca1e28","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64e7a39f7acd8971f2ca1e28/08E7EhG6pdmy7aHHZJThD.jpeg","isPro":false,"fullname":"Omer Sahin Tas","user":"omersahintas","type":"user","name":"omersahintas"},"name":"Ömer Şahin Taş","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:23.491Z","hidden":false},{"_id":"6a20279c15100c5272a84004","user":{"_id":"69fa35d478bb6609cd385087","avatarUrl":"/avatars/b94311400addfe101e0a303a2c9a4f69.svg","isPro":false,"fullname":"Frank Bieder","user":"FrankBieder","type":"user","name":"FrankBieder"},"name":"Frank Bieder","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:21.692Z","hidden":false},{"_id":"6a20279c15100c5272a84005","user":{"_id":"68ecfef31aca3c633e13dcfa","avatarUrl":"/avatars/a0fdca5a33397e32427d97d823df1a13.svg","isPro":false,"fullname":"Jan-Hendrik Pauls","user":"jhpauls","type":"user","name":"jhpauls"},"name":"Jan-Hendrik Pauls","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:09:19.753Z","hidden":false},{"_id":"6a20279c15100c5272a84006","name":"Christoph Stiller","hidden":false}],"mediaUrls":["https://cdn-uploads.huggingface.co/production/uploads/67d308b6bf75e7929a799d40/QzXmkvZzniWOupQMKfT_7.mp4"],"publishedAt":"2026-06-01T00:00:00.000Z","submittedOnDailyAt":"2026-06-05T00:00:00.000Z","title":"The Road Ahead in Autonomous Driving: The KITScenes Multimodal Dataset","submittedOnDailyBy":{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","isPro":false,"fullname":"Fabian Immel","user":"immel-f","type":"user","name":"immel-f"},"summary":"Existing autonomous driving datasets have enabled major progress, but fall short in sensor fidelity, map completeness, or geographic diversity. We present KITScenes Multimodal, a European dataset built around high-fidelity sensors and maps. Our fully synchronized sensor suite combines high-resolution global-shutter cameras, long-range lidar beyond 400m, 4D imaging radar, and redundant GNSS/INS localization. Our HD maps are, to our knowledge, the most complete of any sensor dataset, validated through autonomous driving trials on open-source software. For the first time in a public dataset, all driving-relevant traffic elements, such as traffic lights, are mapped in 3D to a reprojection-accurate level with full topological connectivity. Recorded in cities with irregular street layouts and mixed traffic modes, our dataset complements existing datasets by broadening the available geographic diversity. We also introduce four benchmarks, each advancing spatial learning for embodied AI: online HD map construction, long-range depth estimation, novel view synthesis, and end-to-end driving. Project page: https://kitscenes.com/","upvotes":12,"discussionId":"6a20279c15100c5272a84007","projectPage":"https://kitscenes.com","ai_summary":"KITScenes Multimodal dataset provides high-fidelity European driving data with comprehensive 3D maps and diverse urban environments for embodied AI research.","ai_keywords":["autonomous driving datasets","high-fidelity sensors","HD maps","4D imaging radar","GNSS/INS localization","embodied AI","spatial learning","online HD map construction","long-range depth estimation","novel view synthesis","end-to-end driving"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct","organization":{"_id":"643e96d7e1b2a57ff0d5393b","name":"KIT-Karlsruhe","fullname":"Karlsruhe Institute of Technology","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/643e93f1e1b2a57ff0d52680/1QuQ7oevBAcFK-Q10tI22.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"67d308b6bf75e7929a799d40","avatarUrl":"/avatars/c3cfc3f5df71ee23dd0b41377c2eb457.svg","isPro":false,"fullname":"Fabian Immel","user":"immel-f","type":"user"},{"_id":"643ffa1f21ec7c165d837a7e","avatarUrl":"/avatars/743124a8143b215fe25fc791837cb2a9.svg","isPro":false,"fullname":"Gleb Stepanov","user":"glstep","type":"user"},{"_id":"64e7a39f7acd8971f2ca1e28","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64e7a39f7acd8971f2ca1e28/08E7EhG6pdmy7aHHZJThD.jpeg","isPro":false,"fullname":"Omer Sahin Tas","user":"omersahintas","type":"user"},{"_id":"63722445e8891da06b2a2e6e","avatarUrl":"/avatars/8e1fddd6c012e58867332b972d6d783c.svg","isPro":false,"fullname":"Richard Schwarzkopf","user":"R-Fehler","type":"user"},{"_id":"635eb4a6d1c91c2ee1eb0718","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/635eb4a6d1c91c2ee1eb0718/NJlebbRQ2AJZyjTi1UX4H.jpeg","isPro":true,"fullname":"Jaime Villa","user":"jvpc0d3r","type":"user"},{"_id":"67efd1bdd652a85830fba699","avatarUrl":"/avatars/f23048d88a822df3d47fbe16c4045123.svg","isPro":false,"fullname":"Fernandez","user":"c-fernandez","type":"user"},{"_id":"69402d778cdc5f83e2974f68","avatarUrl":"/avatars/025abd0d4dc256f90ff1cbf6f810bd8c.svg","isPro":false,"fullname":"Dominik Strutz","user":"d-stru","type":"user"},{"_id":"69fa35d478bb6609cd385087","avatarUrl":"/avatars/b94311400addfe101e0a303a2c9a4f69.svg","isPro":false,"fullname":"Frank Bieder","user":"FrankBieder","type":"user"},{"_id":"68ecfef31aca3c633e13dcfa","avatarUrl":"/avatars/a0fdca5a33397e32427d97d823df1a13.svg","isPro":false,"fullname":"Jan-Hendrik Pauls","user":"jhpauls","type":"user"},{"_id":"67f381fa0dad3893b2e384d8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/I19MvdskpjmOA2kfJlGdc.png","isPro":false,"fullname":"Yinzhe Shen","user":"shenyinzhe","type":"user"},{"_id":"66b395d0b33318f73abc308f","avatarUrl":"/avatars/08f31eebb832ac5d3e06a24a9d432d5e.svg","isPro":false,"fullname":"fh","user":"fh12345","type":"user"},{"_id":"69ef930e71beb7293d016cfc","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/69ef930e71beb7293d016cfc/Tj1bRtydXyBQrXoAnc9z5.jpeg","isPro":false,"fullname":"Fabian Konstantinidis","user":"fabikonsti","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"organization":{"_id":"643e96d7e1b2a57ff0d5393b","name":"KIT-Karlsruhe","fullname":"Karlsruhe Institute of Technology","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/643e93f1e1b2a57ff0d52680/1QuQ7oevBAcFK-Q10tI22.png"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2606/2606.02956.md"}">

Abstract

KITScenes Multimodal dataset provides high-fidelity European driving data with comprehensive 3D maps and diverse urban environments for embodied AI research.

Existing autonomous driving datasets have enabled major progress, but fall short in sensor fidelity, map completeness, or geographic diversity. We present KITScenes Multimodal, a European dataset built around high-fidelity sensors and maps. Our fully synchronized sensor suite combines high-resolution global-shutter cameras, long-range lidar beyond 400m, 4D imaging radar, and redundant GNSS/INS localization. Our HD maps are, to our knowledge, the most complete of any sensor dataset, validated through autonomous driving trials on open-source software. For the first time in a public dataset, all driving-relevant traffic elements, such as traffic lights, are mapped in 3D to a reprojection-accurate level with full topological connectivity. Recorded in cities with irregular street layouts and mixed traffic modes, our dataset complements existing datasets by broadening the available geographic diversity. We also introduce four benchmarks, each advancing spatial learning for embodied AI: online HD map construction, long-range depth estimation, novel view synthesis, and end-to-end driving. Project page: https://kitscenes.com/

Community

Paper author Paper submitter about 6 hours ago
edited about 6 hours ago

We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.

Highlights

  • European urban focus — recordings from Karlsruhe, Frankfurt, and Sindelfingen
  • High-fidelity sensor suite — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS
  • Long-range sensing — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets
  • Production-grade HD maps — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles
  • Research benchmarks — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research

Paper author Paper submitter about 6 hours ago

We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.

Highlights

  • European urban focus — recordings from Karlsruhe, Frankfurt, and Sindelfingen
  • High-fidelity sensor suite — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS
  • Long-range sensing — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets
  • Production-grade HD maps — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles
  • Research benchmarks — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images

· Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.02956
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.02956 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.02956 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Hugging Face Daily Papers