Would you say capture-time semantic annotation for robot trajectories is a solved problem? [R]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
It seems raw teleoperation data (RGB + joint states) structurally lacks affordance, contact intent, and embodiment-specific kinematic context. (information that can't be reliably recovered post-hoc once the demonstration is recorded)
Most current approaches either filter/clean after collection, or rely on simulation to compensate. But neither seems to close the semantic gap for contact-rich tasks in unstructured environments.
Is anyone working on supervision at acquisition time, enriching the stream as it's captured rather than labeling after the fact?
And if not, is this a real bottleneck or am I overestimating the problem?
[link] [comments]
More from r/MachineLearning
-
Are We Underestimating Small Edge AI Models?[D]
Jun 5
-
Is it allowed to use OpenAI API outputs to create a silver code dataset or benchmark for a specific Python library? [d]
Jun 5
-
Scrap the LLMs. Scoring 4.76% on the brand new ARC-3 using pure code, a 2012 AMD CPU, and zero AI tokens.[P]
Jun 5
-
[R] Measuring the Symmetry--Data Exchange Rate
Jun 4
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.