Hugging Face Daily Papers · May 21, 2026 · 4 min read

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Like Read original ↗

Skill optimization is inherently multi-objective: a skill must maximize task correctness and satisfy hard platform limits (truncated descriptions, compacted instruction bodies, finite shared context). Prior prompt optimizers either ignore these trade-offs or collapse them into a single scalar, missing Pareto-optimal variants in non-convex regions. MOCHA replaces single-objective selection with Chebyshev scalarization — provably covering the full Pareto front — combined with exponential annealing that transitions from exploration to exploitation as the rollout budget is consumed. Across six diverse skills, MOCHA beats the strongest baseline by 7.5% on average (up to +14.9%) and finds 2× more Pareto-optimal variants, while existing optimizers plateau at the seed on 4 of 6 tasks.<br><a href=\"https://cdn-uploads.huggingface.co/production/uploads/6366e2d9575c93ceda0791d8/-te4DkcopPXYSantgGtLz.png\" rel=\"nofollow\"><img src=\"https://cdn-uploads.huggingface.co/production/uploads/6366e2d9575c93ceda0791d8/-te4DkcopPXYSantgGtLz.png\" alt=\"teaser_non_convex\"></a></p>\n<p><a href=\"https://cdn-uploads.huggingface.co/production/uploads/6366e2d9575c93ceda0791d8/bCOJrOMkF59Sgd9eHxaz1.png\" rel=\"nofollow\"><img src=\"https://cdn-uploads.huggingface.co/production/uploads/6366e2d9575c93ceda0791d8/bCOJrOMkF59Sgd9eHxaz1.png\" alt=\"fig_evolution\"></a></p>\n","updatedAt":"2026-05-21T06:04:52.170Z","author":{"_id":"6366e2d9575c93ceda0791d8","avatarUrl":"/avatars/a53cb1bb7cd9c63a2520587108ffe962.svg","fullname":"Mehrab Tanjim","name":"Mehrab-Tanjim","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7766125202178955},"editors":["Mehrab-Tanjim"],"editorAvatarUrls":["/avatars/a53cb1bb7cd9c63a2520587108ffe962.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2605.19330","authors":[{"_id":"6a0ea070164dbbc68a26c67c","name":"Md Mehrab Tanjim","hidden":false},{"_id":"6a0ea070164dbbc68a26c67d","name":"Jayakumar Subramanian","hidden":false},{"_id":"6a0ea070164dbbc68a26c67e","name":"Xiang Chen","hidden":false},{"_id":"6a0ea070164dbbc68a26c67f","name":"Branislav Kveton","hidden":false},{"_id":"6a0ea070164dbbc68a26c680","name":"Subhojyoti Mukherjee","hidden":false},{"_id":"6a0ea070164dbbc68a26c681","name":"Anlan Zhang","hidden":false},{"_id":"6a0ea070164dbbc68a26c682","name":"Sungchul Kim","hidden":false},{"_id":"6a0ea070164dbbc68a26c683","name":"Somdeb Sarkhel","hidden":false},{"_id":"6a0ea070164dbbc68a26c684","name":"Sunav Choudhury","hidden":false}],"publishedAt":"2026-05-19T00:00:00.000Z","submittedOnDailyAt":"2026-05-21T00:00:00.000Z","title":"MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization","submittedOnDailyBy":{"_id":"6366e2d9575c93ceda0791d8","avatarUrl":"/avatars/a53cb1bb7cd9c63a2520587108ffe962.svg","isPro":false,"fullname":"Mehrab Tanjim","user":"Mehrab-Tanjim","type":"user","name":"Mehrab-Tanjim"},"summary":"LLM agents organize behavior through skills - structured natural-language specifications governing how an agent reasons, retrieves, and responds. Unlike monolithic prompts, skills are multi-field artifacts subject to hard platform constraints: description fields are truncated for routing, instruction bodies are compacted via progressive disclosure, and co-resident skills compete for limited context windows. These constraints make skill optimization inherently multi-objective: a skill must simultaneously maximize task performance and satisfy platform limits. Yet existing prompt optimizers either ignore these trade-offs or collapse them into a weighted sum, missing Pareto-optimal variants in non-convex objective regions. We introduce MOCHA (Multi-Objective Chebyshev Annealing), which replaces single-objective selection with Chebyshev scalarization - covering the full Pareto front, including non-convex regions - combined with exponential annealing that transitions from exploration to exploitation. In our experiments across six diverse agent skills - where all methods share the same multi-objective mutation operator and baselines receive identical per-objective textual feedback - existing optimizers fail to improve the seed skill on 4 of 6 tasks: 1000 rollouts yield zero progress. MOCHA breaks through on every task, achieving 7.5% relative improvement in mean correctness over the strongest baseline (up to 14.9% on FEVER and 10.4% on TheoremQA) while discovering twice as many more Pareto-optimal skill variants.","upvotes":0,"discussionId":"6a0ea070164dbbc68a26c685","organization":{"_id":"637b318856db0404b7c5a0c2","name":"adobe-research","fullname":"Adobe Research","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/1669033410364-624bebf604abc7ebb01789af.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[],"acceptLanguages":["en"],"organization":{"_id":"637b318856db0404b7c5a0c2","name":"adobe-research","fullname":"Adobe Research","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/1669033410364-624bebf604abc7ebb01789af.png"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2605/2605.19330.md"}">

Papers

arxiv:2605.19330

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization

Published on May 19

· Submitted by

Mehrab Tanjim on May 21

Adobe Research

Upvote

Authors:

Abstract

LLM agents organize behavior through skills - structured natural-language specifications governing how an agent reasons, retrieves, and responds. Unlike monolithic prompts, skills are multi-field artifacts subject to hard platform constraints: description fields are truncated for routing, instruction bodies are compacted via progressive disclosure, and co-resident skills compete for limited context windows. These constraints make skill optimization inherently multi-objective: a skill must simultaneously maximize task performance and satisfy platform limits. Yet existing prompt optimizers either ignore these trade-offs or collapse them into a weighted sum, missing Pareto-optimal variants in non-convex objective regions. We introduce MOCHA (Multi-Objective Chebyshev Annealing), which replaces single-objective selection with Chebyshev scalarization - covering the full Pareto front, including non-convex regions - combined with exponential annealing that transitions from exploration to exploitation. In our experiments across six diverse agent skills - where all methods share the same multi-objective mutation operator and baselines receive identical per-objective textual feedback - existing optimizers fail to improve the seed skill on 4 of 6 tasks: 1000 rollouts yield zero progress. MOCHA breaks through on every task, achieving 7.5% relative improvement in mean correctness over the strongest baseline (up to 14.9% on FEVER and 10.4% on TheoremQA) while discovering twice as many more Pareto-optimal skill variants.

View arXiv page View PDF Add to collection

Community

Mehrab-Tanjim

Paper submitter about 7 hours ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2605.19330

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.19330 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.19330 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.19330 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

No comments yet. Sign in and be the first to say something.

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization

Abstract

Community

Models citing this paper 0

Datasets citing this paper 0

Spaces citing this paper 0

Collections including this paper 0

Discussion (0)

More from Hugging Face Daily Papers