Hugging Face Daily Papers · · 3 min read

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

A benchmark for personalized persuasion</p>\n","updatedAt":"2026-06-03T02:26:44.706Z","author":{"_id":"638d601b5e14c2f38678fb3a","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/638d601b5e14c2f38678fb3a/Elu-TTd97lGy7YL7eKRuZ.jpeg","fullname":"韩沛煊","name":"HakHan","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6865590810775757},"editors":["HakHan"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/638d601b5e14c2f38678fb3a/Elu-TTd97lGy7YL7eKRuZ.jpeg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2606.02754","authors":[{"_id":"6a1f90c5e292c1c78ecb1338","name":"Peixuan Han","hidden":false},{"_id":"6a1f90c5e292c1c78ecb1339","name":"Hongyi Du","hidden":false},{"_id":"6a1f90c5e292c1c78ecb133a","name":"Jiayu Liu","hidden":false},{"_id":"6a1f90c5e292c1c78ecb133b","name":"Yihang Sun","hidden":false},{"_id":"6a1f90c5e292c1c78ecb133c","name":"Yutong Liu","hidden":false},{"_id":"6a1f90c5e292c1c78ecb133d","name":"Jiaxuan You","hidden":false}],"publishedAt":"2026-06-01T18:20:27.000Z","submittedOnDailyAt":"2026-06-03T00:00:00.000Z","title":"Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues","submittedOnDailyBy":{"_id":"638d601b5e14c2f38678fb3a","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/638d601b5e14c2f38678fb3a/Elu-TTd97lGy7YL7eKRuZ.jpeg","isPro":false,"fullname":"韩沛煊","user":"HakHan","type":"user","name":"HakHan"},"summary":"Personalization is a crucial capability of modern language agents. However, current research primarily positions personalized agents as passive responders to user preferences, limiting their ability to interact with users and provide suggestions or guidance proactively. To systematically evaluate such proactive personalization in realistic interactions, we propose Ψ-Bench, a benchmark for assessing LLMs' ability to influence realistic users through conversation. We design three real-world interaction scenarios that involve persuasion in Ψ-Bench, and endow simulated clients with personal characteristics through explicit user profiles derived from dialogue histories. We evaluate 10 frontier LLMs on Ψ-Bench and find that while most models can produce coherent and reasonable arguments, even state-of-the-art models still leave considerable room for improvement in persuasion. We also find that providing access to client profiles yields an average performance gain of 18.24\\%, highlighting the importance of user-specific information for effective persuasion. Overall, our work highlights persona-sensitive influencing as a challenging yet practical direction for evaluating and developing more proactive personalized LLM agents. Codes are available at: https://github.com/Hanpx20/Psi-Bench.","upvotes":8,"discussionId":"6a1f90c5e292c1c78ecb133e","ai_summary":"LLMs demonstrate limited effectiveness in persuasive conversation despite generating coherent arguments, with user-specific profiles significantly improving performance.","ai_keywords":["personalized agents","language agents","LLMs","persuasion","user profiles","dialogue histories","persona-sensitive influencing"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct"},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"638d601b5e14c2f38678fb3a","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/638d601b5e14c2f38678fb3a/Elu-TTd97lGy7YL7eKRuZ.jpeg","isPro":false,"fullname":"韩沛煊","user":"HakHan","type":"user"},{"_id":"66783baec3f824dde8f783ac","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/66783baec3f824dde8f783ac/oqFYUrgs2vnGRhAMSrQpC.jpeg","isPro":false,"fullname":"Jeff","user":"JiayuJeff","type":"user"},{"_id":"673e5ac90ebca74a178c7517","avatarUrl":"/avatars/1832cf3241e120fcef4be845bdfbd140.svg","isPro":false,"fullname":"YingJie Yu","user":"yvngexe","type":"user"},{"_id":"63624ffd2a84d82a8c8d3f60","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63624ffd2a84d82a8c8d3f60/NhDFHhlCWJeMevXN7aQUX.png","isPro":false,"fullname":"Chumeng Liang","user":"chumengl","type":"user"},{"_id":"68087b4f3f5cc7179ae959a7","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/l9skgMVKXJollx6BwNaWm.png","isPro":false,"fullname":"Xiaocheng Yang","user":"Xiaocheng-Yang","type":"user"},{"_id":"66f8689725464a7989b75845","avatarUrl":"/avatars/43a61a528c5779103eaf5687ba44ee14.svg","isPro":false,"fullname":"Jiarui Yao","user":"FlippyDora","type":"user"},{"_id":"665e121c6007027038fd4005","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/sIVBJAGM-Kneq9KMf8aXb.png","isPro":false,"fullname":"Cheng Qian","user":"chengq9","type":"user"},{"_id":"6a073971ffffe0e25c0755b5","avatarUrl":"/avatars/b2720ecb828ebb861403ed852f0d8c36.svg","isPro":false,"fullname":"Yulin Liu","user":"Yulin0610","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2606/2606.02754.md"}">
Papers
arxiv:2606.02754

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Published on Jun 1
· Submitted by
韩沛煊
on Jun 3
Authors:
,
,
,
,
,

Abstract

LLMs demonstrate limited effectiveness in persuasive conversation despite generating coherent arguments, with user-specific profiles significantly improving performance.

Personalization is a crucial capability of modern language agents. However, current research primarily positions personalized agents as passive responders to user preferences, limiting their ability to interact with users and provide suggestions or guidance proactively. To systematically evaluate such proactive personalization in realistic interactions, we propose Ψ-Bench, a benchmark for assessing LLMs' ability to influence realistic users through conversation. We design three real-world interaction scenarios that involve persuasion in Ψ-Bench, and endow simulated clients with personal characteristics through explicit user profiles derived from dialogue histories. We evaluate 10 frontier LLMs on Ψ-Bench and find that while most models can produce coherent and reasonable arguments, even state-of-the-art models still leave considerable room for improvement in persuasion. We also find that providing access to client profiles yields an average performance gain of 18.24\%, highlighting the importance of user-specific information for effective persuasion. Overall, our work highlights persona-sensitive influencing as a challenging yet practical direction for evaluating and developing more proactive personalized LLM agents. Codes are available at: https://github.com/Hanpx20/Psi-Bench.

Community

Paper submitter about 11 hours ago

A benchmark for personalized persuasion

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images

· Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.02754
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.02754 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2606.02754 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.02754 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Hugging Face Daily Papers