Hugging Face Daily Papers · · 4 min read

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Excited to share our work on <strong>Parametric Social Identity Injection (PSII)</strong> for socially diverse public opinion simulation with LLM agents.</p>\n<p>We identify <strong>Diversity Collapse</strong>, a phenomenon where prompt-conditioned LLM agents produce overly homogeneous responses and flattened inter-group differences. PSII addresses this by injecting explicit demographic and value-orientation vectors into intermediate hidden states, enabling fine-grained and controllable identity modulation beyond prompt-only personas.</p>\n<p>On World Values Survey questions, PSII improves distributional fidelity and diversity across multiple open-source LLMs.</p>\n<p>Demo: <a href=\"https://psii-demo.streamlit.app\" rel=\"nofollow\">https://psii-demo.streamlit.app</a><br>Code: <a href=\"https://github.com/halsayxi/PSII\" rel=\"nofollow\">https://github.com/halsayxi/PSII</a><br>Paper: <a href=\"https://arxiv.org/abs/2603.16142\" rel=\"nofollow\">https://arxiv.org/abs/2603.16142</a><br>Model: <a href=\"https://huggingface.co/hexi222/psii-identity-representations\">https://huggingface.co/hexi222/psii-identity-representations</a></p>\n<p><a href=\"https://cdn-uploads.huggingface.co/production/uploads/676aff243d3de6571fbdda8e/teZYYDbn4GXyr6x7aKyv3.png\" rel=\"nofollow\"><img src=\"https://cdn-uploads.huggingface.co/production/uploads/676aff243d3de6571fbdda8e/teZYYDbn4GXyr6x7aKyv3.png\" alt=\"framework\"></a></p>\n","updatedAt":"2026-06-08T03:19:20.027Z","author":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","fullname":"Wang Hexi","name":"hexi222","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6840781569480896},"editors":["hexi222"],"editorAvatarUrls":["/avatars/397353cabdfea76c4cfab2d1d2f37247.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2603.16142","authors":[{"_id":"6a228191047f837f98677846","user":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","isPro":false,"fullname":"Wang Hexi","user":"hexi222","type":"user","name":"hexi222"},"name":"Hexi Wang","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:05:59.841Z","hidden":false},{"_id":"6a228191047f837f98677847","name":"Yujia Zhou","hidden":false},{"_id":"6a228191047f837f98677848","name":"Bangde Du","hidden":false},{"_id":"6a228191047f837f98677849","name":"Qingyao Ai","hidden":false},{"_id":"6a228191047f837f9867784a","name":"Yiqun Liu","hidden":false}],"publishedAt":"2026-06-01T00:00:00.000Z","submittedOnDailyAt":"2026-06-08T00:00:00.000Z","title":"Parametric Social Identity Injection and Diversification in Public Opinion Simulation","submittedOnDailyBy":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","isPro":false,"fullname":"Wang Hexi","user":"hexi222","type":"user","name":"hexi222"},"summary":"Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses across demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation.","upvotes":0,"discussionId":"6a228191047f837f9867784b","projectPage":"https://psii-demo.streamlit.app/","githubRepo":"https://github.com/halsayxi/PSII","githubRepoAddedBy":"user","ai_summary":"Large language models suffer from reduced social diversity in public opinion simulation due to identity indistinction in hidden representations, which is addressed through a parametric injection framework that enhances demographic representation fidelity and diversity.","ai_keywords":["large language models","public opinion simulation","diversity collapse","hidden representations","parametric social identity injection","demographic attributes","value orientations","representation-level control","KL divergence","World Values Survey"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct","githubStars":2,"organization":{"_id":"628735cbc83a2d6ab8d14a66","name":"Tsinghua","fullname":"Tsinghua University","avatar":"https://www.gravatar.com/avatar/6c5c1441e3283e7543342e59277ea219?d=retro&size=100"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[],"acceptLanguages":["en"],"organization":{"_id":"628735cbc83a2d6ab8d14a66","name":"Tsinghua","fullname":"Tsinghua University","avatar":"https://www.gravatar.com/avatar/6c5c1441e3283e7543342e59277ea219?d=retro&size=100"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2603/2603.16142.md"}">
Papers
arxiv:2603.16142

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Published on Jun 1
· Submitted by
Wang Hexi
on Jun 8
Authors:
,
,
,

Abstract

Large language models suffer from reduced social diversity in public opinion simulation due to identity indistinction in hidden representations, which is addressed through a parametric injection framework that enhances demographic representation fidelity and diversity.

Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses across demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation.

Community

Paper author Paper submitter about 6 hours ago

Excited to share our work on Parametric Social Identity Injection (PSII) for socially diverse public opinion simulation with LLM agents.

We identify Diversity Collapse, a phenomenon where prompt-conditioned LLM agents produce overly homogeneous responses and flattened inter-group differences. PSII addresses this by injecting explicit demographic and value-orientation vectors into intermediate hidden states, enabling fine-grained and controllable identity modulation beyond prompt-only personas.

On World Values Survey questions, PSII improves distributional fidelity and diversity across multiple open-source LLMs.

Demo: https://psii-demo.streamlit.app
Code: https://github.com/halsayxi/PSII
Paper: https://arxiv.org/abs/2603.16142
Model: https://huggingface.co/hexi222/psii-identity-representations

framework

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images

· Sign up or log in to comment

Get this paper in your agent:

hf papers read 2603.16142
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.16142 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.16142 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Hugging Face Daily Papers