Hugging Face Daily Papers · June 8, 2026 · 4 min read

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Like Read original ↗

Excited to share our work on Parametric Social Identity Injection (PSII) for socially diverse public opinion simulation with LLM agents.\nWe identify Diversity Collapse, a phenomenon where prompt-conditioned LLM agents produce overly homogeneous responses and flattened inter-group differences. PSII addresses this by injecting explicit demographic and value-orientation vectors into intermediate hidden states, enabling fine-grained and controllable identity modulation beyond prompt-only personas.\nOn World Values Survey questions, PSII improves distributional fidelity and diversity across multiple open-source LLMs.\nDemo: <a href=\"https://psii-demo.streamlit.app\" rel=\"nofollow\">https://psii-demo.streamlit.app</a> Code: <a href=\"https://github.com/halsayxi/PSII\" rel=\"nofollow\">https://github.com/halsayxi/PSII</a> Paper: <a href=\"https://arxiv.org/abs/2603.16142\" rel=\"nofollow\">https://arxiv.org/abs/2603.16142</a> Model: <a href=\"https://huggingface.co/hexi222/psii-identity-representations\">https://huggingface.co/hexi222/psii-identity-representations</a>\n<a href=\"https://cdn-uploads.huggingface.co/production/uploads/676aff243d3de6571fbdda8e/teZYYDbn4GXyr6x7aKyv3.png\" rel=\"nofollow\"><img src=\"https://cdn-uploads.huggingface.co/production/uploads/676aff243d3de6571fbdda8e/teZYYDbn4GXyr6x7aKyv3.png\" alt=\"framework\"></a>\n","updatedAt":"2026-06-08T03:19:20.027Z","author":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","fullname":"Wang Hexi","name":"hexi222","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6840781569480896},"editors":["hexi222"],"editorAvatarUrls":["/avatars/397353cabdfea76c4cfab2d1d2f37247.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2603.16142","authors":[{"_id":"6a228191047f837f98677846","user":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","isPro":false,"fullname":"Wang Hexi","user":"hexi222","type":"user","name":"hexi222"},"name":"Hexi Wang","status":"claimed_verified","statusLastChangedAt":"2026-06-05T15:05:59.841Z","hidden":false},{"_id":"6a228191047f837f98677847","name":"Yujia Zhou","hidden":false},{"_id":"6a228191047f837f98677848","name":"Bangde Du","hidden":false},{"_id":"6a228191047f837f98677849","name":"Qingyao Ai","hidden":false},{"_id":"6a228191047f837f9867784a","name":"Yiqun Liu","hidden":false}],"publishedAt":"2026-06-01T00:00:00.000Z","submittedOnDailyAt":"2026-06-08T00:00:00.000Z","title":"Parametric Social Identity Injection and Diversification in Public Opinion Simulation","submittedOnDailyBy":{"_id":"676aff243d3de6571fbdda8e","avatarUrl":"/avatars/397353cabdfea76c4cfab2d1d2f37247.svg","isPro":false,"fullname":"Wang Hexi","user":"hexi222","type":"user","name":"hexi222"},"summary":"Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses across demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation.","upvotes":0,"discussionId":"6a228191047f837f9867784b","projectPage":"https://psii-demo.streamlit.app/","githubRepo":"https://github.com/halsayxi/PSII","githubRepoAddedBy":"user","ai_summary":"Large language models suffer from reduced social diversity in public opinion simulation due to identity indistinction in hidden representations, which is addressed through a parametric injection framework that enhances demographic representation fidelity and diversity.","ai_keywords":["large language models","public opinion simulation","diversity collapse","hidden representations","parametric social identity injection","demographic attributes","value orientations","representation-level control","KL divergence","World Values Survey"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct","githubStars":2,"organization":{"_id":"628735cbc83a2d6ab8d14a66","name":"Tsinghua","fullname":"Tsinghua University","avatar":"https://www.gravatar.com/avatar/6c5c1441e3283e7543342e59277ea219?d=retro&size=100"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[],"acceptLanguages":["en"],"organization":{"_id":"628735cbc83a2d6ab8d14a66","name":"Tsinghua","fullname":"Tsinghua University","avatar":"https://www.gravatar.com/avatar/6c5c1441e3283e7543342e59277ea219?d=retro&size=100"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2603/2603.16142.md"}">

Papers

arxiv:2603.16142

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Published on Jun 1

· Submitted by

Wang Hexi on Jun 8

Tsinghua University

Upvote

Authors:

Hexi Wang ,

Abstract

Large language models suffer from reduced social diversity in public opinion simulation due to identity indistinction in hidden representations, which is addressed through a parametric injection framework that enhances demographic representation fidelity and diversity.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses across demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation.

View arXiv page View PDF Project page GitHub 2 Add to collection

Community

hexi222

Paper author Paper submitter about 6 hours ago

Excited to share our work on Parametric Social Identity Injection (PSII) for socially diverse public opinion simulation with LLM agents.

We identify Diversity Collapse, a phenomenon where prompt-conditioned LLM agents produce overly homogeneous responses and flattened inter-group differences. PSII addresses this by injecting explicit demographic and value-orientation vectors into intermediate hidden states, enabling fine-grained and controllable identity modulation beyond prompt-only personas.

On World Values Survey questions, PSII improves distributional fidelity and diversity across multiple open-source LLMs.

Demo: https://psii-demo.streamlit.app
Code: https://github.com/halsayxi/PSII
Paper: https://arxiv.org/abs/2603.16142
Model: https://huggingface.co/hexi222/psii-identity-representations

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2603.16142

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.16142 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.16142 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

No comments yet. Sign in and be the first to say something.

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Abstract

Community

Models citing this paper 1

Datasets citing this paper 0

Spaces citing this paper 0

Collections including this paper 0

Discussion (0)

More from Hugging Face Daily Papers