\n<li><p>Search agents are usually optimized around one or a few “search environments”, whether web search APIs or local search built with a single retriever.</p>\n</li>\n<li><p>In practice, search environments are diverse, shaped by the retriever’s behavior, the indexing pipeline, the corpus distribution and quality, and the interaction interface.</p>\n</li>\n<li><p>Can search agents adapt their search strategies to different environments? More fundamentally, are they even aware when they're placed in different environments?</p>\n</li>\n</ul>\n<p>We believe this calls for a new research direction: 𝗘𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁-𝗮𝘄𝗮𝗿𝗲 𝗜𝗻𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹. Our ACL 2026 work takes an initial step toward this goal by studying one core factor: how search agents adapt to different retriever behaviors, and how much this adaptation matters. Check it out!</p>\n<p><a href=\"https://cdn-uploads.huggingface.co/production/uploads/604f67ef0fe8ff3ec13d71ef/0AJBlVzq6EFCxQZhsbU3G.png\" rel=\"nofollow\"><img src=\"https://cdn-uploads.huggingface.co/production/uploads/604f67ef0fe8ff3ec13d71ef/0AJBlVzq6EFCxQZhsbU3G.png\" alt=\"image\"></a></p>\n","updatedAt":"2026-06-19T03:12:27.977Z","author":{"_id":"604f67ef0fe8ff3ec13d71ef","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/604f67ef0fe8ff3ec13d71ef/KhUwWvZ3OJ9nEee3B-SXO.png","fullname":"Hou Pong (Ken) Chan","name":"kenchan0226","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":15,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8537598252296448},"editors":["kenchan0226"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/604f67ef0fe8ff3ec13d71ef/KhUwWvZ3OJ9nEee3B-SXO.png"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2606.16817","authors":[{"_id":"6a34b13c4c5c5e0d69bf1ca1","name":"Ruifeng Yuan","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca2","name":"Chaohao Yuan","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca3","name":"David Dai","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca4","name":"Yu Rong","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca5","name":"Hong Cheng","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca6","name":"Hou Pong Chan","hidden":false},{"_id":"6a34b13c4c5c5e0d69bf1ca7","name":"Chenghao Xiao","hidden":false}],"publishedAt":"2026-06-15T00:00:00.000Z","submittedOnDailyAt":"2026-06-19T00:00:00.000Z","title":"Understanding the Behaviors of Environment-aware Information Retrieval","submittedOnDailyBy":{"_id":"604f67ef0fe8ff3ec13d71ef","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/604f67ef0fe8ff3ec13d71ef/KhUwWvZ3OJ9nEee3B-SXO.png","isPro":false,"fullname":"Hou Pong (Ken) Chan","user":"kenchan0226","type":"user","name":"kenchan0226"},"summary":"Recent retrieval-augmented generation (RAG) approaches have demonstrated strong capability in handling complex queries, yet current research overlooks a critical challenge: different retrievers require fundamentally different query formulation strategies for optimal performance. In this work, we present the first systematic analysis of how LLMs can learn to adapt their query formulation strategies for different retrievers via reinforcement learning (RL). Our empirical study reveals that RL effectively teaches an LLM to tailor its queries to specific retriever characteristics. We discover that different retrievers exhibit surprisingly distinct optimal query styles (e.g., descriptive vs. question-like), suggesting strategies learned for one retriever ineffective for another. We further show that performance can be enhanced by incorporating retriever-specific human guidance and by scaling model size. To facilitate learning over multi-retrieval-step trajectories, we introduce a branching-based rollout technique that improves training stability. Our work provides the first empirical evidence and actionable insights for building truly retriever-aware RAG systems. Code and resources are available at https://github.com/LCO-Embedding/Envs-aware-Information-Retrieval.","upvotes":3,"discussionId":"6a34b13c4c5c5e0d69bf1ca8","githubRepo":"https://github.com/LCO-Embedding/Envs-aware-Information-Retrieval","githubRepoAddedBy":"user","ai_summary":"Large language models can be trained via reinforcement learning to adapt query formulation strategies for different retrievers, with distinct optimal query styles and improved performance through retriever-specific guidance and model scaling.","ai_keywords":["retrieval-augmented generation","LLMs","reinforcement learning","query formulation strategies","retriever-specific guidance","multi-retrieval-step trajectories","branching-based rollout technique"],"ai_summary_model":"Qwen/Qwen2.5-Coder-32B-Instruct","githubStars":5,"organization":{"_id":"68eb82481dfe9ec7bd1cc16b","name":"LCO-Embedding","fullname":"LCO-Embedding","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/63108cc834c7d77420b0fd68/kOt7TD2Ge6hgUsz8CF6Ro.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"604f67ef0fe8ff3ec13d71ef","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/604f67ef0fe8ff3ec13d71ef/KhUwWvZ3OJ9nEee3B-SXO.png","isPro":false,"fullname":"Hou Pong (Ken) Chan","user":"kenchan0226","type":"user"},{"_id":"66d8512c54209e9101811e8e","avatarUrl":"/avatars/62dfd8e6261108f2508efe678d5a2a57.svg","isPro":false,"fullname":"M Saad Salman","user":"MSS444","type":"user"},{"_id":"66fa54b9076c1a309d563a41","avatarUrl":"/avatars/2daaa10c4c5bfcde74c7f995d15be1e0.svg","isPro":false,"fullname":"Ruifeng Yuan","user":"csyrf","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"organization":{"_id":"68eb82481dfe9ec7bd1cc16b","name":"LCO-Embedding","fullname":"LCO-Embedding","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/63108cc834c7d77420b0fd68/kOt7TD2Ge6hgUsz8CF6Ro.png"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2606/2606.16817.md","query":{}}">
Understanding the Behaviors of Environment-aware Information Retrieval
Abstract
Large language models can be trained via reinforcement learning to adapt query formulation strategies for different retrievers, with distinct optimal query styles and improved performance through retriever-specific guidance and model scaling.
Recent retrieval-augmented generation (RAG) approaches have demonstrated strong capability in handling complex queries, yet current research overlooks a critical challenge: different retrievers require fundamentally different query formulation strategies for optimal performance. In this work, we present the first systematic analysis of how LLMs can learn to adapt their query formulation strategies for different retrievers via reinforcement learning (RL). Our empirical study reveals that RL effectively teaches an LLM to tailor its queries to specific retriever characteristics. We discover that different retrievers exhibit surprisingly distinct optimal query styles (e.g., descriptive vs. question-like), suggesting strategies learned for one retriever ineffective for another. We further show that performance can be enhanced by incorporating retriever-specific human guidance and by scaling model size. To facilitate learning over multi-retrieval-step trajectories, we introduce a branching-based rollout technique that improves training stability. Our work provides the first empirical evidence and actionable insights for building truly retriever-aware RAG systems. Code and resources are available at https://github.com/LCO-Embedding/Envs-aware-Information-Retrieval.
Community
Search agents are usually optimized around one or a few “search environments”, whether web search APIs or local search built with a single retriever.
In practice, search environments are diverse, shaped by the retriever’s behavior, the indexing pipeline, the corpus distribution and quality, and the interaction interface.
Can search agents adapt their search strategies to different environments? More fundamentally, are they even aware when they're placed in different environments?
We believe this calls for a new research direction: 𝗘𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁-𝗮𝘄𝗮𝗿𝗲 𝗜𝗻𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹. Our ACL 2026 work takes an initial step toward this goal by studying one core factor: how search agents adapt to different retriever behaviors, and how much this adaptation matters. Check it out!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images
Cite arxiv.org/abs/2606.16817 in a model README.md to link it from this page.
Cite arxiv.org/abs/2606.16817 in a dataset README.md to link it from this page.
Cite arxiv.org/abs/2606.16817 in a Space README.md to link it from this page.
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.