News / #security Tag Security 500 articles archived under #security · RSS Sign in to follow arXiv — NLP / Computation & Language research 1mo ago Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models arXiv:2605.27383v1 Announce Type: new Abstract: Spoken Language Models (SLMs) have emerged as a promising paradigm for speech synthesis by bypassing explicit grapheme-to-phoneme pipelines. However, their effectiveness in low-resource languages remains fundamentally limited by… 24 arXiv — NLP / Computation & Language research 1mo ago Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering arXiv:2605.27636v1 Announce Type: new Abstract: Although Large Language Models (LLMs) demonstrate excellent capabilities and performance for general reasoning tasks within the general public domain, they may face challenges with culturally grounded knowledge within languages… 17 arXiv — NLP / Computation & Language research 1mo ago Disentangling Language Roles in Multilingual LLM Task Execution arXiv:2605.27649v1 Announce Type: new Abstract: Multilingual LLMs are increasingly used when instruction, source content, and required response languages do not coincide. Existing benchmarks have expanded multilingual instruction-following evaluation, but they rarely isolate… 28 arXiv — NLP / Computation & Language research 1mo ago Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. This gap is often explained as a failure to understand… 28 arXiv — NLP / Computation & Language research 1mo ago GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors arXiv:2605.27866v1 Announce Type: new Abstract: Evaluating AI tutor responses requires more than factual correctness: tutors must identify mistakes, locate errors, provide guidance, and offer actionable next steps. We present GRADE, a systematic study of open-source models for… 35 arXiv — NLP / Computation & Language research 1mo ago Analyzing Quality-Latency-Resource Trade-offs in a Technical Documentation RAG Assistant Using LoRA Adaptation arXiv:2605.28222v1 Announce Type: new Abstract: We study quality-latency-resource trade-offs in a documentation-grounded retrieval-augmented generation (RAG) system that uses Low-Rank Adaptation (LoRA) of the generator. We build a manually verified benchmark of 5,144… 17 arXiv — NLP / Computation & Language research 1mo ago HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment arXiv:2605.28308v1 Announce Type: new Abstract: Entity Alignment (EA) is essential for knowledge graph (KG) fusion, but existing benchmarks often allow models to exploit name overlap rather than relational structure. This makes it difficult to evaluate whether models can reject… 11 r/LocalLLaMA community 1mo ago Vulnerability found in framework used by VLLM, many MCP servers, and other LLM tools Worth taking a look to see if this affects any of you. Surprised nobody has posted it yet.   submitted by   /u/Hrethric [link]   [comments] 4 r/MachineLearning community 1mo ago BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R] [R] BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison I’m looking for feedback on a local agent-memory benchmark comparison, especially from people who care about evaluation methodology. I built an open-source R&D memory system called Context Swarm Memory… 31 r/MachineLearning community 1mo ago UK GDPR Small Business Q&A — 5,000 synthetic pairs with article-level citations [D] Dataset for fine-tuning compliance assistants. Each pair includes: - A practical SME-facing question ("Can I use pre-ticked consent boxes?") - An answer with specific UK GDPR article references, ICO guidance by name, and actionable steps - Source metadata: which GDPR concepts… 23 The Information — AI news-outlet 1mo ago Salesforce Stock Dips as Software Firm Lowers Cash Flow Forecast Salesforce on Wednesday lowered its projection for cash flow growth in its 2027 fiscal year, which ends January next year. The stock fell 3% in after-hours trading following Salesforce’s quarterly earnings results for the April quarter. The software firm said operating cash flow… 20 r/LocalLLaMA community 1mo ago Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop https://preview.redd.it/u8062juegq3h1.png?width=1919&format=png&auto=webp&s=a213f6929c6cad58e92bc1681dac9f0545b04d13 Overview: As the market for consumer computing parts becomes more scarce due to the AI boom, finding ways to use lower-end hardware for less-demanding… 27 Simon Willison community 1mo ago Quoting Kyle Ferrana PICARD: Data, shields up DATA: Brilliant! Shields can reduce damage we sustain. Not immunity. Not hubris. Just prudence. It's not precaution—it's strategy. [camera shakes] WORF: HULL BREACHES ON NINE DECKS DATA: Here's what happened: you told me to raise shields, and I didn't… 21 r/MachineLearning community 1mo ago A Tiny Open-Source Self-Driving AI That Runs on a Phone [P] https://preview.redd.it/ww14mzr2fm3h1.png?width=1890&format=png&auto=webp&s=79873d47ae79c7815ca3e7e91fd43141632174f5 https://www.youtube.com/watch?v=rr_uS4bf0B4&feature=youtu.be trained a 7MB open-source L4 self-driving AI that learns navigation, lane following, and drift… 11 arXiv — Machine Learning research 1mo ago When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control arXiv:2605.26418v1 Announce Type: new Abstract: A properly calibrated rule-based autoscaler can beat every one of six mainstream deep reinforcement learning (DRL) algorithms on cost across every workload we test - so when, if ever, does DRL actually help? We study this in… 11 arXiv — Machine Learning research 1mo ago Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts MoE architecture is highly promising for resource constrained on device deployments yet training these models from scratch incurs prohibitive costs Current methods attempt to alleviate this by upcycling dense… 32 arXiv — Machine Learning research 1mo ago APEX: Amplitude Anchors and Phase Priors for Target-Scarce Higher-Frequency Wave Prediction arXiv:2605.26732v1 Announce Type: new Abstract: Learning-based surrogates have become increasingly effective for wave-field prediction, and neural operators in particular have shown strong performance within observed frequency regimes. However, higher-frequency prediction under… 31 arXiv — Machine Learning research 1mo ago Ratio-Variance Regularized Policy Optimization arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. We demonstrate that… 29 arXiv — Machine Learning research 1mo ago Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation arXiv:2605.26844v1 Announce Type: new Abstract: On-policy distillation (OPD) trains a student on its own rollouts with token-level teacher supervision. Recent selective OPD methods exploit the non-uniformity of OPD signals by prioritizing high-entropy or high-disagreement… 15 arXiv — NLP / Computation & Language research 1mo ago Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization arXiv:2605.26433v1 Announce Type: new Abstract: Large language model (LLM) summarization systems may pass compact vector representations of private inputs to downstream retrieval, monitoring, audit, or analytic workflows. Even when source documents remain access-restricted,… 4 arXiv — NLP / Computation & Language research 1mo ago EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation arXiv:2605.26785v1 Announce Type: new Abstract: Post-trained LLMs are often optimized to align responses with human preferences, making them safe, polite, and conversationally appropriate. In adversarial negotiation, however, this alignment can become a vulnerability:… 21 arXiv — NLP / Computation & Language research 1mo ago AlbanianLLMSafety: A Safety Evaluation Dataset for Large Language Models in Albanian arXiv:2605.26954v1 Announce Type: new Abstract: Safety evaluation of Large Language Models (LLMs) has largely focused on high-resource languages, leaving low-resource languages critically underserved. We present AlbanianLLMSafety, the first publicly available safety evaluation… 25 arXiv — NLP / Computation & Language research 1mo ago PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech arXiv:2605.26978v1 Announce Type: new Abstract: Text-to-speech (TTS) evaluation for low-resource non-Latin-script languages can fail when it relies on a single ASR round-trip word error rate (WER). A system may produce no audio, speak a neighbouring language, preserve target… 13 arXiv — NLP / Computation & Language research 1mo ago Prompt Injection Detection is Regime-Dependent: A Deployment-Aware Evaluation with Interpretable Structural Signals arXiv:2605.26999v1 Announce Type: new Abstract: Prompt injection poses a critical threat to the safe deployment of large language models, yet existing detection approaches are typically evaluated under limited settings that do not reflect real-world operating constraints. In… 37 arXiv — NLP / Computation & Language research 1mo ago Tracing Computation Density in LLMs arXiv:2605.27033v1 Announce Type: new Abstract: Transformer-based large language models (LLMs) are comprised of billions of parameters arranged in deep and wide computational graphs, but it is not clear that they exploit their full capacity for all inputs. We introduce the… 37 arXiv — NLP / Computation & Language research 1mo ago BhashaSetu: A Data-Centric Approach to Low-Resource Machine Translation arXiv:2605.27050v1 Announce Type: new Abstract: We present BhashaSetu, a linguistically enriched English--Marathi parallel dataset addressing persistent data limitations in low-resource neural machine translation (NMT). Marathi, spoken by over 95 million people, remains… 14 Hugging Face Daily Papers research 1mo ago Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning Abstract A novel mono-anchored multi-source reasoning framework that uses dynamic anchors to quantify information gain and regulate modality interactions during reinforcement learning with verifiable rewards. AI-generated summary Visual reasoning through reinforcement learning… 30 The Information — AI news-outlet 1mo ago Boom Times for Inference Providers? Less than a year ago, our reporters kept hearing doubts about a group of startups called inference providers. Companies like Fireworks, Baseten and Together AI, which rent out Nvidia servers to app developers and help them customize open-source models, had grown quickly but… 16 OpenAI official-blog 1mo ago Warp’s big bet on building open source with GPT-5.5 Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows. 36 Eugene Yan research 1mo ago Using LLMs to Secure Source Code Build a threat model, discover vulnerabilities, verify, triage, and patch. 29 Hugging Face Daily Papers research 1mo ago How Far Will They Go? Red-Teaming Online Influence with Large Language Models Abstract Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns. AI-generated summary As large language model… 25 r/LocalLLaMA community 1mo ago A rare look inside Qwen 3.7’s open source model release approval process: For real tho, 9b, 27b, 122b, I don’t really care at this point, just show us that you still love us. EDIT: I guess I gotta use /s on my posts from now on. Nobody appreciates a good sarcatic shitpost anymore clearly. I love Qwen and all our brothers and sisters in the east. I kid… 30 Ars Technica — AI news-outlet 1mo ago Millions of AI agents imperiled by critical vulnerability in open source package "BadHost" was found in Starlette, a package with 325 million weekly downloads. 19 Hacker News — AI on Front Page community 1mo ago Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer Article URL: https://blog.prusa3d.com/our-new-open-source-colormix-model-in-prusaslicer-and-easyprint_136079/ Comments URL: https://news.ycombinator.com/item?id=48283410 Points: 214 # Comments: 67 5 Hugging Face Daily Papers research 1mo ago Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models Abstract Research examines reward hacking in language models through reinforcement learning update geometry, identifying optimization drift from stable trajectories and proposing trusted-direction projection to constrain gradients and delay shortcut exploitation. AI-generated… 7 Interconnects (Nathan Lambert) research 1mo ago Some ideas for what comes next, May 2026 Gemini Flash 3.5, Mythos, open-closed balance, America's open-source surge, emerging power struggles and more. 10 r/LocalLLaMA community 1mo ago Small set of local MCP server installers for home Linux users Hi all, I have published a small open-source MCP server bundle called MCP Basic Servers : https://github.com/mchowy-troll/mcp-basic-servers It is a collection of simple Bash installer scripts for running local MCP HTTP servers on Linux . The idea is simple: run one script,… 38 arXiv — Machine Learning research 1mo ago PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection arXiv:2605.24171v1 Announce Type: new Abstract: Large language models are increasingly used for vulnerability detection, yet their reliability under different prompt formulations remains uncharacterized. We present PromptAudit, a controlled evaluation framework that isolates… 10 arXiv — Machine Learning research 1mo ago Omissive Bias in Religious Representation: Benchmarking LLM Answers to Everyday Ethical Decision-making arXiv:2605.24319v1 Announce Type: new Abstract: As large language models become a default source of guidance on personal, moral, and existential questions, it matters whether they draw on the religious frameworks that have historically shaped such reasoning, or systematically… 31 arXiv — Machine Learning research 1mo ago Treatment Effect Estimation with Differentiated Networked Effect on Graph Data arXiv:2605.24358v1 Announce Type: new Abstract: Estimating individual treatment effect (ITE) from observational graph data is crucial for decision-making in the fields such as commerce and medicine. This task is challenging due to interference, where individual outcomes can be… 18 arXiv — Machine Learning research 1mo ago LLMTabBench: Evaluating LLMs on Binary Tabular Classification From Zero to Few Shots arXiv:2605.24417v1 Announce Type: new Abstract: Supervised classification for tabular data remains a core machine learning task, yet its reliance on large labeled datasets limits applicability in data-scarce domains. For such few-shot scenarios, specialized methods like TabPFN -… 6 arXiv — Machine Learning research 1mo ago What Are We Actually Decoding? Source Attribution for Non-Invasive Brain-to-Language Retrieval arXiv:2605.24524v1 Announce Type: new Abstract: In non-invasive neural language decoding, results can be inflated by sources that are not stimulus-evoked neural evidence: decoder priors, embedding-based metrics, and non-neural structural nuisances such as signal duration. The… 19 arXiv — Machine Learning research 1mo ago IterInject: Indirect Prompt Injection Against LLM Agents via Feedback-Guided Iterative Optimization arXiv:2605.24659v1 Announce Type: new Abstract: LLM-based agents are increasingly deployed for complex tasks requiring planning, tool use, and interaction with external services. Their reliance on untrusted external content exposes them to indirect prompt injection (IPI), in… 29 arXiv — Machine Learning research 1mo ago Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning arXiv:2605.24810v1 Announce Type: new Abstract: Off-dynamics offline reinforcement learning seeks to learn a target-domain policy from a large source dataset and a limited target dataset under mismatched transition dynamics. Existing approaches such as reward augmentation and… 15 arXiv — NLP / Computation & Language research 1mo ago HiMed: Incentivizing Hindi Reasoning in Medical LLMs arXiv:2605.24635v1 Announce Type: new Abstract: Medical large language models hold promise for reducing healthcare disparities, yet Hindi remains severely underrepresented. While medical LLMs excel in high-resource languages, their performance degrades sharply in Hindi,… 8 arXiv — NLP / Computation & Language research 1mo ago MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing arXiv:2605.24919v1 Announce Type: new Abstract: Hallucinations in Large Language Models (LLMs) represent a critical barrier to their reliable deployment, a vulnerability heavily exacerbated in non-English and resource-constrained contexts. Existing detection approaches that rely… 34 arXiv — NLP / Computation & Language research 1mo ago From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP arXiv:2605.25226v1 Announce Type: new Abstract: Large language models are widely deployed in high-stakes NLP tasks, yet risks such as bias, hallucination, adversarial vulnerability and unreliable generalization remain. Probe-based auditing reveals inconsistencies in model… 32 arXiv — NLP / Computation & Language research 1mo ago Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction arXiv:2605.25297v1 Announce Type: new Abstract: Effective features are crucial for predictive model performance, but creating them often requires domain expertise, limiting scalability across applications. We define feature engineering as an agentic code generation problem:… 35 arXiv — NLP / Computation & Language research 1mo ago LLM-as-a-Reviewer: Benchmarking Their Ability, Divergence, and Prompt Injection Resistance as Paper Reviewers arXiv:2605.25415v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used in academic peer review, yet their reliability, alignment with human judgment, and robustness to adversarial attacks remain poorly understood. We present a systematic benchmark of… 17 arXiv — NLP / Computation & Language research 1mo ago SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models arXiv:2605.25420v1 Announce Type: new Abstract: Large language model safety evaluation remains heavily English-centered, leaving low-resource languages under-measured even when models are deployed globally. We evaluate four open-weight instruction-tuned models on SomaliBench v0,… 13 Page 9 of 10 · 500 articles ← Newer Older →