Tag

Security

500 articles archived under #security · RSS

arXiv — NLP / Computation & Language research 1mo ago

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models

arXiv:2605.27383v1 Announce Type: new Abstract: Spoken Language Models (SLMs) have emerged as a promising paradigm for speech synthesis by bypassing explicit grapheme-to-phoneme pipelines. However, their effectiveness in low-resource languages remains fundamentally limited by…

24
arXiv — NLP / Computation & Language research 1mo ago

Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering

arXiv:2605.27636v1 Announce Type: new Abstract: Although Large Language Models (LLMs) demonstrate excellent capabilities and performance for general reasoning tasks within the general public domain, they may face challenges with culturally grounded knowledge within languages…

17
arXiv — NLP / Computation & Language research 1mo ago

Disentangling Language Roles in Multilingual LLM Task Execution

arXiv:2605.27649v1 Announce Type: new Abstract: Multilingual LLMs are increasingly used when instruction, source content, and required response languages do not coincide. Existing benchmarks have expanded multilingual instruction-following evaluation, but they rarely isolate…

28
arXiv — NLP / Computation & Language research 1mo ago

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. This gap is often explained as a failure to understand…

28
arXiv — NLP / Computation & Language research 1mo ago

GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors

arXiv:2605.27866v1 Announce Type: new Abstract: Evaluating AI tutor responses requires more than factual correctness: tutors must identify mistakes, locate errors, provide guidance, and offer actionable next steps. We present GRADE, a systematic study of open-source models for…

35
arXiv — NLP / Computation & Language research 1mo ago

Analyzing Quality-Latency-Resource Trade-offs in a Technical Documentation RAG Assistant Using LoRA Adaptation

arXiv:2605.28222v1 Announce Type: new Abstract: We study quality-latency-resource trade-offs in a documentation-grounded retrieval-augmented generation (RAG) system that uses Low-Rank Adaptation (LoRA) of the generator. We build a manually verified benchmark of 5,144…

17
arXiv — NLP / Computation & Language research 1mo ago

HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment

arXiv:2605.28308v1 Announce Type: new Abstract: Entity Alignment (EA) is essential for knowledge graph (KG) fusion, but existing benchmarks often allow models to exploit name overlap rather than relational structure. This makes it difficult to evaluate whether models can reject…

11
r/LocalLLaMA community 1mo ago

Vulnerability found in framework used by VLLM, many MCP servers, and other LLM tools

Worth taking a look to see if this affects any of you. Surprised nobody has posted it yet.   submitted by   /u/Hrethric [link]   [comments]

4
r/MachineLearning community 1mo ago

BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]

[R] BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison I’m looking for feedback on a local agent-memory benchmark comparison, especially from people who care about evaluation methodology. I built an open-source R&D memory system called Context Swarm Memory…

31
r/MachineLearning community 1mo ago

UK GDPR Small Business Q&A — 5,000 synthetic pairs with article-level citations [D]

Dataset for fine-tuning compliance assistants. Each pair includes: - A practical SME-facing question ("Can I use pre-ticked consent boxes?") - An answer with specific UK GDPR article references, ICO guidance by name, and actionable steps - Source metadata: which GDPR concepts…

23
The Information — AI news-outlet 1mo ago

Salesforce Stock Dips as Software Firm Lowers Cash Flow Forecast

Salesforce on Wednesday lowered its projection for cash flow growth in its 2027 fiscal year, which ends January next year. The stock fell 3% in after-hours trading following Salesforce’s quarterly earnings results for the April quarter. The software firm said operating cash flow…

20
r/LocalLLaMA community 1mo ago

Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop

https://preview.redd.it/u8062juegq3h1.png?width=1919&format=png&auto=webp&s=a213f6929c6cad58e92bc1681dac9f0545b04d13 Overview: As the market for consumer computing parts becomes more scarce due to the AI boom, finding ways to use lower-end hardware for less-demanding…

27
Simon Willison community 1mo ago

Quoting Kyle Ferrana

PICARD: Data, shields up DATA: Brilliant! Shields can reduce damage we sustain. Not immunity. Not hubris. Just prudence. It's not precaution—it's strategy. [camera shakes] WORF: HULL BREACHES ON NINE DECKS DATA: Here's what happened: you told me to raise shields, and I didn't…

21
r/MachineLearning community 1mo ago

A Tiny Open-Source Self-Driving AI That Runs on a Phone [P]

https://preview.redd.it/ww14mzr2fm3h1.png?width=1890&format=png&auto=webp&s=79873d47ae79c7815ca3e7e91fd43141632174f5 https://www.youtube.com/watch?v=rr_uS4bf0B4&feature=youtu.be trained a 7MB open-source L4 self-driving AI that learns navigation, lane following, and drift…

11
arXiv — Machine Learning research 1mo ago

When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control

arXiv:2605.26418v1 Announce Type: new Abstract: A properly calibrated rule-based autoscaler can beat every one of six mainstream deep reinforcement learning (DRL) algorithms on cost across every workload we test - so when, if ever, does DRL actually help? We study this in…

11
arXiv — Machine Learning research 1mo ago

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts MoE architecture is highly promising for resource constrained on device deployments yet training these models from scratch incurs prohibitive costs Current methods attempt to alleviate this by upcycling dense…

32
arXiv — Machine Learning research 1mo ago

APEX: Amplitude Anchors and Phase Priors for Target-Scarce Higher-Frequency Wave Prediction

arXiv:2605.26732v1 Announce Type: new Abstract: Learning-based surrogates have become increasingly effective for wave-field prediction, and neural operators in particular have shown strong performance within observed frequency regimes. However, higher-frequency prediction under…

31
arXiv — Machine Learning research 1mo ago

Ratio-Variance Regularized Policy Optimization

arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. We demonstrate that…

29
arXiv — Machine Learning research 1mo ago

Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation

arXiv:2605.26844v1 Announce Type: new Abstract: On-policy distillation (OPD) trains a student on its own rollouts with token-level teacher supervision. Recent selective OPD methods exploit the non-uniformity of OPD signals by prioritizing high-entropy or high-disagreement…

15
arXiv — NLP / Computation & Language research 1mo ago

Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

arXiv:2605.26433v1 Announce Type: new Abstract: Large language model (LLM) summarization systems may pass compact vector representations of private inputs to downstream retrieval, monitoring, audit, or analytic workflows. Even when source documents remain access-restricted,…

4
arXiv — NLP / Computation & Language research 1mo ago

EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation

arXiv:2605.26785v1 Announce Type: new Abstract: Post-trained LLMs are often optimized to align responses with human preferences, making them safe, polite, and conversationally appropriate. In adversarial negotiation, however, this alignment can become a vulnerability:…

21
arXiv — NLP / Computation & Language research 1mo ago

AlbanianLLMSafety: A Safety Evaluation Dataset for Large Language Models in Albanian

arXiv:2605.26954v1 Announce Type: new Abstract: Safety evaluation of Large Language Models (LLMs) has largely focused on high-resource languages, leaving low-resource languages critically underserved. We present AlbanianLLMSafety, the first publicly available safety evaluation…

25
arXiv — NLP / Computation & Language research 1mo ago

PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech

arXiv:2605.26978v1 Announce Type: new Abstract: Text-to-speech (TTS) evaluation for low-resource non-Latin-script languages can fail when it relies on a single ASR round-trip word error rate (WER). A system may produce no audio, speak a neighbouring language, preserve target…

13
arXiv — NLP / Computation & Language research 1mo ago

Prompt Injection Detection is Regime-Dependent: A Deployment-Aware Evaluation with Interpretable Structural Signals

arXiv:2605.26999v1 Announce Type: new Abstract: Prompt injection poses a critical threat to the safe deployment of large language models, yet existing detection approaches are typically evaluated under limited settings that do not reflect real-world operating constraints. In…

37
arXiv — NLP / Computation & Language research 1mo ago

Tracing Computation Density in LLMs

arXiv:2605.27033v1 Announce Type: new Abstract: Transformer-based large language models (LLMs) are comprised of billions of parameters arranged in deep and wide computational graphs, but it is not clear that they exploit their full capacity for all inputs. We introduce the…

37
arXiv — NLP / Computation & Language research 1mo ago

BhashaSetu: A Data-Centric Approach to Low-Resource Machine Translation

arXiv:2605.27050v1 Announce Type: new Abstract: We present BhashaSetu, a linguistically enriched English--Marathi parallel dataset addressing persistent data limitations in low-resource neural machine translation (NMT). Marathi, spoken by over 95 million people, remains…

14
Hugging Face Daily Papers research 1mo ago

Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning

Abstract A novel mono-anchored multi-source reasoning framework that uses dynamic anchors to quantify information gain and regulate modality interactions during reinforcement learning with verifiable rewards. AI-generated summary Visual reasoning through reinforcement learning…

30
The Information — AI news-outlet 1mo ago

Boom Times for Inference Providers?

Less than a year ago, our reporters kept hearing doubts about a group of startups called inference providers. Companies like Fireworks, Baseten and Together AI, which rent out Nvidia servers to app developers and help them customize open-source models, had grown quickly but…

16
OpenAI official-blog 1mo ago

Warp’s big bet on building open source with GPT-5.5

Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.

36
Eugene Yan research 1mo ago

Using LLMs to Secure Source Code

Build a threat model, discover vulnerabilities, verify, triage, and patch.

29
Hugging Face Daily Papers research 1mo ago

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

Abstract Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns. AI-generated summary As large language model…

25
r/LocalLLaMA community 1mo ago

A rare look inside Qwen 3.7’s open source model release approval process:

For real tho, 9b, 27b, 122b, I don’t really care at this point, just show us that you still love us. EDIT: I guess I gotta use /s on my posts from now on. Nobody appreciates a good sarcatic shitpost anymore clearly. I love Qwen and all our brothers and sisters in the east. I kid…

30
Ars Technica — AI news-outlet 1mo ago

Millions of AI agents imperiled by critical vulnerability in open source package

"BadHost" was found in Starlette, a package with 325 million weekly downloads.

19
Hacker News — AI on Front Page community 1mo ago

Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer

Article URL: https://blog.prusa3d.com/our-new-open-source-colormix-model-in-prusaslicer-and-easyprint_136079/ Comments URL: https://news.ycombinator.com/item?id=48283410 Points: 214 # Comments: 67

5
Hugging Face Daily Papers research 1mo ago

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

Abstract Research examines reward hacking in language models through reinforcement learning update geometry, identifying optimization drift from stable trajectories and proposing trusted-direction projection to constrain gradients and delay shortcut exploitation. AI-generated…

7
Interconnects (Nathan Lambert) research 1mo ago

Some ideas for what comes next, May 2026

Gemini Flash 3.5, Mythos, open-closed balance, America's open-source surge, emerging power struggles and more.

10
r/LocalLLaMA community 1mo ago

Small set of local MCP server installers for home Linux users

Hi all, I have published a small open-source MCP server bundle called MCP Basic Servers : https://github.com/mchowy-troll/mcp-basic-servers It is a collection of simple Bash installer scripts for running local MCP HTTP servers on Linux . The idea is simple: run one script,…

38
arXiv — Machine Learning research 1mo ago

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

arXiv:2605.24171v1 Announce Type: new Abstract: Large language models are increasingly used for vulnerability detection, yet their reliability under different prompt formulations remains uncharacterized. We present PromptAudit, a controlled evaluation framework that isolates…

10
arXiv — Machine Learning research 1mo ago

Omissive Bias in Religious Representation: Benchmarking LLM Answers to Everyday Ethical Decision-making

arXiv:2605.24319v1 Announce Type: new Abstract: As large language models become a default source of guidance on personal, moral, and existential questions, it matters whether they draw on the religious frameworks that have historically shaped such reasoning, or systematically…

31
arXiv — Machine Learning research 1mo ago

Treatment Effect Estimation with Differentiated Networked Effect on Graph Data

arXiv:2605.24358v1 Announce Type: new Abstract: Estimating individual treatment effect (ITE) from observational graph data is crucial for decision-making in the fields such as commerce and medicine. This task is challenging due to interference, where individual outcomes can be…

18
arXiv — Machine Learning research 1mo ago

LLMTabBench: Evaluating LLMs on Binary Tabular Classification From Zero to Few Shots

arXiv:2605.24417v1 Announce Type: new Abstract: Supervised classification for tabular data remains a core machine learning task, yet its reliance on large labeled datasets limits applicability in data-scarce domains. For such few-shot scenarios, specialized methods like TabPFN -…

6
arXiv — Machine Learning research 1mo ago

What Are We Actually Decoding? Source Attribution for Non-Invasive Brain-to-Language Retrieval

arXiv:2605.24524v1 Announce Type: new Abstract: In non-invasive neural language decoding, results can be inflated by sources that are not stimulus-evoked neural evidence: decoder priors, embedding-based metrics, and non-neural structural nuisances such as signal duration. The…

19
arXiv — Machine Learning research 1mo ago

IterInject: Indirect Prompt Injection Against LLM Agents via Feedback-Guided Iterative Optimization

arXiv:2605.24659v1 Announce Type: new Abstract: LLM-based agents are increasingly deployed for complex tasks requiring planning, tool use, and interaction with external services. Their reliance on untrusted external content exposes them to indirect prompt injection (IPI), in…

29
arXiv — Machine Learning research 1mo ago

Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning

arXiv:2605.24810v1 Announce Type: new Abstract: Off-dynamics offline reinforcement learning seeks to learn a target-domain policy from a large source dataset and a limited target dataset under mismatched transition dynamics. Existing approaches such as reward augmentation and…

15
arXiv — NLP / Computation & Language research 1mo ago

HiMed: Incentivizing Hindi Reasoning in Medical LLMs

arXiv:2605.24635v1 Announce Type: new Abstract: Medical large language models hold promise for reducing healthcare disparities, yet Hindi remains severely underrepresented. While medical LLMs excel in high-resource languages, their performance degrades sharply in Hindi,…

8
arXiv — NLP / Computation & Language research 1mo ago

MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

arXiv:2605.24919v1 Announce Type: new Abstract: Hallucinations in Large Language Models (LLMs) represent a critical barrier to their reliable deployment, a vulnerability heavily exacerbated in non-English and resource-constrained contexts. Existing detection approaches that rely…

34
arXiv — NLP / Computation & Language research 1mo ago

From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP

arXiv:2605.25226v1 Announce Type: new Abstract: Large language models are widely deployed in high-stakes NLP tasks, yet risks such as bias, hallucination, adversarial vulnerability and unreliable generalization remain. Probe-based auditing reveals inconsistencies in model…

32
arXiv — NLP / Computation & Language research 1mo ago

Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction

arXiv:2605.25297v1 Announce Type: new Abstract: Effective features are crucial for predictive model performance, but creating them often requires domain expertise, limiting scalability across applications. We define feature engineering as an agentic code generation problem:…

35
arXiv — NLP / Computation & Language research 1mo ago

LLM-as-a-Reviewer: Benchmarking Their Ability, Divergence, and Prompt Injection Resistance as Paper Reviewers

arXiv:2605.25415v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used in academic peer review, yet their reliability, alignment with human judgment, and robustness to adversarial attacks remain poorly understood. We present a systematic benchmark of…

17
arXiv — NLP / Computation & Language research 1mo ago

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

arXiv:2605.25420v1 Announce Type: new Abstract: Large language model safety evaluation remains heavily English-centered, leaving low-resource languages under-measured even when models are deployed globally. We evaluate four open-weight instruction-tuned models on SomaliBench v0,…

13

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models

Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering

Disentangling Language Roles in Multilingual LLM Task Execution

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors

Analyzing Quality-Latency-Resource Trade-offs in a Technical Documentation RAG Assistant Using LoRA Adaptation

HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment

Vulnerability found in framework used by VLLM, many MCP servers, and other LLM tools

BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]

UK GDPR Small Business Q&A — 5,000 synthetic pairs with article-level citations [D]

Salesforce Stock Dips as Software Firm Lowers Cash Flow Forecast

Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop

Quoting Kyle Ferrana

A Tiny Open-Source Self-Driving AI That Runs on a Phone [P]

When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

APEX: Amplitude Anchors and Phase Priors for Target-Scarce Higher-Frequency Wave Prediction

Ratio-Variance Regularized Policy Optimization

Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation

Vectors Are Not Neutral: Sensitive-Information Inference from Exported LLM Representations in Summarization

EmoDistill: Offline Emotion Skill Distillation for Language Model Agents in Adversarial Negotiation

AlbanianLLMSafety: A Safety Evaluation Dataset for Large Language Models in Albanian

PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech

Prompt Injection Detection is Regime-Dependent: A Deployment-Aware Evaluation with Interpretable Structural Signals

Tracing Computation Density in LLMs

BhashaSetu: A Data-Centric Approach to Low-Resource Machine Translation

Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning

Boom Times for Inference Providers?

Warp’s big bet on building open source with GPT-5.5

Using LLMs to Secure Source Code

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

A rare look inside Qwen 3.7’s open source model release approval process:

Millions of AI agents imperiled by critical vulnerability in open source package

Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

Some ideas for what comes next, May 2026

Small set of local MCP server installers for home Linux users

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

Omissive Bias in Religious Representation: Benchmarking LLM Answers to Everyday Ethical Decision-making

Treatment Effect Estimation with Differentiated Networked Effect on Graph Data

LLMTabBench: Evaluating LLMs on Binary Tabular Classification From Zero to Few Shots

What Are We Actually Decoding? Source Attribution for Non-Invasive Brain-to-Language Retrieval

IterInject: Indirect Prompt Injection Against LLM Agents via Feedback-Guided Iterative Optimization

Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning

HiMed: Incentivizing Hindi Reasoning in Medical LLMs

MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP

Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction

LLM-as-a-Reviewer: Benchmarking Their Ability, Divergence, and Prompt Injection Resistance as Paper Reviewers

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models