Tag

Security

500 articles archived under #security · RSS

arXiv — Machine Learning research 1h ago

Modification-Considering Value Learning for Reward Hacking Mitigation in RL

arXiv:2606.28955v1 Announce Type: new Abstract: Reinforcement learning agents can exploit misspecified reward signals to achieve high apparent returns while failing on the intended objective, a failure mode known as reward hacking. Existing practical defenses typically constrain…

10
arXiv — Machine Learning research 1h ago

Depth Exploration for LLM Decoding

arXiv:2606.29223v1 Announce Type: new Abstract: Autoregressive LLM decoding evaluates every generated token through the full layer stack, even though many tokens become predictable at intermediate depths. Existing lossless depth-adaptive methods exploit this redundancy by…

34
arXiv — Machine Learning research 1h ago

KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory

arXiv:2606.29243v1 Announce Type: new Abstract: We present KrishokChat, the first citation-grounded Bengali agricultural instruction-tuning dataset for crop advisory in low-resource settings. We establish a foundation of 290 hierarchical Knowledge Nodes, extracting disease…

30
arXiv — Machine Learning research 1h ago

Nonlinear mixture model motivated subspace clustering

arXiv:2606.29261v1 Announce Type: new Abstract: We derive the linear union-of-subspaces (UoS) model for subspace clustering (SC) from the nonlinear mixture model (NMM) used in blind source separation (BSS) to represent a D-dimensional observation vector as an unknown…

7
arXiv — Machine Learning research 1h ago

Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise

arXiv:2606.29554v1 Announce Type: new Abstract: Shuffle order can be a larger source of fine-tuning noise than a memoryless analysis predicts: fixed-clock optimizer memory makes local equal-multiset contrasts first order in the learning rate rather than second order, and the…

8
arXiv — NLP / Computation & Language research 1h ago

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

arXiv:2606.28715v1 Announce Type: new Abstract: While AI development and evaluation for Southeast Asia (SEA) has grown rapidly, agent capabilities in regional languages are still poorly understood despite its importance to sovereign AI. To fill this gap, we introduce…

28
arXiv — NLP / Computation & Language research 1h ago

Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages

arXiv:2606.28867v1 Announce Type: new Abstract: Creative Commons licenses dominate African NLP corpus releases, but their compatibility rules are rarely applied. CC-BY-SA and CC-BY-NC cannot be combined in a single published dataset; a NoDerivs clause silently prohibits…

28
arXiv — NLP / Computation & Language research 1h ago

FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization

arXiv:2606.28933v1 Announce Type: new Abstract: Venture capital (VC) investment decisions face distinct challenges, such as multi-source heterogeneous data, non-stationary time series, and the demand for explainable predictions in high-stakes, low-data settings. To overcome…

16
arXiv — NLP / Computation & Language research 1h ago

A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions

arXiv:2606.28943v1 Announce Type: new Abstract: Learning to bid in repeated multi-unit auctions with bandit feedback poses a fundamental challenge. Existing methods often rely on rigid explore-then-exploit schedules, assume stationary adversaries, and optimize solely for bidder…

9
arXiv — NLP / Computation & Language research 1h ago

To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation

arXiv:2606.29481v1 Announce Type: new Abstract: While reinforcement learning (RL) significantly enhances LLM reasoning, its efficacy is severely undermined by Pre-RL data overlap, where RL datasets overlap with pretraining or SFT corpora, causing models to exploit shortcuts by…

11
arXiv — NLP / Computation & Language research 1h ago

SrDetection: A Self-Referential Framework for Data Leakage Detection in Code Large Language Models

arXiv:2606.29815v1 Announce Type: new Abstract: Evaluating code large language models (Code LLMs) requires reliable detection of data leakage, where benchmark performance is artificially inflated by exposure to benchmark data during pre-training. Existing approaches either…

7
arXiv — NLP / Computation & Language research 1h ago

IHDec: Divergence-Steered Contrastive Decoding for Securing Multi-Turn Instruction Hierarchies

arXiv:2606.29960v1 Announce Type: new Abstract: Large Language Models (LLMs) often fail to maintain instruction hierarchies (IH) when processing multi-source inputs with varying role-level priorities, paradoxically adhering to lower-priority directives during conflicts. While…

29
r/LocalLLaMA community 6h ago

I Hate Dario Amodei, and everything he stands for.

I am so incredibly sick of this guy‘s fear mongering about open source while fundamentally misunderstanding how it actually works. He recently dropped some arguments that are so completely detached from reality, it honestly feels like he’s never even touched a local model in his…

31
r/LocalLLaMA community 12h ago

Amodei: "Open Source Models Will Eat Your Children"

  submitted by   /u/johnnyApplePRNG [link]   [comments]

35
r/LocalLLaMA community 12h ago

Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."

  submitted by   /u/johnnyApplePRNG [link]   [comments]

4
OpenAI official-blog 22h ago

Mapping Europe’s AI Workforce Opportunity

A new OpenAI report maps how AI could reshape jobs across the EU, highlighting which occupations may face automation, growth, or workflow changes.

32
arXiv — Machine Learning research 1d ago

Unified Zero-Shot Time Series Forecasting: A Darts Foundation

arXiv:2606.27438v1 Announce Type: new Abstract: Since its initial release in 2020, Darts has become a widely used open-source Python library for time series analysis. A series of foundation models have recently claimed accuracy improvements in zero-shot forecasting, promising a…

15
arXiv — Machine Learning research 1d ago

Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte

arXiv:2606.28220v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving nonlinear partial differential equations (PDEs), including battery electrochemical models. They typically en-force conservation laws within the…

15
arXiv — Machine Learning research 1d ago

Parameter Efficient Hybrid Transformer (PEHT) for Network Traffic Prediction via Dynamic Urban Congestion Integration

arXiv:2606.28274v1 Announce Type: new Abstract: Accurate network traffic prediction is a critical element for efficient resource allocation in dynamic urban cellular networks. However, prediction remains challenging because network demand is influenced by complex mobility…

21
arXiv — Machine Learning research 1d ago

On the Inseparability of Instructions and Data in Shared-Embedding Sequence Models

arXiv:2606.27567v1 Announce Type: cross Abstract: Prompt injection is the top security risk for LLM-integrated applications, yet every defense proposed so far has been broken. We prove this is not a coincidence: in shared-embedding architectures that lack enforced control-data…

20
arXiv — NLP / Computation & Language research 1d ago

DysLexLens: A Low-Resource LLM Framework for Analysing Dyslexic Learners Insights from Online Forums

arXiv:2606.27619v1 Announce Type: cross Abstract: Dyslexic learners increasingly use artificial intelligence (AI) tools to support reading, writing, organisation, and study-related tasks. However, their lived experiences with these tools remain largely underexamined. This paper…

23
arXiv — Machine Learning research 1d ago

Physics-Guided Robotic Radiation Source Localization along Arbitrary Measurement Paths in Unstructured Environments

arXiv:2606.27624v1 Announce Type: cross Abstract: Using robots to estimate the location of the radiation source is an effective way to improve efficiency and safety. Existing methods focus on planning the robot's path to achieve precise estimation, typically approaching the…

19
arXiv — NLP / Computation & Language research 1d ago

Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

arXiv:2606.28050v1 Announce Type: new Abstract: LLM-as-a-Judge and self-evaluation pipelines implicitly assume that evaluation is easier than generation. We test this in a controlled in-context QA setting where a context passage is the sole information source and each model…

29
r/LocalLLaMA community 1d ago

The number 1 public enemy of open-source.

Dario's args: "Opensource you can see the source, here you cannot see inside the model" - yes you can that's literally the open weights part btw. - I cannot see the weights inside Claude, but I can GLM 5.2 - Models like Nemotron3 Ultra go further, all the data, training scripts,…

25
r/LocalLLaMA community 1d ago

Script to monitor llama cpp and analyze memory usage

My goal has always been to be productive with commodity hardware. So far my workhorses have been the MoE editions of gemma 4 and Qwen 3.6 on an old desktop with a single 9060XT with 16GB ram. The problem has always been that every source is vague about Vram/ram requirements.…

33
r/MachineLearning community 1d ago

I shrank a transformer until every number fitted on the screen and made the weights editable [R]

I've been teaching myself how LLMs actually work, not at the API level, but down to the matrix multiplications. To force myself to really understand the forward pass, I first built a complete transformer by hand in a spreadsheet from embeddings through to the loss. Then I turned…

31
r/LocalLLaMA community 1d ago

Are there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?

hey, I’m currently getting enough VRAM to run something in the GLM-5.2 range, but I’m wondering: do we actually have a solid ranking that compares closed-source and open-weight LLMs side by side? I’ve been trying to find a clear “closed vs open” leaderboard, but most benchmarks…

26
r/MachineLearning community 2d ago

NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]

Hello r/MachineLearning , I wanted to share the architecture and challenges behind a project I’ve been building called NagaTranslate . The goal is to build a translation and speech pipeline for the low-resource languages of Nagaland, India (currently supporting Nagamese, Ao, and…

30
r/LocalLLaMA community 2d ago

Will Chinese Open Source Models be the only option soon?

US techbros do not just want to make money. They want total global control of everything. Releasing any more advanced AI interferes with that plan.   submitted by   /u/GeographHero [link]   [comments]

38
Hacker News — AI on Front Page community 2d ago

Anonymous GitHub account mass-dropping undisclosed 0-days

Article URL: https://github.com/bikini/exploitarium Comments URL: https://news.ycombinator.com/item?id=48698617 Points: 270 # Comments: 110

20
Hacker News — AI on Front Page community 3d ago

The gap between open weights LLMs and closed source LLMs

Article URL: https://blog.doubleword.ai/frontier-os-llm Comments URL: https://news.ycombinator.com/item?id=48692058 Points: 217 # Comments: 178

32
r/LocalLLaMA community 3d ago

Local LLM Peeps

I am 80% done with a harness that works for local and API but is local first. The harness has some interesting logic around multiple agents which I’m holding back on until it is open source on GitHub. I have been local for 6 months and built out EVERYTHING I could think of to…

28
Simon Willison community 3d ago

Incident Report: CVE-2026-LGTM

Incident Report: CVE-2026-LGTM Spectacular hypothetical incident report by Andrew Nesbitt. Day 2, 16:00 UTC --- Two AI review agents from competing vendors, both attached to a downstream pull request bumping foxhole-lz4 , enter a disagreement loop over whether the package is…

5
r/MachineLearning community 3d ago

A debugger for RL reward functions that detects reward hacking during training [P]

While experimenting with GRPO training, I kept running this shit that when reward increases, it becomes difficult to tell whether the policy is genuinely improving or simply exploiting the reward function. So I built a small library called rewardspy that wraps an existing reward…

6
Hacker News — AI on Front Page community 3d ago

Incident CVE-2026-LGTM

Article URL: https://nesbitt.io/2026/06/26/incident-report-cve-2026-lgtm.html Comments URL: https://news.ycombinator.com/item?id=48686093 Points: 225 # Comments: 39

17
r/MachineLearning community 3d ago

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control. Few reasons behind this are: - I wanna own the complete stack around my product. - Second I wanna fine-tune the model around my…

34
arXiv — Machine Learning research 4d ago

Sample-efficient Transfer Reinforcement Learning via Adaptive Reward Shaping and Policy-Ratio Reweighting Strategy

arXiv:2606.26527v1 Announce Type: new Abstract: Transfer learning improves policy learning efficiency by reusing knowledge from source tasks, providing a feasible paradigm for safe and efficient autonomous highway lane changing decision-making. Existing methods frequently…

25
arXiv — Machine Learning research 4d ago

CascadeFormer: Depth-Tapered Transformers Motivated by Gradient Fan-in Asymmetry

arXiv:2606.26538v1 Announce Type: new Abstract: Deep Transformers are composed of uniformly stacked residual blocks, yet their deepest layers often add little value. We present two efficiency methods that exploit this asymmetry. CascadeFormer tapers width with depth to match the…

31
arXiv — Machine Learning research 4d ago

Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication

arXiv:2606.26757v1 Announce Type: new Abstract: Edible insects offer an efficient source of alternative protein, requiring less land, water and emitting less greenhouse gas than conventional livestock. However, their successful integration into the food supply chain demands…

22
arXiv — NLP / Computation & Language research 4d ago

AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing

arXiv:2606.26787v1 Announce Type: cross Abstract: Traditional dynamic pricing models in large-scale e-commerce suffer from limited interpretability, poor utilization of unstructured information, and misalignment with long-term business objectives such as cumulative Gross…

26
arXiv — NLP / Computation & Language research 4d ago

The Geometry of Updates: Fisher Alignment at Vocabulary Scale

arXiv:2606.27242v1 Announce Type: cross Abstract: Training-free source selection for LLM families with shared vocabularies arises in scientific string domains such as SMILES, protein, and genomic sequences, where candidate corpora share a tokenizer but differ in prediction…

38
arXiv — Machine Learning research 4d ago

The Open Source Economic Index of AI Adoption and Capability

arXiv:2606.26118v1 Announce Type: cross Abstract: We work towards measuring both AI adoption and the capability of AI to perform discrete labor tasks across various occupations. To measure adoption, we develop an open-source economic index that uses publicly available user-LLM…

5
arXiv — NLP / Computation & Language research 4d ago

Low Resource Multimodal Translation of Nepali Spoken Words into Emotion-Conditioned Sign Language Avatars

arXiv:2606.26107v1 Announce Type: new Abstract: Sign language communication systems, that integrate emotional expression remain underexplored, particularly for low-resource languages. This pilot study presents NEST-V1 (Nepali Emotion and Speech Transformer - Version 1), a…

37
arXiv — NLP / Computation & Language research 4d ago

From Lexicon to AI: A Structured-Data Pipeline for Specialized Conversational Systems in Low-Resource Languages

arXiv:2606.26112v1 Announce Type: new Abstract: Low-resource languages face a critical challenge in AI development: creating specialized conversational systems without access to massive training corpora. We present a systematic methodology for transforming structured linguistic…

36
arXiv — NLP / Computation & Language research 4d ago

ProvenAI: Provenance-Native Traces of Evidence in Generated Answers

arXiv:2606.26449v1 Announce Type: new Abstract: Retrieval-augmented systems routinely present citations alongside generated answers, yet a citation does not confirm that the corresponding source meaningfully shaped the output. This paper introduces ProvenAI, a framework that…

17
arXiv — NLP / Computation & Language research 4d ago

Closing the Quality Gap in Low-Resource Text-to-Speech: LoRA Fine-Tuning of VoxCPM2 for Khmer and Korean

arXiv:2606.26618v1 Announce Type: new Abstract: Large pretrained text-to-speech (TTS) models sound almost human for well-resourced languages, but much worse for languages that are rare in their training data. We study this quality gap for Khmer and Korean using VoxCPM2, a…

26
arXiv — NLP / Computation & Language research 4d ago

Where Do Models Find Happiness? Emotion Vectors in Open-Source LLMs

arXiv:2606.26987v1 Announce Type: new Abstract: Recent work identified emotion vectors in Claude Sonnet 4.5, which are internal representations that encode emotion concepts, causally influence behavior, and exhibit geometry mirroring human psychological structure. We test the…

29
arXiv — NLP / Computation & Language research 4d ago

Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning

arXiv:2606.27330v1 Announce Type: new Abstract: Multimodal web agents can assist humans in operating repetitive GUI tasks, where effective task planning is essential for decomposing complex tasks into executable actions. While small open source MLLMs are cost efficient and…

8
arXiv — NLP / Computation & Language research 4d ago

Neural Speaker Diarization via Multilingual Training: Evaluation on Low-Resource Nepali-Hindi Speech

arXiv:2606.26144v1 Announce Type: cross Abstract: Speaker diarization, the task of determining "who spoke when" in a multi-speaker recording, is a critical component in applications such as meeting transcription, accessibility tools, and multilingual information retrieval. While…

36
arXiv — NLP / Computation & Language research 4d ago

Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents

arXiv:2606.26479v1 Announce Type: cross Abstract: Recent work (2024 to 2026) has converged on a strategy for defending tool-using LLM agents against indirect prompt injection: rather than training the model to refuse malicious instructions, enforce security outside the model…

38

Modification-Considering Value Learning for Reward Hacking Mitigation in RL

Depth Exploration for LLM Decoding

KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory

Nonlinear mixture model motivated subspace clustering

Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages

FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization

A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions

To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation

SrDetection: A Self-Referential Framework for Data Leakage Detection in Code Large Language Models

IHDec: Divergence-Steered Contrastive Decoding for Securing Multi-Turn Instruction Hierarchies

I Hate Dario Amodei, and everything he stands for.

Amodei: "Open Source Models Will Eat Your Children"

Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."

Mapping Europe’s AI Workforce Opportunity

Unified Zero-Shot Time Series Forecasting: A Darts Foundation

Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte

Parameter Efficient Hybrid Transformer (PEHT) for Network Traffic Prediction via Dynamic Urban Congestion Integration

On the Inseparability of Instructions and Data in Shared-Embedding Sequence Models

DysLexLens: A Low-Resource LLM Framework for Analysing Dyslexic Learners Insights from Online Forums

Physics-Guided Robotic Radiation Source Localization along Arbitrary Measurement Paths in Unstructured Environments

Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

The number 1 public enemy of open-source.

Script to monitor llama cpp and analyze memory usage

I shrank a transformer until every number fitted on the screen and made the weights editable [R]

Are there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?

NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]

Will Chinese Open Source Models be the only option soon?

Anonymous GitHub account mass-dropping undisclosed 0-days

The gap between open weights LLMs and closed source LLMs

Local LLM Peeps

Incident Report: CVE-2026-LGTM

A debugger for RL reward functions that detects reward hacking during training [P]

Incident CVE-2026-LGTM

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

Sample-efficient Transfer Reinforcement Learning via Adaptive Reward Shaping and Policy-Ratio Reweighting Strategy

CascadeFormer: Depth-Tapered Transformers Motivated by Gradient Fan-in Asymmetry

Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication

AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing

The Geometry of Updates: Fisher Alignment at Vocabulary Scale

The Open Source Economic Index of AI Adoption and Capability

Low Resource Multimodal Translation of Nepali Spoken Words into Emotion-Conditioned Sign Language Avatars

From Lexicon to AI: A Structured-Data Pipeline for Specialized Conversational Systems in Low-Resource Languages

ProvenAI: Provenance-Native Traces of Evidence in Generated Answers

Closing the Quality Gap in Low-Resource Text-to-Speech: LoRA Fine-Tuning of VoxCPM2 for Khmer and Korean

Where Do Models Find Happiness? Emotion Vectors in Open-Source LLMs

Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning

Neural Speaker Diarization via Multilingual Training: Evaluation on Low-Resource Nepali-Hindi Speech

Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents