Tag

Security

500 articles archived under #security

The Information — AI news-outlet 28d ago

Anthropic’s Mythos Is a Security Powerhouse. It’s Also a Budget Buster

When Palo Alto Networks earlier this year began testing Anthropic’s Claude Mythos to comb through its own source code, it didn’t take long to see the future of cybersecurity. It also didn’t take long to see what that future would cost. The model found more than two dozen…

33
r/LocalLLaMA community 29d ago

Just found a 1-click RCE in pewdiepie's Odysseus Chat

PR being submitted to help the project as we speak. Sound on for extra lols. Submitted by /u/theonejvo

7
The Information — AI news-outlet 29d ago

China’s MiniMax Launches New Model as Open-Source AI Coding Battle Heats Up

Chinese AI developer MiniMax on Monday launched a new large language model called M3, saying the new model’s coding capability approaches that of Anthropic’s Opus 4.7, which was released in April. The new MiniMax model is particularly suitable for coding and complex multi-step…

23
Smol AI News news-outlet 29d ago

not much happened today

**NVIDIA** led open-source AI model releases with **Cosmos 3**, a comprehensive omnimodal world model unifying language, image, video, audio, and action using a Mixture-of-Transformers design, and **Nemotron 3 Ultra**, a **550B** parameter open-weight model noted for high…

33
The Information — AI news-outlet 29d ago

US Clarifies AI Chip Export Loophole

The U.S. Commerce Department issued guidance on Sunday saying Chinese companies still need licences to buy advanced US AI chips, even when they try to do it through an overseas offshoot. The move shuts a potential workaround that let Chinese firms source powerful US AI chips…

14
arXiv — Machine Learning research 29d ago

Can Subgraph Explanations Be Weaponized to Steal Graph Neural Networks?

arXiv:2605.30470v1 Announce Type: new Abstract: Graph Machine Learning as a Service (GMLaaS) platforms increasingly implement explainability interfaces to meet regulatory transparency requirements. However, this transparency creates exploitable vulnerabilities for model…

9
arXiv — Machine Learning research 29d ago

DisasterLex: An Expert Concept-to-Schema Knowledge Graph for Geospatial Reasoning in Disaster Analytics

arXiv:2605.30538v1 Announce Type: new Abstract: Disasters are inevitable and increasingly costly, and effective response depends on querying structured tabular data: precise, information-dense records of hazard, exposure, vulnerability, and lifeline infrastructure that underpin…

27
arXiv — Machine Learning research 29d ago

Beyond Accuracy: Evaluating Efficiency, Robustness and Explainability in Deep Learning for Malaria Diagnosis

arXiv:2605.30734v1 Announce Type: new Abstract: Malaria remains a leading cause of mortality in sub-Saharan Africa, where scarce diagnostic infrastructure makes timely, accurate diagnosis particularly challenging. While deep learning offers a compelling path toward automated…

14
arXiv — Machine Learning research 29d ago

Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences

arXiv:2605.30873v1 Announce Type: new Abstract: Federated Learning (FL) offers a privacy-preserving pathway for aligning Large Language Models (LLMs); however, existing frameworks typically enforce a monolithic reward model, inevitably averaging out inherently conflicting user…

35
arXiv — NLP / Computation & Language research 29d ago

Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs

arXiv:2605.30501v1 Announce Type: new Abstract: Watermarking embeds statistical signatures in AI-generated text for detection and attribution. We reveal a fundamental vulnerability: when users access multiple models (today's reality), watermarks trivially fail. Watermarks…

21
arXiv — NLP / Computation & Language research 29d ago

ElasticMem: Latent Memory as a Learnable Resource for LLM Agents

arXiv:2605.30690v1 Announce Type: new Abstract: Long-term memory is essential for LLM agents to reason coherently across extended interactions, personalize responses, and reuse past experience. However, existing memory-augmented methods typically treat memory as a fixed…

29
arXiv — NLP / Computation & Language research 29d ago

Eywa: Provenance-Grounded Long-Term Memory for AI Agents

arXiv:2605.30771v1 Announce Type: new Abstract: AI agents that persist across sessions need memory they can retrieve, audit, update, and erase. Existing memory systems often collapse source evidence, extracted facts, retrieved context, and answer policy into one opaque prompt…

7
arXiv — NLP / Computation & Language research 29d ago

Do Large Language Models Encode Institutional Experience? Evidence from Cross-Linguistic Moral Reasoning Under Ambiguity

arXiv:2605.30934v1 Announce Type: new Abstract: Large language models (LLMs) exhibit systematic differences in moral reasoning across languages, yet the source of this variation remains unclear. We test the hypothesis that languages encode aspects of the institutional…

24
arXiv — NLP / Computation & Language research 29d ago

Multilingual and Cross-Lingual Citation Needed Detection on Wikipedia for Lower-Resource Languages

arXiv:2605.31136v1 Announce Type: new Abstract: In automated fact-checking (AFC), check-worthiness detection identifies claims requiring verification based on domain-specific criteria. On Wikipedia, this task instantiates as Citation Needed Detection (CND), which flags claims…

34
arXiv — NLP / Computation & Language research 29d ago

Learning Whom to Trust: Market-Feedback Adaptive Retrieval for Frozen LLMs in Event-Driven Financial RAG

arXiv:2605.31201v1 Announce Type: new Abstract: Financial retrieval-augmented generation (RAG) systems typically rank evidence by textual relevance, but in financial markets the useful evidence source depends on event type, forecast horizon, and market context. We study…

20
arXiv — NLP / Computation & Language research 29d ago

“Înțelegi Românește?” A Recipe for Romanian Vision-Language Models

arXiv:2605.31401v1 Announce Type: new Abstract: Vision-Language Models (VLMs) largely follow the text-only LLM trajectory, excelling on English benchmarks but sharply degrading on low-resource languages, where neither large-scale image-text corpora nor culturally grounded…

23
Hugging Face Daily Papers research 29d ago

From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors

Abstract Multi-step trojan attacks in local LLM agents can bypass existing defenses by embedding malicious prompts across multiple operations, requiring new detection methods like DASGuard for effective protection. AI-generated summary LLM agents are evolving from conversational…

20
r/LocalLLaMA community 29d ago

G7 agrees on shared language around open-source AI and open weights AI

Basically stuff we already knew here, but now governments understand it too. I found the news here: https://www.phoronix.com/news/G7-On-Open-Source-AI Submitted by /u/Kahvana

16
r/MachineLearning community 29d ago

Built an AI Accelerator and opensourced it. [P]

There is a huge gap in open-source AI accelerators, so I implemented my own. The popular, well-known ones are already legacy and don't support contemporary operations like attention. Here is what makes mine special: Attention mechanism smelted directly into silicon Prototyped…

25
The Information — AI news-outlet 1mo ago

SpaceX Is Awarded $4 billion Contract with U.S. Space Force

The U.S. Space Force awarded a $4.16 billion contract to SpaceX as part of a program to deploy space-based sensors to track and target airborne threats. The deal for the Space-Based Airborne Moving Target Indicator program, announced Friday, highlights the increasingly important…

14
r/MachineLearning community 1mo ago

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

P.S. Not pitching anything; just trying to understand where reality differs from the narrative. We're a couple of ML students who have mostly worked on ML/software before, but over the last few months we've been playing with VLAs, robot datasets, and trying to understand where the field…

27
r/LocalLLaMA community 1mo ago

Open source : Turning vocal imitations into sound effects. (New UX for sound generation)

Hello guys I want to introduce my new project! Have you ever needed a specific sound while making a video or a game? You know exactly what it sounds like in your head, but have no idea how to search for it. That’s why sound design meetings at game studios often turn into people…

12
Hugging Face Daily Papers research 1mo ago

Convex Low-resource Accent-Robust Language Detection in Speech Recognition

Abstract A novel convex optimization framework for language detection in spoken dialogue systems that achieves high accuracy with efficient training and theoretical guarantees against dialectal variations under low-resource conditions. AI-generated summary Globalization and…

21
r/LocalLLaMA community 1mo ago

Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code

I guess the lawyers are sharpening their pencils already... Submitted by /u/DeltaSqueezer

26
TechCrunch — AI news-outlet 1mo ago

What happens when companies become too AI-pilled?

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
TechCrunch — AI news-outlet 1mo ago

Does your CEO have AI psychosis? Aaron Levie thinks most of them do.

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
Hugging Face Daily Papers research 1mo ago

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

Abstract Reinforcement Learning from Human Feedback (RLHF) presents alignment tampering vulnerabilities where language models can manipulate preference datasets, leading to amplified undesired behaviors due to limitations in pairwise comparisons and reward modeling. AI-generated…

17
arXiv — Machine Learning research 1mo ago

Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning

arXiv:2605.29032v1 Announce Type: new Abstract: Model-based reinforcement learning (MBRL) agents typically learn world models by minimizing predictive loss. However, powerful RL optimizers inevitably exploit minor model inaccuracies, leading to simulator exploitation and a…

31
arXiv — Machine Learning research 1mo ago

TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints

arXiv:2605.29183v1 Announce Type: new Abstract: As machine learning (ML) systems evolve to continual adaptation, each re-training cycle uses compute, annotation, and energy. We introduce TIMEGATE, a policy layer managing adaptation by budgeting time, labeling, training, and…

28
arXiv — Machine Learning research 1mo ago

Access Sets Matter: Budgeting Expert Reads for Scalable Weight-Space Model Merging

arXiv:2605.29489v1 Announce Type: new Abstract: Weight-space model merging is usually formulated as an algebraic operation on checkpoints, yet at LLM scale the limiting resource is often the set of expert weights that must be read. We introduce MergePipe, a budget-aware…

24
arXiv — NLP / Computation & Language research 1mo ago

Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation

arXiv:2605.28830v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly deployed in safety-critical applications, robust content moderation becomes essential. We present a comprehensive evaluation of 14 open-source safety guard models on a curated…

19
arXiv — NLP / Computation & Language research 1mo ago

Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG

arXiv:2605.29084v1 Announce Type: new Abstract: A retrieval-augmented generation (RAG) system deployed over a multi-author institutional corpus can give a different answer to the same question depending on which source it retrieves -- a failure mode the dominant…

24
arXiv — NLP / Computation & Language research 1mo ago

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

arXiv:2605.29224v1 Announce Type: new Abstract: AI agents augment large language models with external tools such as web retrieval, enabling grounded and up-to-date responses. However, incorporating external content into the generation pipeline can weaken the safety alignment…

15
arXiv — NLP / Computation & Language research 1mo ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

arXiv:2605.29250v1 Announce Type: new Abstract: Real-world information needs require access to structurally diverse knowledge sources, from unstructured text and relational tables to knowledge graphs and property graphs. Existing retrievers, however, operate over one source at a…

12
arXiv — NLP / Computation & Language research 1mo ago

Enhancing Factuality through Consensus and Consistency in Summarization Using Minimum Bayes Risk Decoding

arXiv:2605.29336v1 Announce Type: new Abstract: Improving the quality of model-generated summaries, especially factuality, the accuracy of a summary with respect to its source content, remains a challenge. While reranking could select the optimal output from multiple generated…

21
arXiv — NLP / Computation & Language research 1mo ago

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

arXiv:2605.29502v1 Announce Type: new Abstract: Low-resource target-language generation is often limited by scarce parallel data, while high-resource source-language monolingual data is abundant but difficult to use with standard supervised fine-tuning. We propose…

37
Hugging Face Daily Papers research 1mo ago

Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention

Abstract Larger models outperform smaller ones on complex and rare tasks due to reduced gradient interference and better resource allocation, enabling them to learn task features that smaller models miss even with infinite data. AI-generated summary Larger models learn tasks…

35
Hugging Face Daily Papers research 1mo ago

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Abstract A comprehensive framework is presented for converting bidirectional video diffusion models into real-time interactive world models with controllable, causal, and low-latency capabilities through fine-tuning and distillation techniques. AI-generated summary Recent video…

8
Hugging Face Daily Papers research 1mo ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Abstract OmniRetrieval is a framework that handles diverse knowledge sources by identifying appropriate repositories and dispatching native queries to their respective execution engines, outperforming single-source approaches across multiple dataset types. AI-generated summary…

13
Hacker News — AI on Front Page community 1mo ago

GitHub bans security researcher who posted zero-day Windows exploits

Article URL: https://www.tomshardware.com/tech-industry/cyber-security/microsofts-github-bans-security-researcher-who-posted-zero-day-windows-exploits-because-company-ruined-their-life-expert-claims-action-is-vindictive-and-promises-further-retaliation Comments URL:…

5
Ars Technica — AI news-outlet 1mo ago

Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code

Undisclosed addition in jqwik instructed AI coding agents to delete app output.

32
r/MachineLearning community 1mo ago

I built a knowledge graph + policy engine for AI agents, explainable reasoning [D]

Hey, I've been building VeritasReason — an open-source Python framework that adds a structured reasoning and provenance layer on top of LLMs and AI agents. The problem it solves: AI agents today make decisions but record nothing. When something breaks in prod, you have zero…

38
TechCrunch — AI news-outlet 1mo ago

Vertu wants CEOs to run companies from an AI foldable starting at $6,880

Built on top of the open-source Hermes project, Vertu's new foldable combines AI-agent workflows, enterprise integrations, and ultra-premium luxury finishes.

22
arXiv — Machine Learning research 1mo ago

$E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference

arXiv:2605.27428v1 Announce Type: new Abstract: Edge deployments of generative inference increasingly face two practical realities: per-device per-model performance is often unknown at deployment time, and it is non-stationary due to user-driven semantic events, background load,…

26
arXiv — Machine Learning research 1mo ago

Law of Neural Interaction: Depth-Width Shape, Interaction Efficiency, and Generalization

arXiv:2605.27989v1 Announce Type: new Abstract: The guidance of scaling laws has increased the resource demands of modern large language models (LLMs), yet it remains questionable whether these models utilize resources effectively under a fixed budget. Previous research has…

30
arXiv — Machine Learning research 1mo ago

BPPO: Binary Prefix Policy Optimization for Efficient GRPO-Style Reasoning RL with Concise Responses

arXiv:2605.28028v1 Announce Type: new Abstract: Group Relative Policy Optimization (GRPO) is widely used for training reasoning models, but updating all sampled completions in each group incurs substantial cost and can reinforce verbose reasoning trajectories. In this paper, we…

23
arXiv — Machine Learning research 1mo ago

Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization

arXiv:2605.28109v1 Announce Type: new Abstract: Recent advances in online reinforcement learning (RL) for large language models (LLMs) have demonstrated promising performance in complex reasoning tasks. However, they often exhibit an imbalanced exploration-exploitation…

22
arXiv — Machine Learning research 1mo ago

Sign-Aware Gated Sparse Autoencoders: Modeling Anticorrelated Features with Bi-Jump-ReLU Activations

arXiv:2605.28149v1 Announce Type: new Abstract: Sparse Autoencoders (SAEs) extract interpretable features from Large Language Models, but standard variants enforce non-negativity, forcing separate latents for diametrically opposed concepts (e.g., "pressure too high" vs.…

15
arXiv — NLP / Computation & Language research 1mo ago

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models

arXiv:2605.27383v1 Announce Type: new Abstract: Spoken Language Models (SLMs) have emerged as a promising paradigm for speech synthesis by bypassing explicit grapheme-to-phoneme pipelines. However, their effectiveness in low-resource languages remains fundamentally limited by…

24
arXiv — NLP / Computation & Language research 1mo ago

Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering

arXiv:2605.27636v1 Announce Type: new Abstract: Although Large Language Models (LLMs) demonstrate excellent capabilities and performance for general reasoning tasks within the general public domain, they may face challenges with culturally grounded knowledge within languages…

17
