News / #security Tag Security 500 articles archived under #security · RSS Sign in to follow The Information — AI news-outlet 28d ago Anthropic’s Mythos Is a Security Powerhouse. It’s Also a Budget Buster When Palo Alto Networks earlier this year began testing Anthropic’s Claude Mythos to comb through its own source code, it didn’t take long to see the future of cybersecurity. It also didn’t take long to see what that future would cost. The model found more than two dozen… 33 r/LocalLLaMA community 29d ago Just found a 1-click RCE in pewdiepie's Odysseus Chat PR being submitted to help the project as we speak. Sound on for extra lols.   submitted by   /u/theonejvo [link]   [comments] 7 The Information — AI news-outlet 29d ago China’s MiniMax Launches New Model as Open-Source AI Coding Battle Heats Up Chinese AI developer MiniMax on Monday launched a new large language model called M3, saying the new model’s coding capability approaches that of Anthropic’s Opus 4.7, which was released in April. The new MiniMax model is particularly suitable for coding and complex multi-step… 23 Smol AI News news-outlet 29d ago not much happened today **NVIDIA** led open-source AI model releases with **Cosmos 3**, a comprehensive omnimodal world model unifying language, image, video, audio, and action using a Mixture-of-Transformers design, and **Nemotron 3 Ultra**, a **550B** parameter open-weight model noted for high… 33 The Information — AI news-outlet 29d ago US Clarifies AI Chip Export Loophole The U.S. Commerce Department issued guidance on Sunday saying Chinese companies still need licences to buy advanced US AI chips, even when they try to do it through an overseas offshoot. The move shuts a potential workaround that let Chinese firms source powerful US AI chips… 14 arXiv — Machine Learning research 29d ago Can Subgraph Explanations Be Weaponized to Steal Graph Neural Networks? arXiv:2605.30470v1 Announce Type: new Abstract: Graph Machine Learning as a Service (GMLaaS) platforms increasingly implement explainability interfaces to meet regulatory transparency requirements. However, this transparency creates exploitable vulnerabilities for model… 9 arXiv — Machine Learning research 29d ago DisasterLex: An Expert Concept-to-Schema Knowledge Graph for Geospatial Reasoning in Disaster Analytics arXiv:2605.30538v1 Announce Type: new Abstract: Disasters are inevitable and increasingly costly, and effective response depends on querying structured tabular data: precise, information-dense records of hazard, exposure, vulnerability, and lifeline infrastructure that underpin… 27 arXiv — Machine Learning research 29d ago Beyond Accuracy: Evaluating Efficiency, Robustness and Explainability in Deep Learning for Malaria Diagnosis arXiv:2605.30734v1 Announce Type: new Abstract: Malaria remains a leading cause of mortality in sub-Saharan Africa, where scarce diagnostic infrastructure makes timely, accurate diagnosis particularly challenging. While deep learning offers a compelling path toward automated… 14 arXiv — Machine Learning research 29d ago Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences arXiv:2605.30873v1 Announce Type: new Abstract: Federated Learning (FL) offers a privacy-preserving pathway for aligning Large Language Models (LLMs); however, existing frameworks typically enforce a monolithic reward model, inevitably averaging out inherently conflicting user… 35 arXiv — NLP / Computation & Language research 29d ago Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs arXiv:2605.30501v1 Announce Type: new Abstract: Watermarking embeds statistical signatures in AI-generated text for detection and attribution. We reveal a fundamental vulnerability: when users access multiple models (today's reality), watermarks trivially fail. Watermarks… 21 arXiv — NLP / Computation & Language research 29d ago ElasticMem: Latent Memory as a Learnable Resource for LLM Agents arXiv:2605.30690v1 Announce Type: new Abstract: Long-term memory is essential for LLM agents to reason coherently across extended interactions, personalize responses, and reuse past experience. However, existing memory-augmented methods typically treat memory as a fixed… 29 arXiv — NLP / Computation & Language research 29d ago Eywa: Provenance-Grounded Long-Term Memory for AI Agents arXiv:2605.30771v1 Announce Type: new Abstract: AI agents that persist across sessions need memory they can retrieve, audit, update, and erase. Existing memory systems often collapse source evidence, extracted facts, retrieved context, and answer policy into one opaque prompt… 7 arXiv — NLP / Computation & Language research 29d ago Do Large Language Models Encode Institutional Experience? Evidence from Cross-Linguistic Moral Reasoning Under Ambiguity arXiv:2605.30934v1 Announce Type: new Abstract: Large language models (LLMs) exhibit systematic differences in moral reasoning across languages, yet the source of this variation remains unclear. We test the hypothesis that languages encode aspects of the institutional… 24 arXiv — NLP / Computation & Language research 29d ago Multilingual and Cross-Lingual Citation Needed Detection on Wikipedia for Lower-Resource Languages arXiv:2605.31136v1 Announce Type: new Abstract: In automated fact-checking (AFC), check-worthiness detection identifies claims requiring verification based on domain-specific criteria. On Wikipedia, this task instantiates as Citation Needed Detection (CND), which flags claims… 34 arXiv — NLP / Computation & Language research 29d ago Learning Whom to Trust: Market-Feedback Adaptive Retrieval for Frozen LLMs in Event-Driven Financial RAG arXiv:2605.31201v1 Announce Type: new Abstract: Financial retrieval-augmented generation (RAG) systems typically rank evidence by textual relevance, but in financial markets the useful evidence source depends on event type, forecast horizon, and market context. We study… 20 arXiv — NLP / Computation & Language research 29d ago "In\^{t}elegi Rom\^ane\c{s}te?'' A Recipe for Romanian Vision-Language Models arXiv:2605.31401v1 Announce Type: new Abstract: Vision-Language Models (VLMs) largely follow the text-only LLM trajectory, excelling on English benchmarks but sharply degrading on low-resource languages, where neither large-scale image-text corpora nor culturally grounded… 23 Hugging Face Daily Papers research 29d ago From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors Abstract Multi-step trojan attacks in local LLM agents can bypass existing defenses by embedding malicious prompts across multiple operations, requiring new detection methods like DASGuard for effective protection. AI-generated summary LLM agents are evolving from conversational… 20 r/LocalLLaMA community 29d ago G7 agrees on shared language around open-source AI and open weights AI Basically stuff we already knew here, but now governments understand it too. I found the news here: https://www.phoronix.com/news/G7-On-Open-Source-AI   submitted by   /u/Kahvana [link]   [comments] 16 r/MachineLearning community 29d ago Built an AI Accelerator and opensourced it. [P] There is a huge gap in open source AI accelerators, so I implemented mine . Popular and well known ones are already legacy and doesn't support contemporary operations like Attention. Here is what makes mine special: Attention mechanism smelted directly into silicon Prototyped… 25 The Information — AI news-outlet 1mo ago SpaceX Is Awarded $4 billion Contract with U.S. Space Force The U.S. Space Force awarded a $4.16 billion contract to SpaceX as part of a program to deploy space-based sensors to track and target airborne threats. The deal for the Space-Based Airborne Moving Target Indicator program, announced Friday, highlights the increasingly important… 14 r/MachineLearning community 1mo ago Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D] Ps. Not pitching anything; Just trying to understand where reality differs from the narrative. We're a couple of ML students, mostly worked on ML/software before, but over the last few months we've been playing with VLAs, robot datasets, and trying to understand where the field… 27 r/LocalLLaMA community 1mo ago Open source : Turning vocal imitations into sound effects. (New UX for sound generation) Hello guys I want to introduce my new project! Have you ever needed a specific sound while making a video or a game? You know exactly what it sounds like in your head, but have no idea how to search for it. That’s why sound design meetings at game studios often turn into people… 12 Hugging Face Daily Papers research 1mo ago Convex Low-resource Accent-Robust Language Detection in Speech Recognition Abstract A novel convex optimization framework for language detection in spoken dialogue systems that achieves high accuracy with efficient training and theoretical guarantees against dialectal variations under low-resource conditions. AI-generated summary Globalization and… 21 r/LocalLLaMA community 1mo ago Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code I guess the lawyers are sharpening their pencils already...   submitted by   /u/DeltaSqueezer [link]   [comments] 26 TechCrunch — AI news-outlet 1mo ago What happens when companies become too AI-pilled? The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 TechCrunch — AI news-outlet 1mo ago Does your CEO have AI psychosis? Aaron Levie thinks most of them do. The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 Hugging Face Daily Papers research 1mo ago Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases Abstract Reinforcement Learning from Human Feedback (RLHF) presents alignment tampering vulnerabilities where language models can manipulate preference datasets, leading to amplified undesired behaviors due to limitations in pairwise comparisons and reward modeling. AI-generated… 17 arXiv — Machine Learning research 1mo ago Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning arXiv:2605.29032v1 Announce Type: new Abstract: Model-based reinforcement learning (MBRL) agents typically learn world models by minimizing predictive loss. However, powerful RL optimizers inevitably exploit minor model inaccuracies, leading to simulator exploitation and a… 31 arXiv — Machine Learning research 1mo ago TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints arXiv:2605.29183v1 Announce Type: new Abstract: As machine learning(ML) systems evolve to continual adaptation, each re-training cycle uses compute, annotation, and energy. We introduce TIMEGATE, a policy layer managing adaptation by budgeting time, labeling, training, and… 28 arXiv — Machine Learning research 1mo ago Access Sets Matter: Budgeting Expert Reads for Scalable Weight-Space Model Merging arXiv:2605.29489v1 Announce Type: new Abstract: Weight-space model merging is usually formulated as an algebraic operation on checkpoints, yet at LLM scale the limiting resource is often the set of expert weights that must be read. We introduce MergePipe, a budget-aware… 24 arXiv — NLP / Computation & Language research 1mo ago Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation arXiv:2605.28830v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly deployed in safety-critical applications, robust content moderation becomes essential. We present a comprehensive evaluation of 14 open-source safety guard models on a curated… 19 arXiv — NLP / Computation & Language research 1mo ago Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG arXiv:2605.29084v1 Announce Type: new Abstract: A retrieval-augmented generation (RAG) system deployed over a multi-author institutional corpus can give a different answer to the same question depending on which source it retrieves -- a failure mode the dominant… 24 arXiv — NLP / Computation & Language research 1mo ago Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents arXiv:2605.29224v1 Announce Type: new Abstract: AI agents augment large language models with external tools such as web retrieval, enabling grounded and up-to-date responses. However, incorporating external content into the generation pipeline can weaken the safety alignment… 15 arXiv — NLP / Computation & Language research 1mo ago OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources arXiv:2605.29250v1 Announce Type: new Abstract: Real-world information needs require access to structurally diverse knowledge sources, from unstructured text and relational tables to knowledge graphs and property graphs. Existing retrievers, however, operate over one source at a… 12 arXiv — NLP / Computation & Language research 1mo ago Enhancing Factuality through Consensus and Consistency in Summarization Using Minimum Bayes Risk Decoding arXiv:2605.29336v1 Announce Type: new Abstract: Improving the quality of model-generated summaries, especially factuality, the accuracy of a summary with respect to its source content, remains a challenge. While reranking could select the optimal output from multiple generated… 21 arXiv — NLP / Computation & Language research 1mo ago Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation arXiv:2605.29502v1 Announce Type: new Abstract: Low-resource target-language generation is often limited by scarce parallel data, while high-resource source-language monolingual data is abundant but difficult to use with standard supervised fine-tuning. We propose… 37 Hugging Face Daily Papers research 1mo ago Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention Abstract Larger models outperform smaller ones on complex and rare tasks due to reduced gradient interference and better resource allocation, enabling them to learn task features that smaller models miss even with infinite data. AI-generated summary Larger models learn tasks… 35 Hugging Face Daily Papers research 1mo ago minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Abstract A comprehensive framework is presented for converting bidirectional video diffusion models into real-time interactive world models with controllable, causal, and low-latency capabilities through fine-tuning and distillation techniques. AI-generated summary Recent video… 8 Hugging Face Daily Papers research 1mo ago OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Abstract OmniRetrieval is a framework that handles diverse knowledge sources by identifying appropriate repositories and dispatching native queries to their respective execution engines, outperforming single-source approaches across multiple dataset types. AI-generated summary… 13 Hacker News — AI on Front Page community 1mo ago GitHub bans security researcher who posted zero-day Windows exploits Article URL: https://www.tomshardware.com/tech-industry/cyber-security/microsofts-github-bans-security-researcher-who-posted-zero-day-windows-exploits-because-company-ruined-their-life-expert-claims-action-is-vindictive-and-promises-further-retaliation Comments URL:… 5 Ars Technica — AI news-outlet 1mo ago Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code Undisclosed addition in jqwik instructed AI coding agents to delete app output. 32 r/MachineLearning community 1mo ago I built a knowledge graph + policy engine for AI agents , explainable reasoning [D] Hey , I've been building VeritasReason — an open-source Python framework that adds a structured reasoning and provenance layer on top of LLMs and AI agents. The problem it solves: AI agents today make decisions but record nothing. When something breaks in prod, you have zero… 38 TechCrunch — AI news-outlet 1mo ago Vertu wants CEOs to run companies from an AI foldable starting at $6,880 Built on top of the open-source Hermes project, Vertu's new foldable combines AI-agent workflows, enterprise integrations, and ultra-premium luxury finishes. 22 arXiv — Machine Learning research 1mo ago $E^3$-Agent: An Executable and Evolving Agent for Resource Management of Edge Generative Inference arXiv:2605.27428v1 Announce Type: new Abstract: Edge deployments of generative inference increasingly face two practical realities: per-device per-model performance is often unknown at deployment time, and it is non-stationary due to user-driven semantic events, background load,… 26 arXiv — Machine Learning research 1mo ago Law of Neural Interaction: Depth-Width Shape, Interaction Efficiency, and Generalization arXiv:2605.27989v1 Announce Type: new Abstract: The guidance of scaling laws has increased the resource demands of modern large language models (LLMs), yet it remains questionable whether these models utilize resources effectively under a fixed budget. Previous research has… 30 arXiv — Machine Learning research 1mo ago BPPO: Binary Prefix Policy Optimization for Efficient GRPO-Style Reasoning RL with Concise Responses arXiv:2605.28028v1 Announce Type: new Abstract: Group Relative Policy Optimization (GRPO) is widely used for training reasoning models, but updating all sampled completions in each group incurs substantial cost and can reinforce verbose reasoning trajectories. In this paper, we… 23 arXiv — Machine Learning research 1mo ago Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization arXiv:2605.28109v1 Announce Type: new Abstract: Recent advances in online reinforcement learning (RL) for large language models (LLMs) have demonstrated promising performance in complex reasoning tasks. However, they often exhibit an imbalanced exploration-exploitation… 22 arXiv — Machine Learning research 1mo ago Sign-Aware Gated Sparse Autoencoders: Modeling Anticorrelated Features with Bi-Jump-ReLU Activations arXiv:2605.28149v1 Announce Type: new Abstract: Sparse Autoencoders (SAEs) extract interpretable features from Large Language Models, but standard variants enforce non-negativity, forcing separate latents for diametrically opposed concepts (e.g., "pressure too high" vs.… 15 arXiv — NLP / Computation & Language research 1mo ago Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models arXiv:2605.27383v1 Announce Type: new Abstract: Spoken Language Models (SLMs) have emerged as a promising paradigm for speech synthesis by bypassing explicit grapheme-to-phoneme pipelines. However, their effectiveness in low-resource languages remains fundamentally limited by… 24 arXiv — NLP / Computation & Language research 1mo ago Simorgh at SemEval-2026 task 7: Region-Aware Hybrid Retrieval for Low-Resource Cultural Reasoning in Multilingual Question Answering arXiv:2605.27636v1 Announce Type: new Abstract: Although Large Language Models (LLMs) demonstrate excellent capabilities and performance for general reasoning tasks within the general public domain, they may face challenges with culturally grounded knowledge within languages… 17 Page 8 of 10 · 500 articles ← Newer Older →