News / #leak Tag Leak 37 articles archived under #leak · RSS Sign in to follow Simon Willison community 3d ago What happened after 2,000 people tried to hack my AI assistant What happened after 2,000 people tried to hack my AI assistant Fernando Irarrázaval ran a challenge on hackmyclaw.com to see if anyone could leak secrets held by his OpenClaw test instance by sending it email. Surprisingly, after 6,000 attempts (and $500 in token spend and a… 19 arXiv — Machine Learning research 5d ago Speculative Decoding at Temperature Zero: A Scoped Safety-Invariance Screen with a 48,072-Sample Expansion arXiv:2606.25097v1 Announce Type: new Abstract: Speculative decoding accelerates inference by letting a draft model propose tokens for a target model to verify, raising a concrete safety question: at temperature zero, can draft-side behavior leak into safety-scored outputs? We… 7 Hacker News — AI on Front Page community 6d ago Meta Pauses Employee-Tracking Program Following Internal Data Leak Article URL: https://www.wired.com/story/meta-pauses-employee-tracking-program-following-internal-security-breach/ Comments URL: https://news.ycombinator.com/item?id=48653575 Points: 214 # Comments: 139 38 arXiv — NLP / Computation & Language research 11d ago Your Mouse and Eyes Secretly Leak Your Preference: LLM Alignment using Implicit Feedback from Users arXiv:2606.20482v1 Announce Type: new Abstract: To align a Large Language Model (LLM), most existing methods collect explicit human feedback and train a reward model to predict the human preference based on the response text. These existing methods have two key limitations.… 7 r/LocalLLaMA community 12d ago Leaked financial docs show OpenAI is losing billions of dollars a year   submitted by   /u/johnnyApplePRNG [link]   [comments] 16 Ars Technica — AI news-outlet 13d ago Leaked financial docs show OpenAI is losing billions of dollars a year Audited accounting shows growing revenues being dwarfed by R&D, other expenses. 29 Ars Technica — AI news-outlet 13d ago Critical Copilot vulnerability allowed hackers to seal 2FA code from users SearchLeak exploit shows why the industry's approach to LLM security fails over and over. 4 TechCrunch — AI news-outlet 17d ago Mistral is rumored to be raising €3B at €20B valuation The funding round would value the company at around €20 billion (about $23.15 billion), nearly double its Series C valuation of €11.7 billion. 23 arXiv — NLP / Computation & Language research 18d ago An End-to-End Hybrid Framework for Rumour Detection in Low-Resources Algerian Dialect arXiv:2606.13411v1 Announce Type: new Abstract: The rapid growth of social media has intensified the spread of rumours. This issue is more challenging in the Algerian context due to the informal and code-switched nature of dialectal content, the scarcity of annotated resources,… 23 r/LocalLLaMA community 20d ago Releasing Cohere North Mini Code Hi folks! Jay here from Cohere. we just officially launched North Mini Code after getting some great feedback from you guys this weekend on the unreleased version. I wanted to come here and answer some of the questions you asked and provide some extra detail about the model… 13 arXiv — Machine Learning research 21d ago Beyond Homophily: Towards Generalized Graph Reconstruction Attack and Defense arXiv:2606.08067v1 Announce Type: new Abstract: Graph neural networks (GNNs) are widely deployed on relational data, yet they can leak sensitive or proprietary information about the training graph adjacency, e.g., social ties, transactions, and interactions. This work studies… 25 Hacker News — AI on Front Page community 21d ago Apple reveals new AI architecture built around Google Gemini models Article URL: https://www.macrumors.com/2026/06/08/apple-reveals-new-ai-architecture/ Comments URL: https://news.ycombinator.com/item?id=48450142 Points: 288 # Comments: 276 32 r/MachineLearning community 21d ago Greater than 80% of researchers at CVPR are chinese. This speak volumes on the chinese nexus in research, and something needs to be done about it. [D] There are coordinated efforts where people have favoured and jeopardised the double blind review process. No doubt out of these 80% there are great talent but we have to acknowledge that non chinese have been sobotaged and this was also reflected in the recent leaks of the… 36 llama.cpp releases dev-tools 23d ago b9544 common/chat : fix LFM2/LFM2.5 reasoning round-trip and leak ( #24234 ) common/chat : fix LFM2 reasoning round-trip and stray leak Gate by reasoning format and whether the template supports macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled)… 30 r/LocalLLaMA community 23d ago Cohere's unreleased coding model (early access for localllama) Hey, Nick here from Cohere. Thanks for all the feedback on Command A+ the other week everyone. I read these threads all the time about other releases so it was fun to read one about our own :) we would like to do more of it. We actually have our first coding model we’re getting… 24 Hacker News — AI on Front Page community 24d ago Astronauts on ISS told to shelter as repairs under way to fix air leaks Article URL: https://www.bbc.com/news/live/c4g44ew3g1kt Comments URL: https://news.ycombinator.com/item?id=48413464 Points: 203 # Comments: 139 28 Hugging Face Daily Papers research 24d ago LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs Abstract PropMe framework evaluates language model memorization by distinguishing between forced reproduction capabilities and natural propensity, using SimpleTrace for deterministic attribution and propensity-transformed metrics across open models and datasets. Generated by… 15 arXiv — NLP / Computation & Language research 25d ago LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs arXiv:2606.06286v1 Announce Type: new Abstract: Large language models can reproduce training data, but existing memorization evaluations mostly measure whether models can be forced to do so, rather than whether they do so under ordinary use. We introduce PropMe, a… 26 arXiv — NLP / Computation & Language research 26d ago Token Rankings are Unforgeable Language Model Signatures arXiv:2606.04459v1 Announce Type: cross Abstract: Language model parameters are known to impose unique (to each model) geometric constraints on their logit outputs, which serves as a signature that identifies the model, but also leaks the model's final layer parameters when an… 27 Hacker News — AI on Front Page community 26d ago MacBook Neo Is So Popular That Apple Doubled Production Article URL: https://www.macrumors.com/2026/06/03/macbook-neo-production-doubled-says-kuo/ Comments URL: https://news.ycombinator.com/item?id=48386238 Points: 217 # Comments: 216 17 arXiv — NLP / Computation & Language research 27d ago CoEval: Ranking Language Models for Custom Tasks Without Labeled Data or Trustworthy Benchmarks arXiv:2606.03650v1 Announce Type: new Abstract: Choosing or ranking language models for a specific application is hardest when no task-specific labeled data exists, and standard public benchmarks cannot be trusted, their items having likely leaked into pretraining, so scores… 13 arXiv — NLP / Computation & Language research 29d ago MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents arXiv:2605.30727v1 Announce Type: new Abstract: Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information from its local context. This risk is… 10 Simon Willison community 1mo ago I think Anthropic and OpenAI have found product-market fit Anthropic are strongly rumored to be about to have their first profitable quarter. Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff. I think this is because OpenAI and Anthropic have both found product-market… 21 llama.cpp releases dev-tools 1mo ago b9320 TP: fix ggml context size calculation ( #22616 ) TP: fix ggml context size calculation, memory leak move split state cache back into the context revert to constant ggml context size for cgraphs increase headroom for statically allocated tensors remove obsolete include macOS/iOS:… 33 r/LocalLLaMA community 1mo ago GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode? I think I had GPT-5.5 leak its trace during a normal conversation, and it really reads like the caveman mode fad from a few months back. Maybe we can achieve better token efficiency by taking some high-quality thinking trace from an open model, "caveman-izing" it, and… 17 r/MachineLearning community 1mo ago Anthropic posted a profit while xAI burned $4.2B. The AI profitability numbers finally leaked.[D] This week basically forced everyone to stop guessing about AI margins. Three major financial reality checks hit at once: OpenAI confidentially filing their S-1, xAI’s Q1 numbers leaking via SpaceX, and Anthropic somehow posting an actual operating profit. If you are building an… 4 Hacker News — AI on Front Page community 1mo ago CISA tries to contain data leak Article URL: https://krebsonsecurity.com/2026/05/lawmakers-demand-answers-as-cisa-tries-to-contain-data-leak/ Comments URL: https://news.ycombinator.com/item?id=48238429 Points: 233 # Comments: 53 27 r/LocalLLaMA community 1mo ago Latest b9274 Addresses MTP VRAM leak B9274 I have been having an issue with MTP models unloading after a couple minutes of use. Can't figure out why. Anyways z I don't think this is relevant to that but I did observe the vram creep so hopefully this helps. server : free draft/MTP resources on sleep to fix VRAM leak… 17 llama.cpp releases dev-tools 1mo ago b9274 server : free draft/MTP resources on sleep to fix VRAM leak ( #23461 ) The destroy() function in server_context_impl only cleaned up the main model and context (via llama_init.reset()) but did not free the speculative decoder (spec), draft context (ctx_dft), or draft model… 22 arXiv — Machine Learning research 1mo ago TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting arXiv:2605.18843v1 Announce Type: new Abstract: Backtesting large language models on historical events requires reasoning exclusively from information available before a specified cutoff date. Yet models routinely leak post-cutoff knowledge from pre-training into their… 37 Hacker News — AI on Front Page community 1mo ago CISA Admin Leaked AWS GovCloud Keys on GitHub Article URL: https://krebsonsecurity.com/2026/05/cisa-admin-leaked-aws-govcloud-keys-on-github/ Comments URL: https://news.ycombinator.com/item?id=48190454 Points: 248 # Comments: 104 13 llama.cpp releases dev-tools 1mo ago b9142 opencl: add q5_0 and q5_1 MoE for Adreno ( #22985 ) opencl: add q5_0 moe support opencl: add q5_1 moe support opencl: avoid potential leak opencl: suppress unused var warning when building for non-Adreno Co-authored-by: Li He [email protected] macOS/iOS: macOS Apple Silicon… 35 Smol AI News news-outlet 2mo ago Anthropic @ $30B ARR, Project GlassWing and Claude Mythos Preview — first model too dangerous to release since GPT-2 **Anthropic** strategically challenges **OpenAI** amid its upcoming IPO concerns by announcing a jump from **$19B ARR in March** to **$30B ARR in April**, highlighting a differential growth rate and higher cost efficiency. The company also revealed **Claude Mythos**, rumored as… 30 ThursdAI news-outlet 2mo ago 📅 ThursdAI - Apr 2 - Gemma 4 is the new LLama, Claude Code Leak, OpenAI raises $122B & more AI news Listen now | From Weights & Biases: Gemma 4 w/ Omar from Deepmind, OpenAI raises $122B largest funding round, we cover the Claude Code leak with the guy who put it on Github and got >100K stars in 24 hours & more 23 The Algorithmic Bridge news-outlet 3mo ago Anthropic Accidentally Leaked the Secret Roadmap of Claude Code The source code of Claude Code reveals unreleased features, internal codenames, and the future of your new favorite AI product. Here's what it all means. 28 Smol AI News news-outlet 3mo ago The Claude Code Source Leak **Anthropic's** closed-source coding product **Claude Code** experienced a significant source leak exposing over **500k lines** of orchestration logic, including autonomous modes and memory systems, but not model weights. The leak led to rapid public reverse-engineering,… 14 Smol AI News news-outlet 3mo ago not much happened today **Gemini 3.1 Flash-Lite** is highlighted by **Demis Hassabis** for its speed and cost-efficiency, focusing on latency and cost per capability rather than raw performance. **NotebookLM Studio** introduces a new feature for generating immersive cinematic video overviews. Rumors… 20