Tag

Leak

37 articles archived under #leak · RSS

Simon Willison community 3d ago

What happened after 2,000 people tried to hack my AI assistant

What happened after 2,000 people tried to hack my AI assistant Fernando Irarrázaval ran a challenge on hackmyclaw.com to see if anyone could leak secrets held by his OpenClaw test instance by sending it email. Surprisingly, after 6,000 attempts (and $500 in token spend and a…

19
arXiv — Machine Learning research 5d ago

Speculative Decoding at Temperature Zero: A Scoped Safety-Invariance Screen with a 48,072-Sample Expansion

arXiv:2606.25097v1 Announce Type: new Abstract: Speculative decoding accelerates inference by letting a draft model propose tokens for a target model to verify, raising a concrete safety question: at temperature zero, can draft-side behavior leak into safety-scored outputs? We…

7
Hacker News — AI on Front Page community 6d ago

Meta Pauses Employee-Tracking Program Following Internal Data Leak

Article URL: https://www.wired.com/story/meta-pauses-employee-tracking-program-following-internal-security-breach/ Comments URL: https://news.ycombinator.com/item?id=48653575 Points: 214 # Comments: 139

38
arXiv — NLP / Computation & Language research 11d ago

Your Mouse and Eyes Secretly Leak Your Preference: LLM Alignment using Implicit Feedback from Users

arXiv:2606.20482v1 Announce Type: new Abstract: To align a Large Language Model (LLM), most existing methods collect explicit human feedback and train a reward model to predict the human preference based on the response text. These existing methods have two key limitations.…

7
r/LocalLLaMA community 12d ago

Leaked financial docs show OpenAI is losing billions of dollars a year

  submitted by   /u/johnnyApplePRNG [link]   [comments]

16
Ars Technica — AI news-outlet 13d ago

Leaked financial docs show OpenAI is losing billions of dollars a year

Audited accounting shows growing revenues being dwarfed by R&D, other expenses.

29
Ars Technica — AI news-outlet 13d ago

Critical Copilot vulnerability allowed hackers to seal 2FA code from users

SearchLeak exploit shows why the industry's approach to LLM security fails over and over.

4
TechCrunch — AI news-outlet 17d ago

Mistral is rumored to be raising €3B at €20B valuation

The funding round would value the company at around €20 billion (about $23.15 billion), nearly double its Series C valuation of €11.7 billion.

23
arXiv — NLP / Computation & Language research 18d ago

An End-to-End Hybrid Framework for Rumour Detection in Low-Resources Algerian Dialect

arXiv:2606.13411v1 Announce Type: new Abstract: The rapid growth of social media has intensified the spread of rumours. This issue is more challenging in the Algerian context due to the informal and code-switched nature of dialectal content, the scarcity of annotated resources,…

23
r/LocalLLaMA community 20d ago

Releasing Cohere North Mini Code

Hi folks! Jay here from Cohere. we just officially launched North Mini Code after getting some great feedback from you guys this weekend on the unreleased version. I wanted to come here and answer some of the questions you asked and provide some extra detail about the model…

13
arXiv — Machine Learning research 21d ago

Beyond Homophily: Towards Generalized Graph Reconstruction Attack and Defense

arXiv:2606.08067v1 Announce Type: new Abstract: Graph neural networks (GNNs) are widely deployed on relational data, yet they can leak sensitive or proprietary information about the training graph adjacency, e.g., social ties, transactions, and interactions. This work studies…

25
Hacker News — AI on Front Page community 21d ago

Apple reveals new AI architecture built around Google Gemini models

Article URL: https://www.macrumors.com/2026/06/08/apple-reveals-new-ai-architecture/ Comments URL: https://news.ycombinator.com/item?id=48450142 Points: 288 # Comments: 276

32
r/MachineLearning community 21d ago

Greater than 80% of researchers at CVPR are chinese. This speak volumes on the chinese nexus in research, and something needs to be done about it. [D]

There are coordinated efforts where people have favoured and jeopardised the double blind review process. No doubt out of these 80% there are great talent but we have to acknowledge that non chinese have been sobotaged and this was also reflected in the recent leaks of the…

36
llama.cpp releases dev-tools 23d ago

b9544

common/chat : fix LFM2/LFM2.5 reasoning round-trip and leak ( #24234 ) common/chat : fix LFM2 reasoning round-trip and stray leak Gate by reasoning format and whether the template supports macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled)…

30
r/LocalLLaMA community 23d ago

Cohere's unreleased coding model (early access for localllama)

Hey, Nick here from Cohere. Thanks for all the feedback on Command A+ the other week everyone. I read these threads all the time about other releases so it was fun to read one about our own :) we would like to do more of it. We actually have our first coding model we’re getting…

24
Hacker News — AI on Front Page community 24d ago

Astronauts on ISS told to shelter as repairs under way to fix air leaks

Article URL: https://www.bbc.com/news/live/c4g44ew3g1kt Comments URL: https://news.ycombinator.com/item?id=48413464 Points: 203 # Comments: 139

28
Hugging Face Daily Papers research 24d ago

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Abstract PropMe framework evaluates language model memorization by distinguishing between forced reproduction capabilities and natural propensity, using SimpleTrace for deterministic attribution and propensity-transformed metrics across open models and datasets. Generated by…

15
arXiv — NLP / Computation & Language research 25d ago

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

arXiv:2606.06286v1 Announce Type: new Abstract: Large language models can reproduce training data, but existing memorization evaluations mostly measure whether models can be forced to do so, rather than whether they do so under ordinary use. We introduce PropMe, a…

26
arXiv — NLP / Computation & Language research 26d ago

Token Rankings are Unforgeable Language Model Signatures

arXiv:2606.04459v1 Announce Type: cross Abstract: Language model parameters are known to impose unique (to each model) geometric constraints on their logit outputs, which serves as a signature that identifies the model, but also leaks the model's final layer parameters when an…

27
Hacker News — AI on Front Page community 26d ago

MacBook Neo Is So Popular That Apple Doubled Production

Article URL: https://www.macrumors.com/2026/06/03/macbook-neo-production-doubled-says-kuo/ Comments URL: https://news.ycombinator.com/item?id=48386238 Points: 217 # Comments: 216

17
arXiv — NLP / Computation & Language research 27d ago

CoEval: Ranking Language Models for Custom Tasks Without Labeled Data or Trustworthy Benchmarks

arXiv:2606.03650v1 Announce Type: new Abstract: Choosing or ranking language models for a specific application is hardest when no task-specific labeled data exists, and standard public benchmarks cannot be trusted, their items having likely leaked into pretraining, so scores…

13
arXiv — NLP / Computation & Language research 29d ago

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

arXiv:2605.30727v1 Announce Type: new Abstract: Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information from its local context. This risk is…

10
Simon Willison community 1mo ago

I think Anthropic and OpenAI have found product-market fit

Anthropic are strongly rumored to be about to have their first profitable quarter. Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff. I think this is because OpenAI and Anthropic have both found product-market…

21
llama.cpp releases dev-tools 1mo ago

b9320

TP: fix ggml context size calculation ( #22616 ) TP: fix ggml context size calculation, memory leak move split state cache back into the context revert to constant ggml context size for cgraphs increase headroom for statically allocated tensors remove obsolete include macOS/iOS:…

33
r/LocalLLaMA community 1mo ago

GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode?

I think I had GPT-5.5 leak its trace during a normal conversation, and it really reads like the caveman mode fad from a few months back. Maybe we can achieve better token efficiency by taking some high-quality thinking trace from an open model, "caveman-izing" it, and…

17
r/MachineLearning community 1mo ago

Anthropic posted a profit while xAI burned $4.2B. The AI profitability numbers finally leaked.[D]

This week basically forced everyone to stop guessing about AI margins. Three major financial reality checks hit at once: OpenAI confidentially filing their S-1, xAI’s Q1 numbers leaking via SpaceX, and Anthropic somehow posting an actual operating profit. If you are building an…

4
Hacker News — AI on Front Page community 1mo ago

CISA tries to contain data leak

Article URL: https://krebsonsecurity.com/2026/05/lawmakers-demand-answers-as-cisa-tries-to-contain-data-leak/ Comments URL: https://news.ycombinator.com/item?id=48238429 Points: 233 # Comments: 53

27
r/LocalLLaMA community 1mo ago

Latest b9274 Addresses MTP VRAM leak

B9274 I have been having an issue with MTP models unloading after a couple minutes of use. Can't figure out why. Anyways z I don't think this is relevant to that but I did observe the vram creep so hopefully this helps. server : free draft/MTP resources on sleep to fix VRAM leak…

17
llama.cpp releases dev-tools 1mo ago

b9274

server : free draft/MTP resources on sleep to fix VRAM leak ( #23461 ) The destroy() function in server_context_impl only cleaned up the main model and context (via llama_init.reset()) but did not free the speculative decoder (spec), draft context (ctx_dft), or draft model…

22
arXiv — Machine Learning research 1mo ago

TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting

arXiv:2605.18843v1 Announce Type: new Abstract: Backtesting large language models on historical events requires reasoning exclusively from information available before a specified cutoff date. Yet models routinely leak post-cutoff knowledge from pre-training into their…

37
Hacker News — AI on Front Page community 1mo ago

CISA Admin Leaked AWS GovCloud Keys on GitHub

Article URL: https://krebsonsecurity.com/2026/05/cisa-admin-leaked-aws-govcloud-keys-on-github/ Comments URL: https://news.ycombinator.com/item?id=48190454 Points: 248 # Comments: 104

13
llama.cpp releases dev-tools 1mo ago

b9142

opencl: add q5_0 and q5_1 MoE for Adreno ( #22985 ) opencl: add q5_0 moe support opencl: add q5_1 moe support opencl: avoid potential leak opencl: suppress unused var warning when building for non-Adreno Co-authored-by: Li He [email protected] macOS/iOS: macOS Apple Silicon…

35
Smol AI News news-outlet 2mo ago

Anthropic @ $30B ARR, Project GlassWing and Claude Mythos Preview — first model too dangerous to release since GPT-2

**Anthropic** strategically challenges **OpenAI** amid its upcoming IPO concerns by announcing a jump from **$19B ARR in March** to **$30B ARR in April**, highlighting a differential growth rate and higher cost efficiency. The company also revealed **Claude Mythos**, rumored as…

30
ThursdAI news-outlet 2mo ago

📅 ThursdAI - Apr 2 - Gemma 4 is the new LLama, Claude Code Leak, OpenAI raises $122B & more AI news

Listen now | From Weights & Biases: Gemma 4 w/ Omar from Deepmind, OpenAI raises $122B largest funding round, we cover the Claude Code leak with the guy who put it on Github and got >100K stars in 24 hours & more

23
The Algorithmic Bridge news-outlet 3mo ago

Anthropic Accidentally Leaked the Secret Roadmap of Claude Code

The source code of Claude Code reveals unreleased features, internal codenames, and the future of your new favorite AI product. Here's what it all means.

28
Smol AI News news-outlet 3mo ago

The Claude Code Source Leak

**Anthropic's** closed-source coding product **Claude Code** experienced a significant source leak exposing over **500k lines** of orchestration logic, including autonomous modes and memory systems, but not model weights. The leak led to rapid public reverse-engineering,…

14
Smol AI News news-outlet 3mo ago

not much happened today

**Gemini 3.1 Flash-Lite** is highlighted by **Demis Hassabis** for its speed and cost-efficiency, focusing on latency and cost per capability rather than raw performance. **NotebookLM Studio** introduces a new feature for generating immersive cinematic video overviews. Rumors…

20

What happened after 2,000 people tried to hack my AI assistant

Speculative Decoding at Temperature Zero: A Scoped Safety-Invariance Screen with a 48,072-Sample Expansion

Meta Pauses Employee-Tracking Program Following Internal Data Leak

Your Mouse and Eyes Secretly Leak Your Preference: LLM Alignment using Implicit Feedback from Users

Leaked financial docs show OpenAI is losing billions of dollars a year

Leaked financial docs show OpenAI is losing billions of dollars a year

Critical Copilot vulnerability allowed hackers to seal 2FA code from users

Mistral is rumored to be raising €3B at €20B valuation

An End-to-End Hybrid Framework for Rumour Detection in Low-Resources Algerian Dialect

Releasing Cohere North Mini Code

Beyond Homophily: Towards Generalized Graph Reconstruction Attack and Defense

Apple reveals new AI architecture built around Google Gemini models

Greater than 80% of researchers at CVPR are chinese. This speak volumes on the chinese nexus in research, and something needs to be done about it. [D]

b9544

Cohere's unreleased coding model (early access for localllama)

Astronauts on ISS told to shelter as repairs under way to fix air leaks

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Token Rankings are Unforgeable Language Model Signatures

MacBook Neo Is So Popular That Apple Doubled Production

CoEval: Ranking Language Models for Custom Tasks Without Labeled Data or Trustworthy Benchmarks

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

I think Anthropic and OpenAI have found product-market fit

b9320

GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode?

Anthropic posted a profit while xAI burned $4.2B. The AI profitability numbers finally leaked.[D]

CISA tries to contain data leak

Latest b9274 Addresses MTP VRAM leak

b9274

TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting

CISA Admin Leaked AWS GovCloud Keys on GitHub

b9142

Anthropic @ $30B ARR, Project GlassWing and Claude Mythos Preview — first model too dangerous to release since GPT-2

📅 ThursdAI - Apr 2 - Gemma 4 is the new LLama, Claude Code Leak, OpenAI raises $122B & more AI news

Anthropic Accidentally Leaked the Secret Roadmap of Claude Code

The Claude Code Source Leak

not much happened today