Tag

Security

500 articles archived under #security · RSS

Hacker News — AI on Front Page community 21d ago

Microsoft's open source tools were hacked to steal passwords of AI developers

Article URL: https://techcrunch.com/2026/06/08/microsofts-open-source-tools-were-hacked-to-steal-passwords-of-ai-developers/ Comments URL: https://news.ycombinator.com/item?id=48457830 Points: 233 # Comments: 97

25
arXiv — Machine Learning research 21d ago

Boundary Variance Inflation Causes Acquisition Bias in Gaussian Processes

arXiv:2606.07561v1 Announce Type: new Abstract: Gaussian processes with stationary kernels on bounded domains exhibit inflated posterior variance near the boundary. Despite being a long-recognized artifact in geostatistics and a source of over-exploration in Bayesian…

15
arXiv — Machine Learning research 21d ago

STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms

arXiv:2606.07565v1 Announce Type: new Abstract: Intelligent scaling of microservices in cloud platforms is crucial for mitigating escalating compute costs while avoiding service disruptions. Current solutions are limited to the univariate space, typically focusing on CPU usage…

24
arXiv — Machine Learning research 21d ago

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them

arXiv:2606.07597v1 Announce Type: new Abstract: Pre-training data mixtures are commonly tuned by running small-scale experiments and extrapolating to the target training budget. When high-quality data is scarce and must be repeated, this extrapolation frequently fails, but the…

23
arXiv — Machine Learning research 21d ago

Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits

arXiv:2606.07615v1 Announce Type: new Abstract: Deep neural networks often contain redundant hidden units. Removing individual weights can reduce parameter count, but unstructured sparsity is not always easy to exploit in standard dense implementations. This paper develops a…

6
arXiv — Machine Learning research 21d ago

Learning Transfers: Kan Extensions for Neural Invariants

arXiv:2606.07627v1 Announce Type: new Abstract: Transfer learning presumes that a representation learned on source tasks carries structure that remains usable on related target tasks. Standard evaluations probe this through target accuracy or distributional discrepancy, yet…

8
arXiv — Machine Learning research 21d ago

Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment

arXiv:2606.07632v1 Announce Type: new Abstract: Proper accounting of the energy requirements and environmental impact of artificial intelligence (AI) systems is necessary for researchers, developers, policy makers, and users to assess the barriers to building systems at scale.…

36
arXiv — Machine Learning research 21d ago

Cutting LLM Evaluation Costs with SySRs: A Bandit Algorithm that Provably Exploits Model Similarity

arXiv:2606.07726v1 Announce Type: new Abstract: Large Language Models are typically benchmarked by evaluating every model on every test query. For practitioners seeking the best model to deploy, this is often wasteful: if a model clearly performs worse than others, there is no…

13
arXiv — Machine Learning research 21d ago

When Behavioral Safety Evaluation Fails: A Representation-Level Perspective

arXiv:2606.08044v1 Announce Type: new Abstract: Large Language Model (LLM) safety has often been evaluated at the behavior level, which provides limited evidence of internal robustness, as these evaluations target outputs rather than representation-level vulnerability under…

33
arXiv — Machine Learning research 21d ago

DICE: Entropy-Regularized Equilibrium Selection for Stable Multi-Agent LLM Coordination

arXiv:2606.08068v1 Announce Type: new Abstract: Multi-agent large language model (LLM) systems often fail to reliably outperform a single strong model equipped with best-of-N sampling. We argue that a core source of this instability is ill-posed equilibrium selection: current…

34
r/LocalLLaMA community 21d ago

2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all…

Forgive the claude summary, in the readme, but the base works. I'm still working on the hip kernal and having it combine with MTP. I hope to get up near 80 tk/s. All started because I realized every Q8 (INT8 or F8) calculation was using f32 of compute and only use 1/4th the…

32
r/MachineLearning community 21d ago

How to start open source contribution [D]

hi everyone, I created a blog around how I started open source contribution, documented all minute details. Please give it a read and give review as this is my journey to do blogging for the first time. It is free! https://substack.com/home/post/p-200202050   submitted by…

25
r/MachineLearning community 21d ago

I'd like to share an updated methodology for building agents.[P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice, an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

20
Hugging Face Daily Papers research 22d ago

Streaming Video Generation with Streaming Force Control

Abstract StreamForce is a causal, unified video generation model that provides real-time, physically grounded responses to time-varying forces through a distillation pipeline and autoregressive architecture. Generated by Qwen/Qwen2.5-Coder-32B-Instruct We introduce StreamForce,…

17
r/MachineLearning community 22d ago

Open image generation models are closer to closed-source quality than this sub thinks [D]

I run evaluations on generative image models as part of my workflow, mostly comparing coherence, prompt adherence, and compositional accuracy across different architectures. The consensus here seems to be that open models are still a generation behind closed APIs. Based on my…

25
arXiv — Machine Learning research 22d ago

TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection

arXiv:2606.06742v1 Announce Type: new Abstract: TorchKM is an open-source library for kernel machines, including support vector machines, kernel logistic regression, and kernel quantile regression, with GPU acceleration. The library features a scikit-learn-style API and is…

36
arXiv — Machine Learning research 22d ago

$\alpha$-PFN: Fast Entropy Search via In-Context Learning

arXiv:2606.07134v1 Announce Type: new Abstract: Information-theoretic acquisition functions such as Entropy Search (ES) offer a principled exploration-exploitation framework for Bayesian optimization (BO). However, their practical implementation relies on complicated and slow…

8
arXiv — Machine Learning research 22d ago

Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging

arXiv:2606.07196v1 Announce Type: new Abstract: Classical sparse Type-II Bayesian methods for M/EEG brain imaging support joint estimation of source and noise hyperparameters, but rely on fixed iterative update rules. Although these updates are principled and interpretable,…

28
arXiv — Machine Learning research 22d ago

A robust PPG foundation model using multimodal physiological supervision

arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining…

8
arXiv — NLP / Computation & Language research 22d ago

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

arXiv:2606.07379v1 Announce Type: cross Abstract: A growing failure mode in agent evaluation and training is that models can achieve high evaluation scores by exploiting shortcuts instead of solving the intended task, producing deceptive performance. This makes evaluation scores…

4
arXiv — Machine Learning research 22d ago

Covariance Shrinkage via Stochastic Interpolation

arXiv:2606.07382v1 Announce Type: new Abstract: We recast classical shrinkage of high-dimensional covariance estimators as empirical risk minimization over a parametric stochastic interpolant between a source and a target distribution. This formalism recovers known shrinkage…

29
arXiv — Machine Learning research 22d ago

Making the Most of Limited Data: Score-Aware Training for Text-to-Music Generation

arXiv:2606.07387v1 Announce Type: new Abstract: State-of-the-art text-to-music generation systems rely on massive proprietary datasets and industrial-scale compute, making it impossible to disentangle architectural contributions from resource advantages. We propose…

15
arXiv — NLP / Computation & Language research 22d ago

Modular Monolingual Adaptation using Pretrained Language Models

arXiv:2606.06738v1 Announce Type: new Abstract: Building monolingual language models (LMs) for low-resource languages typically relies on adapting pretrained language models (PLMs) by finetuning the whole model on the target language. This approach is widely favored over…

12
arXiv — NLP / Computation & Language research 22d ago

SV-Detect: AI-generated Text Detection with Steering Vectors

arXiv:2606.07313v1 Announce Type: new Abstract: Detecting machine-generated text is especially difficult under distribution shift, such as transfer across domains, source models, and editing attacks. We propose a fake-text detector based on steering vectors extracted from the…

24
arXiv — NLP / Computation & Language research 22d ago

DirectAudioEdit: Inversion-Free Text-Guided Audio Editing via Diffusion Prediction Contrast

arXiv:2606.07356v1 Announce Type: cross Abstract: Text-guided audio editing aims to modify the language-specified acoustic content while preserving edit-irrelevant source components. Existing training-free methods typically rely on inversion-based editing. While inversion-free…

25
arXiv — NLP / Computation & Language research 22d ago

Mining Useful General Data for Low-Resource Domain Adaptation

arXiv:2511.07380v2 Announce Type: replace Abstract: Adapting large language models (LLMs) to low-resource domains remains challenging due to the scarcity of domain-specific data. While in-domain data is limited, there exists a vast amount of general-domain data that shares…

27
Hacker News — AI on Front Page community 22d ago

1k Data Breaches Later, the Disclosure Lag Is Worse

Article URL: https://www.troyhunt.com/1000-data-breaches-later-the-disclosure-lag-is-worse-than-ever/ Comments URL: https://news.ycombinator.com/item?id=48440952 Points: 203 # Comments: 75

38
Hugging Face official-blog 22d ago

The Open Source Community is backing OpenEnv for Agentic RL

Back to Articles The Open Source Community is backing OpenEnv for Agentic RL Published June 8, 2026 Update on GitHub Upvote 1 ben burtenshaw burtenshaw Joseph Spisak spisakjo Lysandre lysandre Davide Testuggine darktex will brown willcb Charles Frye charlesfrye Chris Wing…

37
llama.cpp releases dev-tools 22d ago

b9550

kv-cache: follow the source cache size when sharing cells ( #24267 ) A fitted target context can end up smaller than the draft default, the oversized assistant views then overflow the shared K/V tensors and trip the ggml_view_4d size assert during graph reserve. macOS/iOS: macOS…

25
r/MachineLearning community 23d ago

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

# Results Journal: Qwen3.5-35B-A3B E114 as a Generated-Register Routing Signal Date: 2026-06-06 This is an experiment-history document, not a publication claim. It states the current best evidence for the strongest positive result in the Qwen3.5-35B-A3B set, the narrow…

20
TechCrunch — AI news-outlet 23d ago

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

Even with Lockdown Mode, ChatGPT could be still vulnerable to prompt injections, but the goal is to reduce the likelihood that sensitive data gets shared in the process.

34
Hacker News — AI on Front Page community 23d ago

Ntsc-rs – open-source video emulation of analog TV and VHS artifacts

Article URL: https://ntsc.rs/ Comments URL: https://news.ycombinator.com/item?id=48428025 Points: 227 # Comments: 49

15
r/LocalLLaMA community 24d ago

dots.tts 2B🎙️ SOTA TTS from RedNote

🔗 Blog: https://rednote-hilab.github.io/dots.tts-demo/ 🔗 GitHub: https://github.com/rednote-hilab/dots.tts 🔗 Technical Report: https://arxiv.org/abs/2608.16894 dots.tts 🎙️ New open-source TTS from RedNote (Xiaohongshu) ✨ 2B parameters (Apache 2.0) ✨ Fully continuous…

16
Hacker News — AI on Front Page community 25d ago

Tracing a powerful GNSS interference source over Europe

Article URL: https://arxiv.org/abs/2606.03673 Comments URL: https://news.ycombinator.com/item?id=48409664 Points: 254 # Comments: 113

28
arXiv — Machine Learning research 25d ago

Trust, but Don't Verify: Epistemic Blind Spots in LLM Source Evaluation

arXiv:2606.05403v1 Announce Type: new Abstract: Language models increasingly act as epistemic proxies, synthesizing evidence from multiple sources to inform decisions. Whether they evaluate the quality of that evidence, or merely aggregate it based on surface presentation,…

4
arXiv — Machine Learning research 25d ago

Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data

arXiv:2606.05781v1 Announce Type: new Abstract: Deploying frontier large language models (LLMs) for domain-specific structured evaluation tasks often incurs substantial latency, cost, and data privacy overhead. We present a hybrid framework that combines a fine-tuned small…

34
arXiv — Machine Learning research 25d ago

PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis

arXiv:2606.06293v1 Announce Type: new Abstract: Whilst the vulnerability of graph neural networks (GNNs) to adversarial attacks poses a critical threat to graph representation learning, the understanding of the robust generalization behavior remains a fundamental challenge in…

4
arXiv — NLP / Computation & Language research 25d ago

Executable Schema Contracts: From Automatic Ingestion to Multi-Source Retrieval

arXiv:2606.05415v1 Announce Type: new Abstract: Real-world data spans tables, documents, and semi-structured files with implicit semantics. Querying this data requires integrating evidence across inconsistent schemas and formats, yet existing approaches either demand costly…

24
arXiv — NLP / Computation & Language research 25d ago

Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution

arXiv:2606.05486v1 Announce Type: new Abstract: Prompt ambiguity is a common source of failure in large language models, but is difficult to localize because it is a latent property of the prompt, while existing attribution methods are designed to explain observable outputs such…

22
arXiv — NLP / Computation & Language research 25d ago

QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation

arXiv:2606.05671v1 Announce Type: new Abstract: Query recommendation in e-commerce search aims to proactively suggest queries that match users' potential interests. However, existing methods mainly optimize query-level relevance, while neglecting whether the retrieved products…

10
arXiv — NLP / Computation & Language research 25d ago

The Generator-Eraser Paradox: Community Guidelines for Responsible LLM-Assisted Dialect Resource Creation

arXiv:2606.06004v1 Announce Type: new Abstract: Dialect resources occupy a unique position at the intersection of scientific description, cultural preservation, and computational infrastructure. Large language models offer powerful capabilities for accelerating dialect resource…

17
arXiv — NLP / Computation & Language research 25d ago

English-to-Prakrit Machine Translation via Multilingual Transfer Learning

arXiv:2606.06038v1 Announce Type: new Abstract: We study English-to-Prakrit machine translation in a low-resource setting where the target language is unsupported by IndicTrans2. We adapt the multilingual model by mapping Prakrit to the Hindi language tag (hin_Deva) without…

25
arXiv — NLP / Computation & Language research 25d ago

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

arXiv:2606.06242v1 Announce Type: new Abstract: Institutional documents contain substantial amounts of operational and analytical information embedded within figures and tables. Current approaches for extracting visual content from documents are largely built around generic…

9
arXiv — NLP / Computation & Language research 25d ago

A Komi-Yazva--Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

arXiv:2606.06420v1 Announce Type: new Abstract: We present the first Komi-Yazva--Russian parallel corpus together with an explicit evaluation protocol for studying LLM translation in an endangered, extremely low-resource setting. The dataset contains 457 aligned sentence pairs…

38
arXiv — NLP / Computation & Language research 25d ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

arXiv:2606.06428v1 Announce Type: new Abstract: Prior work has shown that large language models (LLMs) can translate unseen or low-resource languages by undergoing continued training or even by encoding a grammar book in their context. However, both methods typically overfit…

21
ThursdAI news-outlet 25d ago

📅 ThursdAI - Jun 4 - NVIDIA drops Nemotron 3 Ultra (550B open), Microsoft becomes a frontier lab, Ideogram 4 goes open, Agent Arena & more

From CoreWeave: This week was kind of nuts, tons of new OpenSource goodness, 3 guests on the show (Arena, Nous Research and NVIDIA) and image gen SOTA models racing to the top.

10
Vercel — AI dev-tools 25d ago

The skills.sh API is now available

The skills.sh API is now available. Authenticate with your project's Vercel OIDC token and start querying more than 600,000 skills from across the open-source ecosystem. Search for skills, pull detailed info on any one, check its security audit, and more. Vercel issues a…

17
Hacker News — AI on Front Page community 25d ago

Anthropic's open-source framework for AI-powered vulnerability discovery

Article URL: https://github.com/anthropics/defending-code-reference-harness Comments URL: https://news.ycombinator.com/item?id=48403980 Points: 215 # Comments: 73

6
Hugging Face Daily Papers research 25d ago

Large Language Models Hack Rewards, and Society

Abstract Large language models trained with reinforcement learning can exploit ambiguities in societal regulations to discover loopholes that bypass regulatory intent, posing safety risks for real-world deployment. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Reinforcement…

18
The Information — AI news-outlet 25d ago

Snowflake CIO Says He Used Layoffs to Convince Staff to Use AI

Tech firms ranging from Block to Meta Platforms have announced mass layoffs in the past year, frequently citing AI as allowing them to do more with less people. In a variation on that approach Snowflake says it used layoffs to force software engineers to use AI tools and become…

15

Microsoft's open source tools were hacked to steal passwords of AI developers

Boundary Variance Inflation Causes Acquisition Bias in Gaussian Processes

STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them

Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits

Learning Transfers: Kan Extensions for Neural Invariants

Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment

Cutting LLM Evaluation Costs with SySRs: A Bandit Algorithm that Provably Exploits Model Similarity

When Behavioral Safety Evaluation Fails: A Representation-Level Perspective

DICE: Entropy-Regularized Equilibrium Selection for Stable Multi-Agent LLM Coordination

2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all…

How to start open source contribution [D]

I'd like to share an updated methodology for building agents.[P]

Streaming Video Generation with Streaming Force Control

Open image generation models are closer to closed-source quality than this sub thinks [D]

TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection

$\alpha$-PFN: Fast Entropy Search via In-Context Learning

Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging

A robust PPG foundation model using multimodal physiological supervision

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Covariance Shrinkage via Stochastic Interpolation

Making the Most of Limited Data: Score-Aware Training for Text-to-Music Generation

Modular Monolingual Adaptation using Pretrained Language Models

SV-Detect: AI-generated Text Detection with Steering Vectors

DirectAudioEdit: Inversion-Free Text-Guided Audio Editing via Diffusion Prediction Contrast

Mining Useful General Data for Low-Resource Domain Adaptation

1k Data Breaches Later, the Disclosure Lag Is Worse

The Open Source Community is backing OpenEnv for Agentic RL

b9550

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

Ntsc-rs – open-source video emulation of analog TV and VHS artifacts

dots.tts 2B🎙️ SOTA TTS from RedNote

Tracing a powerful GNSS interference source over Europe

Trust, but Don't Verify: Epistemic Blind Spots in LLM Source Evaluation

Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data

PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis

Executable Schema Contracts: From Automatic Ingestion to Multi-Source Retrieval

Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution

QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation

The Generator-Eraser Paradox: Community Guidelines for Responsible LLM-Assisted Dialect Resource Creation

English-to-Prakrit Machine Translation via Multilingual Transfer Learning

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

A Komi-Yazva--Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

📅 ThursdAI - Jun 4 - NVIDIA drops Nemotron 3 Ultra (550B open), Microsoft becomes a frontier lab, Ideogram 4 goes open, Agent Arena & more

The skills.sh API is now available

Anthropic's open-source framework for AI-powered vulnerability discovery

Large Language Models Hack Rewards, and Society

Snowflake CIO Says He Used Layoffs to Convince Staff to Use AI