News / #security Tag Security 500 articles archived under #security · RSS Sign in to follow Hacker News — AI on Front Page community 21d ago Microsoft's open source tools were hacked to steal passwords of AI developers Article URL: https://techcrunch.com/2026/06/08/microsofts-open-source-tools-were-hacked-to-steal-passwords-of-ai-developers/ Comments URL: https://news.ycombinator.com/item?id=48457830 Points: 233 # Comments: 97 25 arXiv — Machine Learning research 21d ago Boundary Variance Inflation Causes Acquisition Bias in Gaussian Processes arXiv:2606.07561v1 Announce Type: new Abstract: Gaussian processes with stationary kernels on bounded domains exhibit inflated posterior variance near the boundary. Despite being a long-recognized artifact in geostatistics and a source of over-exploration in Bayesian… 15 arXiv — Machine Learning research 21d ago STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms arXiv:2606.07565v1 Announce Type: new Abstract: Intelligent scaling of microservices in cloud platforms is crucial for mitigating escalating compute costs while avoiding service disruptions. Current solutions are limited to the univariate space, typically focusing on CPU usage… 24 arXiv — Machine Learning research 21d ago Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them arXiv:2606.07597v1 Announce Type: new Abstract: Pre-training data mixtures are commonly tuned by running small-scale experiments and extrapolating to the target training budget. When high-quality data is scarce and must be repeated, this extrapolation frequently fails, but the… 23 arXiv — Machine Learning research 21d ago Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits arXiv:2606.07615v1 Announce Type: new Abstract: Deep neural networks often contain redundant hidden units. Removing individual weights can reduce parameter count, but unstructured sparsity is not always easy to exploit in standard dense implementations. This paper develops a… 6 arXiv — Machine Learning research 21d ago Learning Transfers: Kan Extensions for Neural Invariants arXiv:2606.07627v1 Announce Type: new Abstract: Transfer learning presumes that a representation learned on source tasks carries structure that remains usable on related target tasks. Standard evaluations probe this through target accuracy or distributional discrepancy, yet… 8 arXiv — Machine Learning research 21d ago Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment arXiv:2606.07632v1 Announce Type: new Abstract: Proper accounting of the energy requirements and environmental impact of artificial intelligence (AI) systems is necessary for researchers, developers, policy makers, and users to assess the barriers to building systems at scale.… 36 arXiv — Machine Learning research 21d ago Cutting LLM Evaluation Costs with SySRs: A Bandit Algorithm that Provably Exploits Model Similarity arXiv:2606.07726v1 Announce Type: new Abstract: Large Language Models are typically benchmarked by evaluating every model on every test query. For practitioners seeking the best model to deploy, this is often wasteful: if a model clearly performs worse than others, there is no… 13 arXiv — Machine Learning research 21d ago When Behavioral Safety Evaluation Fails: A Representation-Level Perspective arXiv:2606.08044v1 Announce Type: new Abstract: Large Language Model (LLM) safety has often been evaluated at the behavior level, which provides limited evidence of internal robustness, as these evaluations target outputs rather than representation-level vulnerability under… 33 arXiv — Machine Learning research 21d ago DICE: Entropy-Regularized Equilibrium Selection for Stable Multi-Agent LLM Coordination arXiv:2606.08068v1 Announce Type: new Abstract: Multi-agent large language model (LLM) systems often fail to reliably outperform a single strong model equipped with best-of-N sampling. We argue that a core source of this instability is ill-posed equilibrium selection: current… 34 r/LocalLLaMA community 21d ago 2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all… Forgive the claude summary, in the readme, but the base works. I'm still working on the hip kernal and having it combine with MTP. I hope to get up near 80 tk/s. All started because I realized every Q8 (INT8 or F8) calculation was using f32 of compute and only use 1/4th the… 32 r/MachineLearning community 21d ago How to start open source contribution [D] hi everyone, I created a blog around how I started open source contribution, documented all minute details. Please give it a read and give review as this is my journey to do blogging for the first time. It is free! https://substack.com/home/post/p-200202050   submitted by… 25 r/MachineLearning community 21d ago I'd like to share an updated methodology for building agents.[P] Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice, an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're… 20 Hugging Face Daily Papers research 22d ago Streaming Video Generation with Streaming Force Control Abstract StreamForce is a causal, unified video generation model that provides real-time, physically grounded responses to time-varying forces through a distillation pipeline and autoregressive architecture. Generated by Qwen/Qwen2.5-Coder-32B-Instruct We introduce StreamForce,… 17 r/MachineLearning community 22d ago Open image generation models are closer to closed-source quality than this sub thinks [D] I run evaluations on generative image models as part of my workflow, mostly comparing coherence, prompt adherence, and compositional accuracy across different architectures. The consensus here seems to be that open models are still a generation behind closed APIs. Based on my… 25 arXiv — Machine Learning research 22d ago TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection arXiv:2606.06742v1 Announce Type: new Abstract: TorchKM is an open-source library for kernel machines, including support vector machines, kernel logistic regression, and kernel quantile regression, with GPU acceleration. The library features a scikit-learn-style API and is… 36 arXiv — Machine Learning research 22d ago $\alpha$-PFN: Fast Entropy Search via In-Context Learning arXiv:2606.07134v1 Announce Type: new Abstract: Information-theoretic acquisition functions such as Entropy Search (ES) offer a principled exploration-exploitation framework for Bayesian optimization (BO). However, their practical implementation relies on complicated and slow… 8 arXiv — Machine Learning research 22d ago Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging arXiv:2606.07196v1 Announce Type: new Abstract: Classical sparse Type-II Bayesian methods for M/EEG brain imaging support joint estimation of source and noise hyperparameters, but rely on fixed iterative update rules. Although these updates are principled and interpretable,… 28 arXiv — Machine Learning research 22d ago A robust PPG foundation model using multimodal physiological supervision arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining… 8 arXiv — NLP / Computation & Language research 22d ago Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests arXiv:2606.07379v1 Announce Type: cross Abstract: A growing failure mode in agent evaluation and training is that models can achieve high evaluation scores by exploiting shortcuts instead of solving the intended task, producing deceptive performance. This makes evaluation scores… 4 arXiv — Machine Learning research 22d ago Covariance Shrinkage via Stochastic Interpolation arXiv:2606.07382v1 Announce Type: new Abstract: We recast classical shrinkage of high-dimensional covariance estimators as empirical risk minimization over a parametric stochastic interpolant between a source and a target distribution. This formalism recovers known shrinkage… 29 arXiv — Machine Learning research 22d ago Making the Most of Limited Data: Score-Aware Training for Text-to-Music Generation arXiv:2606.07387v1 Announce Type: new Abstract: State-of-the-art text-to-music generation systems rely on massive proprietary datasets and industrial-scale compute, making it impossible to disentangle architectural contributions from resource advantages. We propose… 15 arXiv — NLP / Computation & Language research 22d ago Modular Monolingual Adaptation using Pretrained Language Models arXiv:2606.06738v1 Announce Type: new Abstract: Building monolingual language models (LMs) for low-resource languages typically relies on adapting pretrained language models (PLMs) by finetuning the whole model on the target language. This approach is widely favored over… 12 arXiv — NLP / Computation & Language research 22d ago SV-Detect: AI-generated Text Detection with Steering Vectors arXiv:2606.07313v1 Announce Type: new Abstract: Detecting machine-generated text is especially difficult under distribution shift, such as transfer across domains, source models, and editing attacks. We propose a fake-text detector based on steering vectors extracted from the… 24 arXiv — NLP / Computation & Language research 22d ago DirectAudioEdit: Inversion-Free Text-Guided Audio Editing via Diffusion Prediction Contrast arXiv:2606.07356v1 Announce Type: cross Abstract: Text-guided audio editing aims to modify the language-specified acoustic content while preserving edit-irrelevant source components. Existing training-free methods typically rely on inversion-based editing. While inversion-free… 25 arXiv — NLP / Computation & Language research 22d ago Mining Useful General Data for Low-Resource Domain Adaptation arXiv:2511.07380v2 Announce Type: replace Abstract: Adapting large language models (LLMs) to low-resource domains remains challenging due to the scarcity of domain-specific data. While in-domain data is limited, there exists a vast amount of general-domain data that shares… 27 Hacker News — AI on Front Page community 22d ago 1k Data Breaches Later, the Disclosure Lag Is Worse Article URL: https://www.troyhunt.com/1000-data-breaches-later-the-disclosure-lag-is-worse-than-ever/ Comments URL: https://news.ycombinator.com/item?id=48440952 Points: 203 # Comments: 75 38 Hugging Face official-blog 22d ago The Open Source Community is backing OpenEnv for Agentic RL Back to Articles The Open Source Community is backing OpenEnv for Agentic RL Published June 8, 2026 Update on GitHub Upvote 1 ben burtenshaw burtenshaw Joseph Spisak spisakjo Lysandre lysandre Davide Testuggine darktex will brown willcb Charles Frye charlesfrye Chris Wing… 37 llama.cpp releases dev-tools 22d ago b9550 kv-cache: follow the source cache size when sharing cells ( #24267 ) A fitted target context can end up smaller than the draft default, the oversized assistant views then overflow the shared K/V tensors and trip the ggml_view_4d size assert during graph reserve. macOS/iOS: macOS… 25 r/MachineLearning community 23d ago Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D] # Results Journal: Qwen3.5-35B-A3B E114 as a Generated-Register Routing Signal Date: 2026-06-06 This is an experiment-history document, not a publication claim. It states the current best evidence for the strongest positive result in the Qwen3.5-35B-A3B set, the narrow… 20 TechCrunch — AI news-outlet 23d ago OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks Even with Lockdown Mode, ChatGPT could be still vulnerable to prompt injections, but the goal is to reduce the likelihood that sensitive data gets shared in the process. 34 Hacker News — AI on Front Page community 23d ago Ntsc-rs – open-source video emulation of analog TV and VHS artifacts Article URL: https://ntsc.rs/ Comments URL: https://news.ycombinator.com/item?id=48428025 Points: 227 # Comments: 49 15 r/LocalLLaMA community 24d ago dots.tts 2B🎙️ SOTA TTS from RedNote 🔗 Blog: https://rednote-hilab.github.io/dots.tts-demo/ 🔗 GitHub: https://github.com/rednote-hilab/dots.tts 🔗 Technical Report: https://arxiv.org/abs/2608.16894 dots.tts 🎙️ New open-source TTS from RedNote (Xiaohongshu) ✨ 2B parameters (Apache 2.0) ✨ Fully continuous… 16 Hacker News — AI on Front Page community 25d ago Tracing a powerful GNSS interference source over Europe Article URL: https://arxiv.org/abs/2606.03673 Comments URL: https://news.ycombinator.com/item?id=48409664 Points: 254 # Comments: 113 28 arXiv — Machine Learning research 25d ago Trust, but Don't Verify: Epistemic Blind Spots in LLM Source Evaluation arXiv:2606.05403v1 Announce Type: new Abstract: Language models increasingly act as epistemic proxies, synthesizing evidence from multiple sources to inform decisions. Whether they evaluate the quality of that evidence, or merely aggregate it based on surface presentation,… 4 arXiv — Machine Learning research 25d ago Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data arXiv:2606.05781v1 Announce Type: new Abstract: Deploying frontier large language models (LLMs) for domain-specific structured evaluation tasks often incurs substantial latency, cost, and data privacy overhead. We present a hybrid framework that combines a fine-tuned small… 34 arXiv — Machine Learning research 25d ago PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis arXiv:2606.06293v1 Announce Type: new Abstract: Whilst the vulnerability of graph neural networks (GNNs) to adversarial attacks poses a critical threat to graph representation learning, the understanding of the robust generalization behavior remains a fundamental challenge in… 4 arXiv — NLP / Computation & Language research 25d ago Executable Schema Contracts: From Automatic Ingestion to Multi-Source Retrieval arXiv:2606.05415v1 Announce Type: new Abstract: Real-world data spans tables, documents, and semi-structured files with implicit semantics. Querying this data requires integrating evidence across inconsistent schemas and formats, yet existing approaches either demand costly… 24 arXiv — NLP / Computation & Language research 25d ago Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution arXiv:2606.05486v1 Announce Type: new Abstract: Prompt ambiguity is a common source of failure in large language models, but is difficult to localize because it is a latent property of the prompt, while existing attribution methods are designed to explain observable outputs such… 22 arXiv — NLP / Computation & Language research 25d ago QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation arXiv:2606.05671v1 Announce Type: new Abstract: Query recommendation in e-commerce search aims to proactively suggest queries that match users' potential interests. However, existing methods mainly optimize query-level relevance, while neglecting whether the retrieved products… 10 arXiv — NLP / Computation & Language research 25d ago The Generator-Eraser Paradox: Community Guidelines for Responsible LLM-Assisted Dialect Resource Creation arXiv:2606.06004v1 Announce Type: new Abstract: Dialect resources occupy a unique position at the intersection of scientific description, cultural preservation, and computational infrastructure. Large language models offer powerful capabilities for accelerating dialect resource… 17 arXiv — NLP / Computation & Language research 25d ago English-to-Prakrit Machine Translation via Multilingual Transfer Learning arXiv:2606.06038v1 Announce Type: new Abstract: We study English-to-Prakrit machine translation in a low-resource setting where the target language is unsupported by IndicTrans2. We adapt the multilingual model by mapping Prakrit to the Hindi language tag (hin_Deva) without… 25 arXiv — NLP / Computation & Language research 25d ago Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents arXiv:2606.06242v1 Announce Type: new Abstract: Institutional documents contain substantial amounts of operational and analytical information embedded within figures and tables. Current approaches for extracting visual content from documents are largely built around generic… 9 arXiv — NLP / Computation & Language research 25d ago A Komi-Yazva--Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation arXiv:2606.06420v1 Announce Type: new Abstract: We present the first Komi-Yazva--Russian parallel corpus together with an explicit evaluation protocol for studying LLM translation in an endangered, extremely low-resource setting. The dataset contains 457 aligned sentence pairs… 38 arXiv — NLP / Computation & Language research 25d ago Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation arXiv:2606.06428v1 Announce Type: new Abstract: Prior work has shown that large language models (LLMs) can translate unseen or low-resource languages by undergoing continued training or even by encoding a grammar book in their context. However, both methods typically overfit… 21 ThursdAI news-outlet 25d ago 📅 ThursdAI - Jun 4 - NVIDIA drops Nemotron 3 Ultra (550B open), Microsoft becomes a frontier lab, Ideogram 4 goes open, Agent Arena & more From CoreWeave: This week was kind of nuts, tons of new OpenSource goodness, 3 guests on the show (Arena, Nous Research and NVIDIA) and image gen SOTA models racing to the top. 10 Vercel — AI dev-tools 25d ago The skills.sh API is now available The skills.sh API is now available. Authenticate with your project's Vercel OIDC token and start querying more than 600,000 skills from across the open-source ecosystem. Search for skills, pull detailed info on any one, check its security audit, and more. Vercel issues a… 17 Hacker News — AI on Front Page community 25d ago Anthropic's open-source framework for AI-powered vulnerability discovery Article URL: https://github.com/anthropics/defending-code-reference-harness Comments URL: https://news.ycombinator.com/item?id=48403980 Points: 215 # Comments: 73 6 Hugging Face Daily Papers research 25d ago Large Language Models Hack Rewards, and Society Abstract Large language models trained with reinforcement learning can exploit ambiguities in societal regulations to discover loopholes that bypass regulatory intent, posing safety risks for real-world deployment. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Reinforcement… 18 The Information — AI news-outlet 25d ago Snowflake CIO Says He Used Layoffs to Convince Staff to Use AI Tech firms ranging from Block to Meta Platforms have announced mass layoffs in the past year, frequently citing AI as allowing them to do more with less people. In a variation on that approach Snowflake says it used layoffs to force software engineers to use AI tools and become… 15 Page 6 of 10 · 500 articles ← Newer Older →