arXiv — Machine Learning
500 articles archived · Visit source ↗ · RSS
-
arXiv — Machine Learning research 6d ago
Systematic Exploration of 4-Expert Heterogeneous Mixture-of-Experts via Automated Pipeline Search
arXiv:2606.23739v1 Announce Type: new Abstract: We present an automated large-scale search pipeline for heterogeneous 4-Expert Mixture-of-Experts (MoE4) architectures within the LEMUR neural network dataset ecosystem. Building on a hand-crafted heterogeneous MoE reference model,…
36 -
arXiv — Machine Learning research 6d ago
Weight-Space Geometry of Offline Reasoning Training
arXiv:2606.23740v1 Announce Type: new Abstract: Offline reinforcement-learning losses (RFT, RIFT, DFT, Offline GRPO, DPO) are widely used to distill reasoning from large teachers into smaller students, and are typically compared on downstream accuracy alone. We ask whether they…
6 -
arXiv — Machine Learning research 6d ago
A Survey on Federated Causal Discovery and Inference
arXiv:2606.23741v1 Announce Type: new Abstract: Causal reasoning, which encompasses the discovery of causal structures and the inference of causal effects, is fundamental to data-driven decision making. In practice, data for reliable causal analysis are often distributed across…
7 -
arXiv — Machine Learning research 6d ago
Low-power analogue neural networks with trainable nonlinear connections for continuous control
arXiv:2606.23742v1 Announce Type: new Abstract: Physical neural networks promise low-power machine learning by computing directly with analogue device physics, but most architectures force nonlinear device responses to act as scalar weights. Inspired by Kolmogorov-Arnold…
28 -
arXiv — Machine Learning research 6d ago
Synergizing Physically Constrained MCMC and Chemical-Informed Gaussian Processes for Reaction Network Discovery
arXiv:2606.23757v1 Announce Type: new Abstract: Extracting interpretable governing equations from sparse, noisy chemical time-series data remains difficult because discrete reaction topology and continuous kinetic parameters are tightly coupled. We present PC-MCMC-CIGP, a…
33 -
arXiv — Machine Learning research 6d ago
Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios
arXiv:2606.23758v1 Announce Type: new Abstract: Domain generalization learns from multiple source domains to generalize to unseen target domains. However, it often neglects the realistic case of label mismatch between source and target. Open set domain generalization is then…
35 -
-
arXiv — Machine Learning research 6d ago
Deciphering Fingerprints of 3D Molecular Surfaces for Accurate Epitope Prediction
arXiv:2606.23830v1 Announce Type: new Abstract: Molecular surfaces encode the geometric and physicochemical patterns that determine antibody-antigen recognition, central to epitope prediction. However, existing methods rely on sequences or backbone structures and struggle to…
17 -
-
arXiv — Machine Learning research 6d ago
The Degeneracy Distillery
arXiv:2606.23838v1 Announce Type: new Abstract: When two or more parameters or labels produce similar data, they are degenerate, or hard to distinguish. Degeneracies render both label prediction and inverse problems difficult, since both machine learning algorithms and…
5 -
-
arXiv — Machine Learning research 6d ago
Sesame: Structure-Aware Molecular Generation via Spatial Density-Map Conditioning
arXiv:2606.23856v1 Announce Type: new Abstract: Generative molecular models for drug design are a promising direction with much active research. In the next phase of computational drug design, such models will need to understand small molecule structure and protein-ligand…
25 -
arXiv — Machine Learning research 6d ago
Are Safety Guarantees in Neural Networks Safe? How to Compute Trustworthy Robustness Certifications
arXiv:2606.23858v1 Announce Type: new Abstract: A primary challenge in AI safety is the existence of adversarial examples -- slightly distorted inputs that cause a neural network (NN) to misclassify. To mitigate this problem, recent research focuses on the computation of…
12 -
arXiv — Machine Learning research 6d ago
Exact Schur-Sylvester Dimensionality Reductions for Non-Smooth Stochastic Complexity and Manifold Sampling
arXiv:2606.23867v1 Announce Type: new Abstract: The exact computation of the Normalized Maximum Likelihood (NML) codelength for regular non-smooth estimators (e.g., Lasso) has been historically limited by the cubic scaling walls of manifold-constrained projection and volume…
11 -
-
arXiv — Machine Learning research 6d ago
MGI: Member vs Generated Inference
arXiv:2606.23872v1 Announce Type: new Abstract: As generative models increasingly produce samples that are indistinguishable from human-created content, it becomes difficult to determine whether a given data point was part of a model's natural training set or was generated by…
8 -
arXiv — Machine Learning research 6d ago
GRACE: Gated Refinement for Accurate Causal Edge Discovery in High-Dimensional Time Series
arXiv:2606.23880v1 Announce Type: new Abstract: From climate teleconnections to gene regulation, modern time-series datasets encompass tens or hundreds of interacting variables, making causal discovery increasingly challenging. Constraint-based methods offer statistical rigor…
30 -
arXiv — Machine Learning research 6d ago
ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion Distillation
arXiv:2606.23898v1 Announce Type: new Abstract: Distilling conditional diffusion models aims to transfer the behavior of a large teacher to a smaller student while preserving alignment across conditioning inputs. Unlike recognition tasks, knowledge distillation in conditional…
14 -
arXiv — Machine Learning research 6d ago
Closing the Loop: Formally Verified Law as a Reward Signal for Self-Improving Legal AI
arXiv:2606.23913v1 Announce Type: new Abstract: This article develops an architecture that creates a formally verifiable reward signal to train legal AI, adapting the LLM proposes, verifier disposes paradigm from mathematical AI to the distinctive demands of law. We present an…
13 -
arXiv — Machine Learning research 6d ago
Catastrophic Compositional Generation: Why Vanilla Diffusion Models Fail to Extrapolate
arXiv:2606.23920v1 Announce Type: new Abstract: The task of compositional generation involves using a conditional generative model, trained only on a subset of the possible conditions, to produce samples from compositionally-defined target distributions such as a geometric…
17 -
arXiv — Machine Learning research 6d ago
KLip-PPO: A per-sample KL perspective on PPO-Clip
arXiv:2606.23932v1 Announce Type: new Abstract: Proximal Policy Optimization (PPO) is the standard policy-gradient algorithm for on-policy reinforcement learning. The literature presents it in two forms, a clipped surrogate that bounds the importance ratio between successive…
8 -
arXiv — Machine Learning research 6d ago
DREG: A Layer-Wise Jacobian Regularization as a General-Purpose Penalty
arXiv:2606.23942v1 Announce Type: new Abstract: We present a large-scale empirical study isolating the contributions of the Derivative Regularization penalty (DREG). Across a fully-crossed factorial sweep of 960 experiments spanning 4 activations, 6 regularizers, 8 datasets, and…
28 -
arXiv — Machine Learning research 6d ago
Learning the Koopman Operator using Attention Free Transformers
arXiv:2606.23957v1 Announce Type: new Abstract: Learning Koopman operators with autoencoders enables linear prediction in a latent space, but long-horizon rollouts often drift off the learned manifold, leading to phase and amplitude errors on systems with switching, continuous…
8 -
arXiv — Machine Learning research 6d ago
Forget Without Compromise: Nexus Sampling for Streaming KV-Cache Eviction Under Fixed Budgets
arXiv:2606.23961v1 Announce Type: new Abstract: Long-context and agentic LLM workloads push the KV cache past any fixed memory budget, forcing the inference stack to permanently evict tokens at every step of a continuous-inference stream. Existing methods all share the same…
20 -
arXiv — Machine Learning research 6d ago
3D Masked Autoencoders are Robust Learners of Volumetric and Multimodal Cellular Representations for Microscopy
arXiv:2606.23964v1 Announce Type: new Abstract: Self-supervised learning in fluorescence microscopy often relies on 2D projections, despite the inherently three-dimensional nature of cells. We present a systematic comparison of 2D and 3D masked autoencoders (MAE-2D vs. MAE-3D)…
34 -
arXiv — Machine Learning research 6d ago
A Comparative Study of Bayesian Contextual Bandits for Real-Time Warehouse Sorter Optimization
arXiv:2606.23977v1 Announce Type: new Abstract: Efficient sorter diversion control of automated material handling systems (MHS) is critical for optimizing operational efficiency in large-scale warehouse environments. In this study, we use an inbound receiving sorter at a…
19 -
arXiv — Machine Learning research 6d ago
Offline Reinforcement Learning for Warehouse SLAM Throughput Control
arXiv:2606.23978v1 Announce Type: new Abstract: We present an offline reinforcement learning (RL) framework for optimizing SLAM throughput control in a warehouse fulfillment environment. SLAM (Scan/Label/Apply/Manifest) throughput directly influences system congestion and…
18 -
arXiv — Machine Learning research 6d ago
Learning to Trigger: Reinforcement Learning at the Large Hadron Collider
arXiv:2606.23993v1 Announce Type: new Abstract: High-throughput scientific facilities such as the Large Hadron Collider depend on real-time event filtering (\textit{triggering}) under tight constraints on bandwidth, latency, and storage. In practice, trigger menus are largely…
24 -
arXiv — Machine Learning research 6d ago
EMAgnet: Parameter-Space EMA Regularization for Policy Gradient Self-Play in Large Games
arXiv:2606.23995v1 Announce Type: new Abstract: Recent work has established that regularized policy gradient methods such as PPO, when used in self-play, can match or exceed specialized game-theoretic algorithms for solving two-player zero-sum imperfect-information games. The…
25 -
arXiv — Machine Learning research 6d ago
Cyclic Denoising Reveals Ultrastable Memories in Diffusion Models
arXiv:2606.24000v1 Announce Type: new Abstract: We introduce cyclic denoising -- repeated forward and reverse diffusion at controlled noise amplitudes -- as an extraction attack for image diffusion models. Inspired by random organization in disordered solids, cyclic denoising…
17 -
arXiv — Machine Learning research 6d ago
Fast and Slow Variational Continual Learning
arXiv:2606.24007v1 Announce Type: new Abstract: Continual learning remains a major challenge for modern deep networks, partly because commonly used optimizers lack inherent mechanisms for continual adaptation. One such natural mechanism is fast and slow adaptation to balance…
37 -
arXiv — Machine Learning research 6d ago
You Don't Need to Run Every Eval
arXiv:2606.24020v1 Announce Type: new Abstract: A modern model release reports scores on 40+ benchmarks and the same evaluations were run many more times before it: to track training progress, compare design choices, and select the checkpoint for the release. But do we need to…
29 -
arXiv — Machine Learning research 6d ago
Information-Theoretic Classifier-Free Guidance with Adaptive Schedule Optimization
arXiv:2606.24025v1 Announce Type: new Abstract: Diffusion models have achieved strong performance in image, text-to-image, and video generation, where conditional generation is often controlled by classifier-free guidance (CFG). CFG improves condition consistency by increasing a…
35 -
arXiv — Machine Learning research 6d ago
RoPE-Aware Bit Allocation for KV-Cache Quantization
arXiv:2606.24033v1 Announce Type: new Abstract: Existing low-bit KV-cache quantizers often treat each cached key as a flat vector. Under RoPE, however, a key's contribution to a future attention logit decomposes into a position-dependent sum over two-dimensional frequency…
5 -
arXiv — Machine Learning research 6d ago
Rapid FinFET Modelling Using an Autoencoder
arXiv:2606.24046v1 Announce Type: new Abstract: This work presents a machine learning framework that leverages an autoencoder (AE) for the efficient modeling of FinFET. We first calibrated a BSIM-CMG model to generate a dataset of current-voltage (ID-VG) characteristics. This…
7 -
arXiv — Machine Learning research 6d ago
RAVEN: A Regime-Aware Variable-context Expert Network for Financial Time Series Forecasting
arXiv:2606.24062v1 Announce Type: new Abstract: Financial time series forecasting presents structural challenges absent from standard benchmarks. Log-returns are non-stationary, exhibit exceptionally low signal-to-noise (SNR) ratios, and are governed by regime-dependent temporal…
8 -
arXiv — Machine Learning research 6d ago
Blockwise Policy-Drift Gating for On-Policy Distillation
arXiv:2606.24084v1 Announce Type: new Abstract: On-policy distillation (OPD) trains a student policy using teacher signals computed on trajectories sampled by the student itself. Recent work shows that sampled-token OPD can be fragile on long-horizon reasoning tasks and that…
30 -
arXiv — Machine Learning research 6d ago
NeuroSonic: Conditional Flow Matching for EEG-to-Speech Reconstruction
arXiv:2606.24087v1 Announce Type: new Abstract: Reconstructing continuous speech from scalp electroencephalography (EEG) remains fundamentally challenging. EEG provides a weak, spatially diffuse, and highly variable measurement of distributed cortical activity, whereas speech is…
9 -
arXiv — Machine Learning research 6d ago
FedUP: One-Shot Federated Unlearning via Centroid-Guided Plug-in Filters
arXiv:2606.24113v1 Announce Type: new Abstract: Federated unlearning (FU) is critical for complying with legal mandates like the right to be forgotten in decentralized systems, yet current methods face a persistent dilemma between non-target knowledge loss and high request…
28 -
arXiv — Machine Learning research 6d ago
When Top-1 Fails: Calibrating LoRA Monitors for Masked Diffusion LMs
arXiv:2606.24119v1 Announce Type: new Abstract: Discrete diffusion language model (DLM) fine-tuning inherits inexpensive diagnostics from denoising-time confidence monitors, but their PEFT-training meaning is untested. We test top-1 argmax concentration as a collapse warning.…
12 -
arXiv — Machine Learning research 6d ago
Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning
arXiv:2606.24133v1 Announce Type: new Abstract: The composition of training data, governed by the diversity of sources and their mixing strategy, is a cornerstone of Large Language Model (LLM) pre-training. Online Data Mixing (ODM), the technique of adaptively adjusting data…
13 -
arXiv — Machine Learning research 6d ago
A Time-Reparameterized Cumulative Intensity Extrapolation Sampler for Discrete Flow Matching
arXiv:2606.24140v1 Announce Type: new Abstract: Discrete flow matching (DFM) provides a principled framework for generative modeling on discrete state spaces via continuous-time Markov chain dynamics. In practice, sampling for DFM commonly employs discretizations such as…
15 -
arXiv — Machine Learning research 6d ago
AsyncOPD: How Stale Can On-Policy Distillation Be?
arXiv:2606.24143v1 Announce Type: new Abstract: On-policy distillation (OPD) trains a student on its own rollouts guided by teacher feedback and is becoming increasingly important for large language model (LLM) post-training. Like reinforcement learning (RL), however, OPD faces…
10 -
arXiv — Machine Learning research 6d ago
Lightweight Transformer Models for On-Device Fault Detection: A Benchmark Study on Resource-Constrained Deployment
arXiv:2606.24173v1 Announce Type: new Abstract: On-device fault detection enables real-time diagnostics without cloud dependency, but deploying machine learning models on resource-constrained hardware demands careful tradeoffs between accuracy, latency, and model size. We…
14 -
arXiv — Machine Learning research 6d ago
Project Ariadne: Prompt-Conditioned Route Generation for Synthesis Planning
arXiv:2606.24184v1 Announce Type: new Abstract: Retrosynthetic planning seeks to connect a target molecule to commercially available starting materials through a multistep route. Classical planners construct such routes by iteratively applying single-step reaction models within…
26 -
arXiv — Machine Learning research 6d ago
Managing Task Execution for Unknown Workloads in Batteryless IoT: A Hardware-Agnostic Evaluation
arXiv:2606.24340v1 Announce Type: new Abstract: In recent years, the Internet of Things (IoT) paradigm has been shifting toward batteryless, energy-harvesting architectures. Sustaining reliable operation in these systems requires intelligent management of highly volatile stored…
30 -
arXiv — Machine Learning research 6d ago
Parallel Manifold Steering: Efficient Adaptation of Large Associative Memories via Residual Energy Shaping
arXiv:2606.24396v1 Announce Type: new Abstract: Large Transformer models function as Dense Associative Memories (DAMs), retrieving knowledge via high-dimensional attractor dynamics driven by the self-attention mechanism \citep{ramsauer2020hopfield, wu2024attention}. However,…
34 -
arXiv — Machine Learning research 6d ago
Natural Identifiers for Privacy and Data Audits in Large Language Models
arXiv:2606.24408v1 Announce Type: new Abstract: Assessing the privacy of large language models (LLMs) presents significant challenges. In particular, most existing methods for auditing differential privacy require the insertion of specially crafted canary data during training,…
28 -
arXiv — Machine Learning research 6d ago
Data Augmentation: A Fourier Analysis Perspective
arXiv:2606.24418v1 Announce Type: new Abstract: Data augmentation is a simple and model-agnostic approach for exploiting known invariances in learning problems. Given a group acting on the input space, one augments the training set with transformed copies of each sample. Because…
37 -
arXiv — Machine Learning research 6d ago
An LLM-based Two-Stage Transformer Framework for Cross-Domain Bearing Fault Diagnosis with Limited Data
arXiv:2606.24459v1 Announce Type: new Abstract: Bearing fault diagnosis faces critical challenges when dataset heterogeneity, operating condition variations, and limited labeled data occur simultaneously in industrial environments. Existing approaches address these issues in…
30