arXiv — Machine Learning

116 articles archived · Visit source ↗ · RSS

arXiv — Machine Learning research 17h ago

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation

arXiv:2605.10947v1 Announce Type: new Abstract: EEG microstate analysis segments continuous brain electrical activity into brief, quasi-stable topographic configurations that reflect discrete functional brain states. Conventional approaches such as Modified K-Means operate…

22
arXiv — Machine Learning research 17h ago

QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization

arXiv:2605.10959v1 Announce Type: new Abstract: There is currently no unified metric for evaluating the efficiency of quantized neural networks. We propose QuIDE, built around the Intelligence Index I = (C x P)/log_2(T+1), which collapses the compression-accuracy-latency…

22
arXiv — Machine Learning research 17h ago

Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models

arXiv:2605.10971v1 Announce Type: new Abstract: Discrete diffusion language models (DLMs) generate text by iteratively denoising all positions in parallel, offering an alternative to autoregressive models. Controlled generation methods for DLMs, imported from autoregressive…

4
arXiv — Machine Learning research 17h ago

Rotation-Preserving Supervised Fine-Tuning

arXiv:2605.10973v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) improves in-domain performance but can degrade out-of-domain (OOD) generalization. Prior work suggests that this degradation is related to changes in dominant singular subspaces of pretrained weight…

22
arXiv — Machine Learning research 17h ago

Vertex-Softmax: Tight Transformer Verification via Exact Softmax Optimization

arXiv:2605.10974v1 Announce Type: new Abstract: Certified verification of transformer attention requires bounding the softmax function over interval constraints on the pre-softmax scores. Existing verifiers relax softmax ndependently of the downstream objective, leaving…

26
arXiv — Machine Learning research 17h ago

Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation

arXiv:2605.10975v1 Announce Type: new Abstract: Graphs with heterophily, where adjacent nodes carry different labels, are prevalent in real-world applications, from social networks to molecular interactions. However, existing spectral Graph Neural Network (GNN) approaches…

24
arXiv — Machine Learning research 17h ago

LEAP: Unlocking dLLM Parallelism via Lookahead Early-Convergence Token Detection

arXiv:2605.10980v1 Announce Type: new Abstract: Diffusion Language Models (dLLMs) have garnered significant attention for their potential in highly parallel processing. The parallel capabilities of existing dLLMs stem from the assumption of conditional independence at high…

35
arXiv — Machine Learning research 17h ago

$\xi$-DPO: Direct Preference Optimization via Ratio Reward Margin

arXiv:2605.10981v1 Announce Type: new Abstract: Reference-free preference optimization has emerged as an efficient alternative to reinforcement learning from human feedback, with Simple Preference Optimization(SimPO) demonstrating strong performance by eliminating the explicit…

23
arXiv — Machine Learning research 17h ago

TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment

arXiv:2605.10983v1 Announce Type: new Abstract: Reinforcement learning (RL) has shown extraordinary potential in aligning diffusion models to downstream tasks, yet most of them still suffer from significant reward hacking, which degrades generative diversity and quality by…

10
arXiv — Machine Learning research 17h ago

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning

arXiv:2605.10985v1 Announce Type: new Abstract: Protein language models such as ESM-2 learn rich residue representations that achieve strong performance on protein function prediction, but their features remain difficult to interpret as structural $\&$ evolutionary signals are…

17
arXiv — Machine Learning research 17h ago

AESOP: Adversarial Execution-path Selection to Overload Deep Learning Pipelines

arXiv:2605.10987v1 Announce Type: new Abstract: Modern machine learning deployments increasingly compose specialized models into dynamic inference pipelines, where upstream components produce intermediate predictions that determine the workload and inputs of downstream…

21
arXiv — Machine Learning research 17h ago

Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation

arXiv:2605.10988v1 Announce Type: new Abstract: Log anomaly detection is a critical task for system operations and security assurance. However, in networked systems at scale, log data are generated at massive scale while instance-level annotations are prohibitively expensive,…

29
arXiv — Machine Learning research 17h ago

SURGE: Surrogate Gradient Adaptation in Binary Neural Networks

arXiv:2605.10989v1 Announce Type: new Abstract: The training of Binary Neural Networks (BNNs) is fundamentally based on gradient approximation for non-differentiable binarization operations (e.g., sign function). However, prevailing methods including the Straight-Through…

11
arXiv — Machine Learning research 17h ago

Test-Time Personalization: A Diagnostic Framework and Probabilistic Fix for Scaling Failures

arXiv:2605.10991v1 Announce Type: new Abstract: Existing approaches to LLM personalization focus on constructing better personalized models or inputs, while treating inference as a single-shot process. In this work, we study Test-Time Personalization (TTP) along an unexplored…

11
arXiv — Machine Learning research 17h ago

SkillGen: Verified Inference-Time Agent Skill Synthesis

arXiv:2605.10999v1 Announce Type: new Abstract: Skills are a promising way to improve LLM agent capabilities without retraining, while keeping the added procedure reusable and controllable. However, high-quality skills are still largely written by hand. We introduce SkillGen, a…

33
arXiv — Machine Learning research 17h ago

Finite Volume-Informed Neural Network Framework for 2D Shallow Water Equations: Rugged Loss Landscapes and the Importance of Data Guidance

arXiv:2605.11001v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) are a simple surrogate-modelling paradigm for partial differential equations, but their standard strong-form residual formulation is ill suited to the shallow water equations (SWE). It…

20
arXiv — Machine Learning research 17h ago

DisagMoE: Computation-Communication overlapped MoE Training via Disaggregated AF-Pipe Parallelism

arXiv:2605.11005v1 Announce Type: new Abstract: Mixture-of-experts (MoE) architectures enable trillion-parameter LLMs with sparsely activated experts. Expert parallelism (EP) is a widely adopted MoE training strategy, but it suffers from severe all-to-all communication…

25
arXiv — Machine Learning research 17h ago

RT-Transformer: The Transformer Block as a Spherical State Estimator

arXiv:2605.11007v1 Announce Type: new Abstract: We show that the core components of the Transformer block -- attention, residual connections, and normalization -- arise naturally from a single geometric estimation problem. Modeling the latent state as a direction on the…

19
arXiv — Machine Learning research 17h ago

When and How to Canonize: A Generalization Perspective

arXiv:2605.11008v1 Announce Type: new Abstract: While invariant architectures are standard for processing symmetric data, there is growing interest in achieving invariance by applying group averaging or canonization to non-invariant backbones. However, the theoretical…

12
arXiv — Machine Learning research 17h ago

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network

arXiv:2605.11009v1 Announce Type: new Abstract: Long-horizon, sparse-reward tasks pose a fundamental challenge for reinforcement learning, since single-step TD learning suffers from bootstrapping error accumulation across successive Bellman updates. Actor-critic methods with…

34
arXiv — Machine Learning research 17h ago

A Comparative Study of Federated Learning Aggregation Strategies under Homogeneous and Heterogeneous Data Distributions

arXiv:2605.11010v1 Announce Type: new Abstract: Federated Learning has emerged as a transformative paradigm for collaborative machine learning across distributed environments. However, its performance is strongly influenced by the aggregation strategy used to combine local model…

17
arXiv — Machine Learning research 17h ago

LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models

arXiv:2605.11011v1 Announce Type: new Abstract: Looped computation shows promise in improving the reasoning-oriented performance of LLMs by scaling test-time compute. However, existing approaches typically require either training recurrent models from scratch or applying…

37
arXiv — Machine Learning research 17h ago

Backbone-Equated Diffusion OOD via Sparse Internal Snapshots

arXiv:2605.11014v1 Announce Type: new Abstract: Fair comparison between diffusion-based OOD detectors is challenging, as conclusions can vary with backbone choice, corruption parameterization, and test-time budget. We address this issue through a Mutualized Backbone-Equated…

30
arXiv — Machine Learning research 17h ago

Simpson's Paradox in Behavioral Curves: How Aggregation Distorts Parametric Models of User Dynamics

arXiv:2605.11017v1 Announce Type: new Abstract: Behavioral curve modeling -- fitting parametric functions to engagement-versus-exposure data -- is standard practice in recommendation, advertising, and clinical dosing. We show that aggregation introduces a systematic distortion:…

13
arXiv — Machine Learning research 17h ago

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness

arXiv:2605.11019v1 Announce Type: new Abstract: Although large language models rely on chain-of-thought for complex reasoning, the overthinking phenomenon severely degrades inference efficiency. Existing reinforcement learning methods compress reasoning chains by designing…

23
arXiv — Machine Learning research 17h ago

Trust Region Inverse Reinforcement Learning: Explicit Dual Ascent using Local Policy Updates

arXiv:2605.11020v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) is typically formulated as maximizing entropy subject to matching the distribution of expert trajectories. Classical (dual-ascent) IRL guarantees monotonic performance improvement but requires…

14
arXiv — Machine Learning research 17h ago

A Switching System Theory of Q-Learning with Linear Function Approximation

arXiv:2605.11021v1 Announce Type: new Abstract: This paper develops a switching-system interpretation of Q-learning with linear function approximation (LFA) based on the joint spectral radius (JSR). We derive an exact linear switched model for the mean dynamics and relate…

11
arXiv — Machine Learning research 17h ago

ASD-Bench: A Four-Axis Comprehensive Benchmark of AI Models for Autism Spectrum Disorder

arXiv:2605.11091v1 Announce Type: new Abstract: Automated ASD screening tools remain limited by single-architecture evaluations, axis-restricted assessment, and near-exclusive focus on adult cohorts, obscuring age-specific diagnostic patterns critical for early intervention. We…

4
arXiv — Machine Learning research 17h ago

Enabling Performant and Flexible Model-Internal Observability for LLM Inference

arXiv:2605.11093v1 Announce Type: new Abstract: Today's inference-time workloads increasingly depend on timely access to a model's internal states. We present DMI-Lib, a high-speed deep model inspector that treats internal observability as a first-class systems primitive,…

18
arXiv — Machine Learning research 17h ago

Newton's Lantern: A Reinforcement Learning Framework for Finetuning AC Power Flow Warm Start Models

arXiv:2605.11102v1 Announce Type: new Abstract: Neural warm starts can sharply reduce the number of Newton-Raphson iterations required to solve the AC power flow problem, but existing supervised approaches generalize poorly on heavily loaded instances near voltage collapse. We…

11
arXiv — Machine Learning research 17h ago

GRAFT-ATHENA: Self-Improving Agentic Teams for Autonomous Discovery and Evolutionary Numerical Algorithms

arXiv:2605.11117v1 Announce Type: new Abstract: Scientific discovery can be modeled as a sequence of probabilistic decisions that map physical problems to numerical solutions. Recent agentic AI systems automate individual scientific tasks by orchestrating LLM-driven planners,…

22
arXiv — Machine Learning research 17h ago

Language Modeling with Hyperspherical Flows

arXiv:2605.11125v1 Announce Type: new Abstract: Discrete Diffusion Language Models progressed rapidly as an alternative to autoregressive (AR) models, motivated by their parallel generation abilities. However, for tractability, discrete diffusion models sample from a factorized…

17
arXiv — Machine Learning research 17h ago

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

arXiv:2605.11130v1 Announce Type: new Abstract: Critical events in multivariate time series, from turbine failures to cardiac arrhythmias, demand accurate prediction, yet labeled data is scarce because such events are rare and costly to annotate. We introduce HEPA…

16
arXiv — Machine Learning research 17h ago

Steerable Neural ODEs on Homogeneous Spaces

arXiv:2605.11133v1 Announce Type: new Abstract: We introduce steerable neural ordinary differential equations on homogeneous spaces $M=G/H$. These models constitute a novel geometric extension of manifold neural ordinary differential equations (NODEs) that transport associated…

33
arXiv — Machine Learning research 17h ago

Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training

arXiv:2605.11134v1 Announce Type: new Abstract: Preference learning methods such as Direct Preference Optimization (DPO) are known to induce reliance on spurious correlations, leading to sycophancy and length bias in today's language models and potentially severe goal…

13
arXiv — Machine Learning research 17h ago

Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models

arXiv:2605.11142v1 Announce Type: new Abstract: Graph representation learning has become a standard approach for analyzing networked data, with latent embeddings widely used for link prediction, community detection, and related tasks. Yet a basic design choice, the latent…

36
arXiv — Machine Learning research 17h ago

CORE: Cyclic Orthotope Relation Embedding for Knowledge Graph Completion

arXiv:2605.11159v1 Announce Type: new Abstract: Knowledge graph completion (KGC) aims to automatically infer missing facts in multi-relational data by mapping entities and relations into continuous representation spaces. Recent region-based embedding models have shown great…

16
arXiv — Machine Learning research 17h ago

Interpretability Can Be Actionable

arXiv:2605.11161v1 Announce Type: new Abstract: Interpretability aims to explain the behavior of deep neural networks. Despite rapid growth, there is mounting concern that much of this work has not translated into practical impact, raising questions about its relevance and…

37
arXiv — Machine Learning research 17h ago

COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication

arXiv:2605.11165v1 Announce Type: new Abstract: Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client…

36
arXiv — Machine Learning research 17h ago

Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data

arXiv:2605.11170v1 Announce Type: new Abstract: Noise-based certified machine unlearning currently faces a hard ceiling: the noise magnitude required to certify unlearning typically destroys model utility, particularly for large-scale deletion requests. While leveraging public…

12
arXiv — Machine Learning research 17h ago

Optimistic Dual Averaging Unifies Modern Optimizers

arXiv:2605.11172v1 Announce Type: new Abstract: We introduce SODA, a generalization of Optimistic Dual Averaging, which provides a common perspective on state-of-the-art optimizers like Muon, Lion, AdEMAMix and NAdam, showing that they can all be viewed as optimistic instances…

31
arXiv — Machine Learning research 17h ago

Oversmoothing as Representation Degeneracy in Neural Sheaf Diffusion

arXiv:2605.11178v1 Announce Type: new Abstract: Neural Sheaf Diffusion (NSD) generalizes diffusion-based Graph Neural Networks by replacing scalar graph Laplacians with sheaf Laplacians whose learned restriction maps define a task-adapted geometry. While the diffusion limit of…

25
arXiv — Machine Learning research 17h ago

Muon is Not That Special: Random or Inverted Spectra Work Just as Well

arXiv:2605.11181v1 Announce Type: new Abstract: The recent empirical success of the Muon optimizer has renewed interest in non-Euclidean optimization, typically justified by similarities with second-order methods, and linear minimization oracle (LMO) theory. In this paper, we…

8
arXiv — Machine Learning research 17h ago

CATS: Cascaded Adaptive Tree Speculation for Memory-Limited LLM Inference Acceleration

arXiv:2605.11186v1 Announce Type: new Abstract: Auto-regressive decoding in Large Language Models (LLMs) is inherently memory-bound: every generation step requires loading the model weights and intermediate results from memory (e.g., High-Bandwidth Memory (HBM) for GPU servers),…

19
arXiv — Machine Learning research 17h ago

Deep Learning for Protein Complex Prediction and Design

arXiv:2605.11189v1 Announce Type: new Abstract: Accurately modeling and designing protein complex structures is a central problem in computational structural biology, with broad implications for understanding cellular function and developing therapeutics. This thesis…

16
arXiv — Machine Learning research 17h ago

Variational Linear Attention: Stable Associative Memory for Long-Context Transformers

arXiv:2605.11196v1 Announce Type: new Abstract: Linear attention reduces the quadratic cost of softmax attention to $\mathcal{O}(T)$, but its memory state grows as $\mathcal{O}(T)$ in Frobenius norm, causing progressive interference between stored associations. We introduce…

13
arXiv — Machine Learning research 17h ago

FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry

arXiv:2605.11203v1 Announce Type: new Abstract: Intermediate feature representations represent the backbone for the expressivity and adaptability of deep neural networks. However, their geometric structure remains poorly understood. In this submission, we provide indirect…

20
arXiv — Machine Learning research 17h ago

The Scaling Law of Evaluation Failure: Why Simple Averaging Collapses Under Data Sparsity and Item Difficulty Gaps, and How Item Response Theory Recovers Ground Truth Across Domains

arXiv:2605.11205v1 Announce Type: new Abstract: Benchmark evaluation across AI and safety-critical domains overwhelmingly relies on simple averaging. We demonstrate that this practice produces substantially misleading rankings when two conditions co-occur: (1) the evaluation…

34
arXiv — Machine Learning research 17h ago

Measuring Five-Nines Reliability: Sample-Efficient LLM Evaluation in Saturated Benchmarks

arXiv:2605.11209v1 Announce Type: new Abstract: While existing benchmarks demonstrate the near-perfect performance of large language models (LLMs) on various tasks, this apparent saturation often obscures the need for rigorous evaluation of their reliability. In real-world…

36
arXiv — Machine Learning research 17h ago

Enforcing Constraints in Generative Sampling via Adaptive Correction Scheduling

arXiv:2605.11214v1 Announce Type: new Abstract: Hard constraints in generative sampling are typically enforced by projection, applied either once at the end of sampling or after every update. This binary framing overlooks a fundamental issue: projection changes the distribution…

17
arXiv — Machine Learning research 17h ago

Leveraging RAG for Training-Free Alignment of LLMs

arXiv:2605.11217v1 Announce Type: new Abstract: Large language model (LLM) alignment algorithms typically consist of post-training over preference pairs. While such algorithms are widely used to enable safety guardrails and align LLMs with general human preferences, we show that…

36
arXiv — Machine Learning research 17h ago

ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models

arXiv:2605.11222v1 Announce Type: new Abstract: Quantization is an effective strategy to reduce the storage and computation footprint of large language models (LLMs). Post-training quantization (PTQ) is a leading approach for compressing LLMs. Popular weight quantization…

5
arXiv — Machine Learning research 17h ago

LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection

arXiv:2605.11231v1 Announce Type: new Abstract: Synthetic data is useful only when the added samples fill missing parts of the training distribution that matter for the downstream task. We introduce LiBaGS, a lightweight, generator-agnostic method for targeted synthetic training…

30
arXiv — Machine Learning research 17h ago

A Comparative Study of Model Selection Criteria for Symbolic Regression

arXiv:2605.11233v1 Announce Type: new Abstract: Effective model selection is critical in symbolic regression (SR) to identify mathematical expressions that balance accuracy and complexity, and have low expected error on unseen data. Many modern implementations of genetic…

38
arXiv — Machine Learning research 17h ago

Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning

arXiv:2605.11235v1 Announce Type: new Abstract: In LLM Reinforcement Fine-Tuning (RFT), curriculum learning drives both efficiency and performance. Yet, current methods externalize curriculum judgment via handcrafted heuristics or auxiliary models, risking misalignment with the…

18
arXiv — Machine Learning research 17h ago

DeconDTN-Toolkit: A Library for Evaluation and Enhancement of Robustness to Provenance Shift

arXiv:2605.11237v1 Announce Type: new Abstract: Despite the burgeoning body of work on distribution shifts, provenance shift-where the relationship between data source and label changes at deployment-remains poorly understood and under-addressed. In this paper, we establish a…

13
arXiv — Machine Learning research 17h ago

Extending Kernel Trick to Influence Functions

arXiv:2605.11239v1 Announce Type: new Abstract: In this paper, we present a dual representation of the influence functions, whose computational complexity scales with dataset size rather than model size. Both analytically and experimentally, we show that this representation can…

7
arXiv — Machine Learning research 17h ago

Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization

arXiv:2605.11246v1 Announce Type: new Abstract: Offline black-box optimization aims to discover novel designs with high property scores using only a static dataset, a task fundamentally challenged by the out-of-distribution (OOD) extrapolation problem. Existing approaches…

13
arXiv — Machine Learning research 17h ago

A Proof-of-Concept Simulation-Driven Digital Twin Framework for Decision-Aware Diabetes Modeling

arXiv:2605.11247v1 Announce Type: new Abstract: This paper presents a proof-of-concept digital twin framework for simulation-driven diabetes modeling using benchmark clinical data, synthetic temporal augmentation, and illustrative continuous glucose monitoring (CGM) analysis.…

27
arXiv — Machine Learning research 17h ago

Curriculum Learning-Guided Progressive Distillation in Large Language Models

arXiv:2605.11260v1 Announce Type: new Abstract: Knowledge distillation is a key technique for transferring the capabilities of large language models (LLMs) into smaller, more efficient student models. Existing distillation approaches often overlook two critical factors: the…

26
arXiv — Machine Learning research 17h ago

Latent Chain-of-Thought Improves Structured-Data Transformers

arXiv:2605.11262v1 Announce Type: new Abstract: Chain-of-thought and more broadly test-time compute are known to augment the expressive capabilities of language models and have led to major innovations in reasoning. Motivated by this success, this paper explores latent…

24
arXiv — Machine Learning research 17h ago

Localization Boosting for Growth Markets: Mitigating Cross-Locale Behavioral Bias in Learning-to-Rank

arXiv:2605.11272v1 Announce Type: new Abstract: Adobe Express is expanding internationally, but the US has a disproportionately large content supply and interaction volume. Learning-to-rank (LTR) models trained primarily on behavioral feedback inherit this imbalance: templates…

20
arXiv — Machine Learning research 17h ago

Beyond Similarity: Temporal Operator Attention for Time Series Analysis

arXiv:2605.11287v1 Announce Type: new Abstract: A persistent paradox in time-series forecasting is that structurally simple MLP and linear models often outperform high-capacity Transformers. We argue that this gap arises from a mismatch in the sequence-modeling primitive: while…

18
arXiv — Machine Learning research 17h ago

Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning

arXiv:2605.11289v1 Announce Type: new Abstract: Average-reward reinforcement learning requires estimating the gain and the bias, which is defined only up to an additive constant. This makes direct distributional analogues ill-posed on the real line. We introduce a quotient-space…

27
arXiv — Machine Learning research 17h ago

Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets

arXiv:2605.11291v1 Announce Type: new Abstract: In this paper, we provide a computable characterization of the geometry of optimal representations in Contrastive Learning (CL) when the classes are imbalanced. When classes are balanced and the representation dimension is greater…

27
arXiv — Machine Learning research 17h ago

Primal Generation, Dual Judgment: Self-Training from Test-Time Scaling

arXiv:2605.11299v1 Announce Type: new Abstract: Code generation is typically trained in the primal space of programs: a model produces a candidate solution and receives sparse execution feedback, often a single pass/fail bit. Test-time scaling enriches the inference procedure by…

32
arXiv — Machine Learning research 17h ago

A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse

arXiv:2605.11302v1 Announce Type: new Abstract: We study language generation in the limit under a global preference ordering on strings, as introduced by Kleinberg and Wei. As in [arXiv:2504.14370, arXiv:2511.05295], we aim for \emph{breadth}, but impose an additional…

20
arXiv — Machine Learning research 17h ago

Couple to Control: Joint Initial Noise Design in Diffusion Models

arXiv:2605.11311v1 Announce Type: new Abstract: Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specify…

11
arXiv — Machine Learning research 17h ago

Error whitening: Why Gauss-Newton outperforms Newton

arXiv:2605.11316v1 Announce Type: new Abstract: The Gauss-Newton matrix is widely viewed as a positive semidefinite approximation of the Hessian, yet mounting empirical evidence shows that Gauss-Newton descent outperforms Newton's method. We adopt a function space perspective to…

5
arXiv — Machine Learning research 17h ago

$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search

arXiv:2605.11324v1 Announce Type: new Abstract: We study the fixed-budget max-min action identification problem in depth-2 max-min trees, an important special case of Monte Carlo Tree Search. A learner sequentially allocates $T$ samples to leaves and then recommends a subtree…

17
arXiv — Machine Learning research 17h ago

Neural Statistical Functions

arXiv:2605.11327v1 Announce Type: new Abstract: Classical deep learning typically operates on individual cases. Despite its success, real-world usage often requires repeated inference to estimate statistical quantities for complex decision-making tasks involving uncertainty or…

24
arXiv — Machine Learning research 17h ago

Epistemic Uncertainty for Test-Time Discovery

arXiv:2605.11328v1 Announce Type: new Abstract: Automated scientific discovery using large language models relies on identifying genuinely novel solutions. Standard reinforcement learning penalizes high-variance mutations, which leads the policy to prioritize familiar patterns.…

31
arXiv — Machine Learning research 17h ago

Physics-Informed Teacher-Student Ensemble Learning for Traffic State Estimation with a Varying Speed Limit Scenario

arXiv:2605.11346v1 Announce Type: new Abstract: Physics-informed deep learning (PIDL) neural networks have shown their capability as a useful instrument for transportation practitioners in utilizing the underlying relationship between the state variables for traffic state…

11
arXiv — Machine Learning research 17h ago

Gradient-Free Noise Optimization for Reward Alignment in Generative Models

arXiv:2605.11347v1 Announce Type: new Abstract: Existing reward alignment methods for diffusion and flow models rely on multi-step stochastic trajectories, making them difficult to extend to deterministic generators. A natural alternative is noise-space optimization, but…

38
arXiv — Machine Learning research 17h ago

gym-invmgmt: An Open Benchmarking Framework for Inventory Management Methods

arXiv:2605.11355v1 Announce Type: new Abstract: Inventory-policy comparisons are often difficult to interpret because performance depends on the evaluation contract as much as on the policy itself. Differences in topology, demand regime, information access, feasibility…

32
arXiv — Machine Learning research 17h ago

The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives

arXiv:2605.11361v1 Announce Type: new Abstract: Inference-time reward alignment asks how to turn a pre-trained diffusion model with base law $p$ into a sampler that favors a reward $r$ while remaining close to $p$. Since there is no canonical distributional distance for this…

27
arXiv — Machine Learning research 17h ago

Causal Fairness for Survival Analysis

arXiv:2605.11362v1 Announce Type: new Abstract: In the data-driven era, large-scale datasets are routinely collected and analyzed using machine learning (ML) and artificial intelligence (AI) to inform decisions in high-stakes domains such as healthcare, employment, and criminal…

31
arXiv — Machine Learning research 17h ago

LPDP: Inference-Time Reward Control for Variable-Length DNA Generation with Edit Flows

arXiv:2605.11368v1 Announce Type: new Abstract: We study the application of recent Edit Flows for inference-time reward control for DNA sequence generation. Unlike most reward-guided DNA generation frameworks, which operate on fixed-length sequence spaces, Edit Flows have a…

6
arXiv — Machine Learning research 17h ago

TRACE: Temporal Routing with Autoregressive Cross-channel Experts for EEG Representation Learning

arXiv:2605.11380v1 Announce Type: new Abstract: Learning transferable representations for electroencephalography (EEG) remains challenging because EEG signals are inherently multi-channel and non-stationary. Channels observed at the same time provide coupled measurements of…

25
arXiv — Machine Learning research 17h ago

Behavioral Mode Discovery for Fine-tuning Multimodal Generative Policies

arXiv:2605.11387v1 Announce Type: new Abstract: We address the problem of fine-tuning pre-trained generative policies with reinforcement learning (RL) while preserving the multimodality of their action distributions. Existing methods for RL fine-tuning of generative policies…

17
arXiv — Machine Learning research 17h ago

MuonQ: Enhancing Low-Bit Muon Quantization via Directional Fidelity Optimization

arXiv:2605.11396v1 Announce Type: new Abstract: The Muon optimizer has emerged as a compelling alternative to Adam for training large language models, achieving remarkable computational savings through gradient orthogonalization. However, Muon's optimizer state is more sensitive…

21
arXiv — Machine Learning research 17h ago

More Than Meets the Eye: A Semantics-Aware Traffic Augmentation Framework for Generalizable Website Fingerprinting

arXiv:2605.11402v1 Announce Type: new Abstract: Deep learning-based website fingerprinting has emerged as an effective technique for inferring the websites users visit. Although existing methods achieve strong performance on closed-world datasets, they often fail to generalize…

23
arXiv — Machine Learning research 17h ago

20/20 Vision Language Models: A Prescription for Better VLMs through Data Curation Alone

arXiv:2605.11405v1 Announce Type: new Abstract: Data curation has shifted the quality-compute frontier for language-model and contrastive image-text pretraining, but its role for vision-language models (VLMs) is far less established. We ask how far data curation alone can take…

33
arXiv — Machine Learning research 17h ago

A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length

arXiv:2605.11406v1 Announce Type: new Abstract: Existing granular-ball classification methods are often driven by handcrafted quality measures, neighborhood rules, or heuristic splitting and stopping criteria, which may reduce the transparency of local construction decisions and…

6
arXiv — Machine Learning research 17h ago

Generative Diffusion Prior Distillation for Long-Context Knowledge Transfer

arXiv:2605.11414v1 Announce Type: new Abstract: While traditional time-series classifiers assume full sequences at inference, practical constraints (latency and cost) often limit inputs to partial prefixes. The absence of class-discriminative patterns in partial data can…

29
arXiv — Machine Learning research 17h ago

FastUMAP: Scalable Dimensionality Reduction via Bipartite Landmark Sampling

arXiv:2605.11428v1 Announce Type: new Abstract: Exploratory analysis of high-dimensional data rarely stops at a single embedding. In practice, analysts rerun dimensionality reduction after changing preprocessing, subsets, or hyperparameters, and standard nonlinear methods can…

26
arXiv — Machine Learning research 17h ago

Deep Minds and Shallow Probes

arXiv:2605.11448v1 Announce Type: new Abstract: Neural representations are not unique objects. Even when two systems realize the same downstream computation, their hidden coordinates may differ by reparameterization. A probe family intended to reveal structure already present in…

18
arXiv — Machine Learning research 17h ago

Beyond Prediction: Interval Neural Networks for Uncertainty-Aware System Identification

arXiv:2605.11460v1 Announce Type: new Abstract: System identification (SysID) is critical for modeling dynamical systems from experimental data, yet traditional approaches often fail to capture nonlinear behaviors. While deep learning offers powerful tools for modeling such…

20
arXiv — Machine Learning research 17h ago

Drop the Act: Probe-Filtered RL for Faithful Chain-of-Thought Reasoning

arXiv:2605.11467v1 Announce Type: new Abstract: Reasoning models post-hoc rationalize answers they have already committed to internally, producing chains of *reasoning theater*: deliberative-looking steps that contribute nothing to correctness. This wastes inference tokens,…

7
arXiv — Machine Learning research 17h ago

Robust Multi-Agent Path Finding under Observation Attacks: A Principled Adversarial-Plus-Smoothing Training Recipe

arXiv:2605.11469v1 Announce Type: new Abstract: Decentralized multi-agent path finding (MAPF) routes a team of agents on a shared grid, each acting from its own local view. The standard solution trains one shared neural policy with Proximal Policy Optimization (PPO), a popular…

20
arXiv — Machine Learning research 17h ago

On the Approximation Complexity of Matrix Product Operator Born Machines

arXiv:2605.11471v1 Announce Type: new Abstract: Matrix product operator Born machines (MPO-BMs) are tractable tensor-network models for probabilistic modeling, but their efficient approximation capability remains unclear. We characterize this boundary from both negative and…

35
arXiv — Machine Learning research 17h ago

Efficient Adjoint Matching for Fine-tuning Diffusion Models

arXiv:2605.11480v1 Announce Type: new Abstract: Reward fine-tuning has become a common approach for aligning pretrained diffusion and flow models with human preferences in text-to-image generation. Among reward-gradient-based methods, Adjoint Matching (AM) provides a principled…

30
arXiv — Machine Learning research 17h ago

Adaptive Calibration in Non-Stationary Environments

arXiv:2605.11490v1 Announce Type: new Abstract: Making calibrated online predictions is a central challenge in modern AI systems. Much of the existing literature focuses on fully adversarial environments where outcomes may be arbitrary, leading to conservative algorithms that…

9
arXiv — Machine Learning research 17h ago

Understanding and Preventing Entropy Collapse in RLVR with On-Policy Entropy Flow Optimization

arXiv:2605.11491v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has become an effective paradigm for improving the reasoning ability of large language models. However, widely used RLVR algorithms, such as GRPO, often suffer from entropy…

12
arXiv — Machine Learning research 17h ago

CTFusion: A CTF-based Benchmark for LLM Agent Evaluation

arXiv:2605.11504v1 Announce Type: new Abstract: Recent advances in Large Language Models (LLMs) have enabled agentic systems for complex, multi-step tasks; cybersecurity is emerging as a prominent application. To evaluate such agents, researchers widely adopt Capture The Flag…

23
arXiv — Machine Learning research 17h ago

EqOD: Symmetry-Informed Stability Selection for PDE Identification

arXiv:2605.11524v1 Announce Type: new Abstract: Data-driven identification of partial differential equations (PDEs) relies on sparse regression over a candidate library of differential operators, where larger libraries inflate false positives under observation noise and smaller…

26
arXiv — Machine Learning research 1d ago

Reinforcement learning for inverse structural design and rapid laser cutting of kirigami prototypes

arXiv:2605.08098v1 Announce Type: new Abstract: Kirigami is an increasingly useful fabrication method to produce shape-programmable metamaterial structures. However, inverse design remains difficult because deployment…

12
arXiv — Machine Learning research 1d ago

Path-Based Gradient Boosting for Graph-Level Prediction

arXiv:2605.08102v1 Announce Type: new Abstract: We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input…

20
arXiv — Machine Learning research 1d ago

Distributional Reinforcement Learning via the Cram\'er Distance

arXiv:2605.08104v1 Announce Type: new Abstract: This paper explores the application of the Soft Actor-Critic (SAC) algorithm within a Distributional Reinforcement Learning setting and introduces an implementation of…

15
arXiv — Machine Learning research 1d ago

Geometry-free prediction of inertial lift forces in microfluidic devices using deep learning

arXiv:2605.08109v1 Announce Type: new Abstract: Inertial microfluidic devices (IMDs) offer low-cost, high-throughput alternative techniques for many traditional particle- (or cell-) manipulation tasks, but simulating…

19
arXiv — Machine Learning research 1d ago

BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models

arXiv:2605.08110v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has become the standard for fine-tuning large pre-trained models at reduced computational cost. However, its low-rank point-estimate updates…

6
arXiv — Machine Learning research 1d ago

TTCD:Transformer Integrated Temporal Causal Discovery from Non-Stationary Time Series Data

arXiv:2605.08111v1 Announce Type: new Abstract: The widespread availability of complex time series data in various domains such as environmental science, epidemiology, and economics demands robust causal discovery…

35
arXiv — Machine Learning research 1d ago

Do Foundation Model Embeddings Improve Cross-Country Crop Yield Generalisation? A Leave-One-Country-Out Evaluation in Sub-Saharan Africa

arXiv:2605.08113v1 Announce Type: new Abstract: Accurate predictions of smallholder maize yields across national boundaries are critical for food security planning in sub-Saharan Africa, yet most published benchmarks…

17
arXiv — Machine Learning research 1d ago

Statistical Inference and Quality Measures of KV Cache Quantisations Inspired by TurboQuant

arXiv:2605.08114v1 Announce Type: new Abstract: We analyse three KV cache quantization schemes under a fair bit budget: \textbf{KV} (scalar MSE baseline), \textbf{KQV} (WHT + MSE on $K$; WHT + MSE + QJL on $V$), and…

27
arXiv — Machine Learning research 1d ago

The Safety-Aware Denoiser for Text Diffusion Models

arXiv:2605.08116v1 Announce Type: new Abstract: Recent work on text diffusion models offers a promising alternative to autoregressive generation, but controlling their safety remains underexplored. Existing safety…

9
arXiv — Machine Learning research 1d ago

Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking

arXiv:2605.08119v1 Announce Type: new Abstract: Tian (2025) proves a repulsion theorem (Theorem 6) for the matrix $ B = (\widetilde{F}^\top \widetilde{F} + \eta I)^{-1} $ during the interactive feature-learning stage of…

31
arXiv — Machine Learning research 1d ago

Block-Wise Differentiable Sinkhorn Attention: Tail-Refinement Gradients with a Gap-Aware Dustbin Bridge

arXiv:2605.08123v1 Announce Type: new Abstract: We study long-context balanced entropic optimal transport (OT) attention on TPU hardware through a stopped-base, fixed-depth tail-refinement surrogate. After a stopped…

32
arXiv — Machine Learning research 1d ago

Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models

arXiv:2605.08128v1 Announce Type: new Abstract: Gene Regulatory Network (GRN) inference is essential for understanding complex cellular mechanisms, rendered tractable through single-cell transcriptomic data. With the…

19
arXiv — Machine Learning research 1d ago

Towards Customized Multimodal Role-Play

arXiv:2605.08129v1 Announce Type: new Abstract: Unified multimodal understanding and generation models enable richer human-AI interaction. Yet jointly customizing a character's persona, dialogue style, and visual…

26
arXiv — Machine Learning research 1d ago

Additive Atomic Forests for Symbolic Function and Antiderivative Discovery

arXiv:2605.08130v1 Announce Type: new Abstract: We present a framework for the simultaneous symbolic recovery of a function and its antiderivative from data. The framework rests on three ideas. First, a derivative…

27
arXiv — Machine Learning research 1d ago

Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization

arXiv:2605.08131v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) learns a reward function and a corresponding policy that best fit the demonstration data of an expert. However, in the current IRL…

18
arXiv — Machine Learning research 1d ago

DARE: Diffusion Language Model Activation Reuse for Efficient Inference

arXiv:2605.08134v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to auto-regressive (AR) models, offering greater expressive capacity and potential for…

36
arXiv — Machine Learning research 1d ago

Dendritic Neural Networks with Equilibrium Propagation

arXiv:2605.08135v1 Announce Type: new Abstract: Equilibrium propagation (EP) is a biologically plausible alternative to backpropagation (BP), but its effectiveness can degrade in deeper and more challenging learning…

26
arXiv — Machine Learning research 1d ago

Weight Pruning Amplifies Bias: A Multi-Method Study of Compressed LLMs for Edge AI

arXiv:2605.08137v1 Announce Type: new Abstract: Weight pruning is widely advocated for deploying Large Language Models on resource-constrained IoT and edge devices, yet its impact on model fairness remains poorly…

6
arXiv — Machine Learning research 1d ago

DataArc-SynData-Toolkit: A Unified Closed-Loop Framework for Multi-Path, Multimodal, and Multilingual Data Synthesis

arXiv:2605.08138v1 Announce Type: new Abstract: Synthetic data has emerged as a crucial solution to the data scarcity bottleneck in large language models (LLMs), particularly for specialized domains and low-resource…

10
arXiv — Machine Learning research 1d ago

Reasoning emerges from constrained inference manifolds in large language models

arXiv:2605.08142v1 Announce Type: new Abstract: Reasoning in large language models is predominantly evaluated through labeled benchmarks, conflating task performance with the quality of internal inference. Here we study…

15

Interpretable EEG Microstate Discovery via Variational Deep Embedding: A Systematic Architecture Search with Multi-Quadrant Evaluation

QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization

Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models

Rotation-Preserving Supervised Fine-Tuning

Vertex-Softmax: Tight Transformer Verification via Exact Softmax Optimization

Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation

LEAP: Unlocking dLLM Parallelism via Lookahead Early-Convergence Token Detection

$\xi$-DPO: Direct Preference Optimization via Ratio Reward Margin

TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment

Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning

AESOP: Adversarial Execution-path Selection to Overload Deep Learning Pipelines

Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation

SURGE: Surrogate Gradient Adaptation in Binary Neural Networks

Test-Time Personalization: A Diagnostic Framework and Probabilistic Fix for Scaling Failures

SkillGen: Verified Inference-Time Agent Skill Synthesis

Finite Volume-Informed Neural Network Framework for 2D Shallow Water Equations: Rugged Loss Landscapes and the Importance of Data Guidance

DisagMoE: Computation-Communication overlapped MoE Training via Disaggregated AF-Pipe Parallelism

RT-Transformer: The Transformer Block as a Spherical State Estimator

When and How to Canonize: A Generalization Perspective

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network

A Comparative Study of Federated Learning Aggregation Strategies under Homogeneous and Heterogeneous Data Distributions

LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models

Backbone-Equated Diffusion OOD via Sparse Internal Snapshots

Simpson's Paradox in Behavioral Curves: How Aggregation Distorts Parametric Models of User Dynamics

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness

Trust Region Inverse Reinforcement Learning: Explicit Dual Ascent using Local Policy Updates

A Switching System Theory of Q-Learning with Linear Function Approximation

ASD-Bench: A Four-Axis Comprehensive Benchmark of AI Models for Autism Spectrum Disorder

Enabling Performant and Flexible Model-Internal Observability for LLM Inference

Newton's Lantern: A Reinforcement Learning Framework for Finetuning AC Power Flow Warm Start Models

GRAFT-ATHENA: Self-Improving Agentic Teams for Autonomous Discovery and Evolutionary Numerical Algorithms

Language Modeling with Hyperspherical Flows

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

Steerable Neural ODEs on Homogeneous Spaces

Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training

Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models

CORE: Cyclic Orthotope Relation Embedding for Knowledge Graph Completion

Interpretability Can Be Actionable

COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication

Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data

Optimistic Dual Averaging Unifies Modern Optimizers

Oversmoothing as Representation Degeneracy in Neural Sheaf Diffusion

Muon is Not That Special: Random or Inverted Spectra Work Just as Well

CATS: Cascaded Adaptive Tree Speculation for Memory-Limited LLM Inference Acceleration

Deep Learning for Protein Complex Prediction and Design

Variational Linear Attention: Stable Associative Memory for Long-Context Transformers

FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry

The Scaling Law of Evaluation Failure: Why Simple Averaging Collapses Under Data Sparsity and Item Difficulty Gaps, and How Item Response Theory Recovers Ground Truth Across Domains

Measuring Five-Nines Reliability: Sample-Efficient LLM Evaluation in Saturated Benchmarks

Enforcing Constraints in Generative Sampling via Adaptive Correction Scheduling

Leveraging RAG for Training-Free Alignment of LLMs

ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models

LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection

A Comparative Study of Model Selection Criteria for Symbolic Regression

Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning

DeconDTN-Toolkit: A Library for Evaluation and Enhancement of Robustness to Provenance Shift

Extending Kernel Trick to Influence Functions

Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization

A Proof-of-Concept Simulation-Driven Digital Twin Framework for Decision-Aware Diabetes Modeling

Curriculum Learning-Guided Progressive Distillation in Large Language Models

Latent Chain-of-Thought Improves Structured-Data Transformers

Localization Boosting for Growth Markets: Mitigating Cross-Locale Behavioral Bias in Learning-to-Rank

Beyond Similarity: Temporal Operator Attention for Time Series Analysis

Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning

Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets

Primal Generation, Dual Judgment: Self-Training from Test-Time Scaling

A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse

Couple to Control: Joint Initial Noise Design in Diffusion Models

Error whitening: Why Gauss-Newton outperforms Newton

$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search

Neural Statistical Functions

Epistemic Uncertainty for Test-Time Discovery

Physics-Informed Teacher-Student Ensemble Learning for Traffic State Estimation with a Varying Speed Limit Scenario

Gradient-Free Noise Optimization for Reward Alignment in Generative Models

gym-invmgmt: An Open Benchmarking Framework for Inventory Management Methods

The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives

Causal Fairness for Survival Analysis

LPDP: Inference-Time Reward Control for Variable-Length DNA Generation with Edit Flows

TRACE: Temporal Routing with Autoregressive Cross-channel Experts for EEG Representation Learning

Behavioral Mode Discovery for Fine-tuning Multimodal Generative Policies