arXiv — Machine Learning
116 articles archived · Visit source ↗ · RSS
-
arXiv — Machine Learning research 17h ago
QuIDE: Mastering the Quantized Intelligence Trade-off via Active Optimization
arXiv:2605.10959v1 Announce Type: new Abstract: There is currently no unified metric for evaluating the efficiency of quantized neural networks. We propose QuIDE, built around the Intelligence Index I = (C x P)/log_2(T+1), which collapses the compression-accuracy-latency…
22 -
arXiv — Machine Learning research 17h ago
Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models
arXiv:2605.10971v1 Announce Type: new Abstract: Discrete diffusion language models (DLMs) generate text by iteratively denoising all positions in parallel, offering an alternative to autoregressive models. Controlled generation methods for DLMs, imported from autoregressive…
4 -
arXiv — Machine Learning research 17h ago
Rotation-Preserving Supervised Fine-Tuning
arXiv:2605.10973v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) improves in-domain performance but can degrade out-of-domain (OOD) generalization. Prior work suggests that this degradation is related to changes in dominant singular subspaces of pretrained weight…
22 -
arXiv — Machine Learning research 17h ago
Vertex-Softmax: Tight Transformer Verification via Exact Softmax Optimization
arXiv:2605.10974v1 Announce Type: new Abstract: Certified verification of transformer attention requires bounding the softmax function over interval constraints on the pre-softmax scores. Existing verifiers relax softmax ndependently of the downstream objective, leaving…
26 -
arXiv — Machine Learning research 17h ago
Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation
arXiv:2605.10975v1 Announce Type: new Abstract: Graphs with heterophily, where adjacent nodes carry different labels, are prevalent in real-world applications, from social networks to molecular interactions. However, existing spectral Graph Neural Network (GNN) approaches…
24 -
arXiv — Machine Learning research 17h ago
LEAP: Unlocking dLLM Parallelism via Lookahead Early-Convergence Token Detection
arXiv:2605.10980v1 Announce Type: new Abstract: Diffusion Language Models (dLLMs) have garnered significant attention for their potential in highly parallel processing. The parallel capabilities of existing dLLMs stem from the assumption of conditional independence at high…
35 -
arXiv — Machine Learning research 17h ago
$\xi$-DPO: Direct Preference Optimization via Ratio Reward Margin
arXiv:2605.10981v1 Announce Type: new Abstract: Reference-free preference optimization has emerged as an efficient alternative to reinforcement learning from human feedback, with Simple Preference Optimization(SimPO) demonstrating strong performance by eliminating the explicit…
23 -
arXiv — Machine Learning research 17h ago
TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment
arXiv:2605.10983v1 Announce Type: new Abstract: Reinforcement learning (RL) has shown extraordinary potential in aligning diffusion models to downstream tasks, yet most of them still suffer from significant reward hacking, which degrades generative diversity and quality by…
10 -
arXiv — Machine Learning research 17h ago
Structural Interpretations of Protein Language Model Representations via Differentiable Graph Partitioning
arXiv:2605.10985v1 Announce Type: new Abstract: Protein language models such as ESM-2 learn rich residue representations that achieve strong performance on protein function prediction, but their features remain difficult to interpret as structural $\&$ evolutionary signals are…
17 -
arXiv — Machine Learning research 17h ago
AESOP: Adversarial Execution-path Selection to Overload Deep Learning Pipelines
arXiv:2605.10987v1 Announce Type: new Abstract: Modern machine learning deployments increasingly compose specialized models into dynamic inference pipelines, where upstream components produce intermediate predictions that determine the workload and inputs of downstream…
21 -
arXiv — Machine Learning research 17h ago
Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation
arXiv:2605.10988v1 Announce Type: new Abstract: Log anomaly detection is a critical task for system operations and security assurance. However, in networked systems at scale, log data are generated at massive scale while instance-level annotations are prohibitively expensive,…
29 -
arXiv — Machine Learning research 17h ago
SURGE: Surrogate Gradient Adaptation in Binary Neural Networks
arXiv:2605.10989v1 Announce Type: new Abstract: The training of Binary Neural Networks (BNNs) is fundamentally based on gradient approximation for non-differentiable binarization operations (e.g., sign function). However, prevailing methods including the Straight-Through…
11 -
arXiv — Machine Learning research 17h ago
Test-Time Personalization: A Diagnostic Framework and Probabilistic Fix for Scaling Failures
arXiv:2605.10991v1 Announce Type: new Abstract: Existing approaches to LLM personalization focus on constructing better personalized models or inputs, while treating inference as a single-shot process. In this work, we study Test-Time Personalization (TTP) along an unexplored…
11 -
arXiv — Machine Learning research 17h ago
SkillGen: Verified Inference-Time Agent Skill Synthesis
arXiv:2605.10999v1 Announce Type: new Abstract: Skills are a promising way to improve LLM agent capabilities without retraining, while keeping the added procedure reusable and controllable. However, high-quality skills are still largely written by hand. We introduce SkillGen, a…
33 -
arXiv — Machine Learning research 17h ago
DisagMoE: Computation-Communication overlapped MoE Training via Disaggregated AF-Pipe Parallelism
arXiv:2605.11005v1 Announce Type: new Abstract: Mixture-of-experts (MoE) architectures enable trillion-parameter LLMs with sparsely activated experts. Expert parallelism (EP) is a widely adopted MoE training strategy, but it suffers from severe all-to-all communication…
25 -
arXiv — Machine Learning research 17h ago
RT-Transformer: The Transformer Block as a Spherical State Estimator
arXiv:2605.11007v1 Announce Type: new Abstract: We show that the core components of the Transformer block -- attention, residual connections, and normalization -- arise naturally from a single geometric estimation problem. Modeling the latent state as a direction on the…
19 -
arXiv — Machine Learning research 17h ago
When and How to Canonize: A Generalization Perspective
arXiv:2605.11008v1 Announce Type: new Abstract: While invariant architectures are standard for processing symmetric data, there is growing interest in achieving invariance by applying group averaging or canonization to non-invariant backbones. However, the theoretical…
12 -
arXiv — Machine Learning research 17h ago
ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network
arXiv:2605.11009v1 Announce Type: new Abstract: Long-horizon, sparse-reward tasks pose a fundamental challenge for reinforcement learning, since single-step TD learning suffers from bootstrapping error accumulation across successive Bellman updates. Actor-critic methods with…
34 -
arXiv — Machine Learning research 17h ago
A Comparative Study of Federated Learning Aggregation Strategies under Homogeneous and Heterogeneous Data Distributions
arXiv:2605.11010v1 Announce Type: new Abstract: Federated Learning has emerged as a transformative paradigm for collaborative machine learning across distributed environments. However, its performance is strongly influenced by the aggregation strategy used to combine local model…
17 -
arXiv — Machine Learning research 17h ago
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models
arXiv:2605.11011v1 Announce Type: new Abstract: Looped computation shows promise in improving the reasoning-oriented performance of LLMs by scaling test-time compute. However, existing approaches typically require either training recurrent models from scratch or applying…
37 -
arXiv — Machine Learning research 17h ago
Backbone-Equated Diffusion OOD via Sparse Internal Snapshots
arXiv:2605.11014v1 Announce Type: new Abstract: Fair comparison between diffusion-based OOD detectors is challenging, as conclusions can vary with backbone choice, corruption parameterization, and test-time budget. We address this issue through a Mutualized Backbone-Equated…
30 -
arXiv — Machine Learning research 17h ago
Simpson's Paradox in Behavioral Curves: How Aggregation Distorts Parametric Models of User Dynamics
arXiv:2605.11017v1 Announce Type: new Abstract: Behavioral curve modeling -- fitting parametric functions to engagement-versus-exposure data -- is standard practice in recommendation, advertising, and clinical dosing. We show that aggregation introduces a systematic distortion:…
13 -
arXiv — Machine Learning research 17h ago
Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness
arXiv:2605.11019v1 Announce Type: new Abstract: Although large language models rely on chain-of-thought for complex reasoning, the overthinking phenomenon severely degrades inference efficiency. Existing reinforcement learning methods compress reasoning chains by designing…
23 -
arXiv — Machine Learning research 17h ago
Trust Region Inverse Reinforcement Learning: Explicit Dual Ascent using Local Policy Updates
arXiv:2605.11020v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) is typically formulated as maximizing entropy subject to matching the distribution of expert trajectories. Classical (dual-ascent) IRL guarantees monotonic performance improvement but requires…
14 -
arXiv — Machine Learning research 17h ago
A Switching System Theory of Q-Learning with Linear Function Approximation
arXiv:2605.11021v1 Announce Type: new Abstract: This paper develops a switching-system interpretation of Q-learning with linear function approximation (LFA) based on the joint spectral radius (JSR). We derive an exact linear switched model for the mean dynamics and relate…
11 -
arXiv — Machine Learning research 17h ago
ASD-Bench: A Four-Axis Comprehensive Benchmark of AI Models for Autism Spectrum Disorder
arXiv:2605.11091v1 Announce Type: new Abstract: Automated ASD screening tools remain limited by single-architecture evaluations, axis-restricted assessment, and near-exclusive focus on adult cohorts, obscuring age-specific diagnostic patterns critical for early intervention. We…
4 -
arXiv — Machine Learning research 17h ago
Enabling Performant and Flexible Model-Internal Observability for LLM Inference
arXiv:2605.11093v1 Announce Type: new Abstract: Today's inference-time workloads increasingly depend on timely access to a model's internal states. We present DMI-Lib, a high-speed deep model inspector that treats internal observability as a first-class systems primitive,…
18 -
arXiv — Machine Learning research 17h ago
Newton's Lantern: A Reinforcement Learning Framework for Finetuning AC Power Flow Warm Start Models
arXiv:2605.11102v1 Announce Type: new Abstract: Neural warm starts can sharply reduce the number of Newton-Raphson iterations required to solve the AC power flow problem, but existing supervised approaches generalize poorly on heavily loaded instances near voltage collapse. We…
11 -
arXiv — Machine Learning research 17h ago
GRAFT-ATHENA: Self-Improving Agentic Teams for Autonomous Discovery and Evolutionary Numerical Algorithms
arXiv:2605.11117v1 Announce Type: new Abstract: Scientific discovery can be modeled as a sequence of probabilistic decisions that map physical problems to numerical solutions. Recent agentic AI systems automate individual scientific tasks by orchestrating LLM-driven planners,…
22 -
arXiv — Machine Learning research 17h ago
Language Modeling with Hyperspherical Flows
arXiv:2605.11125v1 Announce Type: new Abstract: Discrete Diffusion Language Models progressed rapidly as an alternative to autoregressive (AR) models, motivated by their parallel generation abilities. However, for tractability, discrete diffusion models sample from a factorized…
17 -
arXiv — Machine Learning research 17h ago
HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series
arXiv:2605.11130v1 Announce Type: new Abstract: Critical events in multivariate time series, from turbine failures to cardiac arrhythmias, demand accurate prediction, yet labeled data is scarce because such events are rare and costly to annotate. We introduce HEPA…
16 -
arXiv — Machine Learning research 17h ago
Steerable Neural ODEs on Homogeneous Spaces
arXiv:2605.11133v1 Announce Type: new Abstract: We introduce steerable neural ordinary differential equations on homogeneous spaces $M=G/H$. These models constitute a novel geometric extension of manifold neural ordinary differential equations (NODEs) that transport associated…
33 -
arXiv — Machine Learning research 17h ago
Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training
arXiv:2605.11134v1 Announce Type: new Abstract: Preference learning methods such as Direct Preference Optimization (DPO) are known to induce reliance on spurious correlations, leading to sycophancy and length bias in today's language models and potentially severe goal…
13 -
arXiv — Machine Learning research 17h ago
Rank Is Not Capacity: Spectral Occupancy for Latent Graph Models
arXiv:2605.11142v1 Announce Type: new Abstract: Graph representation learning has become a standard approach for analyzing networked data, with latent embeddings widely used for link prediction, community detection, and related tasks. Yet a basic design choice, the latent…
36 -
arXiv — Machine Learning research 17h ago
CORE: Cyclic Orthotope Relation Embedding for Knowledge Graph Completion
arXiv:2605.11159v1 Announce Type: new Abstract: Knowledge graph completion (KGC) aims to automatically infer missing facts in multi-relational data by mapping entities and relations into continuous representation spaces. Recent region-based embedding models have shown great…
16 -
arXiv — Machine Learning research 17h ago
Interpretability Can Be Actionable
arXiv:2605.11161v1 Announce Type: new Abstract: Interpretability aims to explain the behavior of deep neural networks. Despite rapid growth, there is mounting concern that much of this work has not translated into practical impact, raising questions about its relevance and…
37 -
arXiv — Machine Learning research 17h ago
COSMOS: Model-Agnostic Personalized Federated Learning with Clustered Server Models and Pseudo-Label-Only Communication
arXiv:2605.11165v1 Announce Type: new Abstract: Federated learning (FL) in heterogeneous environments remains challenging because client models often differ in both architecture and data distribution. While recent approaches attempt to address this challenge through client…
36 -
arXiv — Machine Learning research 17h ago
Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data
arXiv:2605.11170v1 Announce Type: new Abstract: Noise-based certified machine unlearning currently faces a hard ceiling: the noise magnitude required to certify unlearning typically destroys model utility, particularly for large-scale deletion requests. While leveraging public…
12 -
arXiv — Machine Learning research 17h ago
Optimistic Dual Averaging Unifies Modern Optimizers
arXiv:2605.11172v1 Announce Type: new Abstract: We introduce SODA, a generalization of Optimistic Dual Averaging, which provides a common perspective on state-of-the-art optimizers like Muon, Lion, AdEMAMix and NAdam, showing that they can all be viewed as optimistic instances…
31 -
arXiv — Machine Learning research 17h ago
Oversmoothing as Representation Degeneracy in Neural Sheaf Diffusion
arXiv:2605.11178v1 Announce Type: new Abstract: Neural Sheaf Diffusion (NSD) generalizes diffusion-based Graph Neural Networks by replacing scalar graph Laplacians with sheaf Laplacians whose learned restriction maps define a task-adapted geometry. While the diffusion limit of…
25 -
arXiv — Machine Learning research 17h ago
Muon is Not That Special: Random or Inverted Spectra Work Just as Well
arXiv:2605.11181v1 Announce Type: new Abstract: The recent empirical success of the Muon optimizer has renewed interest in non-Euclidean optimization, typically justified by similarities with second-order methods, and linear minimization oracle (LMO) theory. In this paper, we…
8 -
arXiv — Machine Learning research 17h ago
CATS: Cascaded Adaptive Tree Speculation for Memory-Limited LLM Inference Acceleration
arXiv:2605.11186v1 Announce Type: new Abstract: Auto-regressive decoding in Large Language Models (LLMs) is inherently memory-bound: every generation step requires loading the model weights and intermediate results from memory (e.g., High-Bandwidth Memory (HBM) for GPU servers),…
19 -
arXiv — Machine Learning research 17h ago
Deep Learning for Protein Complex Prediction and Design
arXiv:2605.11189v1 Announce Type: new Abstract: Accurately modeling and designing protein complex structures is a central problem in computational structural biology, with broad implications for understanding cellular function and developing therapeutics. This thesis…
16 -
arXiv — Machine Learning research 17h ago
Variational Linear Attention: Stable Associative Memory for Long-Context Transformers
arXiv:2605.11196v1 Announce Type: new Abstract: Linear attention reduces the quadratic cost of softmax attention to $\mathcal{O}(T)$, but its memory state grows as $\mathcal{O}(T)$ in Frobenius norm, causing progressive interference between stored associations. We introduce…
13 -
arXiv — Machine Learning research 17h ago
FeatMap: Understanding image manipulation in the feature space and its implications for feature space geometry
arXiv:2605.11203v1 Announce Type: new Abstract: Intermediate feature representations represent the backbone for the expressivity and adaptability of deep neural networks. However, their geometric structure remains poorly understood. In this submission, we provide indirect…
20 -
arXiv — Machine Learning research 17h ago
Measuring Five-Nines Reliability: Sample-Efficient LLM Evaluation in Saturated Benchmarks
arXiv:2605.11209v1 Announce Type: new Abstract: While existing benchmarks demonstrate the near-perfect performance of large language models (LLMs) on various tasks, this apparent saturation often obscures the need for rigorous evaluation of their reliability. In real-world…
36 -
arXiv — Machine Learning research 17h ago
Enforcing Constraints in Generative Sampling via Adaptive Correction Scheduling
arXiv:2605.11214v1 Announce Type: new Abstract: Hard constraints in generative sampling are typically enforced by projection, applied either once at the end of sampling or after every update. This binary framing overlooks a fundamental issue: projection changes the distribution…
17 -
arXiv — Machine Learning research 17h ago
Leveraging RAG for Training-Free Alignment of LLMs
arXiv:2605.11217v1 Announce Type: new Abstract: Large language model (LLM) alignment algorithms typically consist of post-training over preference pairs. While such algorithms are widely used to enable safety guardrails and align LLMs with general human preferences, we show that…
36 -
arXiv — Machine Learning research 17h ago
ADMM-Q: An Improved Hessian-based Weight Quantizer for Post-Training Quantization of Large Language Models
arXiv:2605.11222v1 Announce Type: new Abstract: Quantization is an effective strategy to reduce the storage and computation footprint of large language models (LLMs). Post-training quantization (PTQ) is a leading approach for compressing LLMs. Popular weight quantization…
5 -
arXiv — Machine Learning research 17h ago
LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection
arXiv:2605.11231v1 Announce Type: new Abstract: Synthetic data is useful only when the added samples fill missing parts of the training distribution that matter for the downstream task. We introduce LiBaGS, a lightweight, generator-agnostic method for targeted synthetic training…
30 -
arXiv — Machine Learning research 17h ago
A Comparative Study of Model Selection Criteria for Symbolic Regression
arXiv:2605.11233v1 Announce Type: new Abstract: Effective model selection is critical in symbolic regression (SR) to identify mathematical expressions that balance accuracy and complexity, and have low expected error on unseen data. Many modern implementations of genetic…
38 -
arXiv — Machine Learning research 17h ago
Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning
arXiv:2605.11235v1 Announce Type: new Abstract: In LLM Reinforcement Fine-Tuning (RFT), curriculum learning drives both efficiency and performance. Yet, current methods externalize curriculum judgment via handcrafted heuristics or auxiliary models, risking misalignment with the…
18 -
arXiv — Machine Learning research 17h ago
DeconDTN-Toolkit: A Library for Evaluation and Enhancement of Robustness to Provenance Shift
arXiv:2605.11237v1 Announce Type: new Abstract: Despite the burgeoning body of work on distribution shifts, provenance shift-where the relationship between data source and label changes at deployment-remains poorly understood and under-addressed. In this paper, we establish a…
13 -
arXiv — Machine Learning research 17h ago
Extending Kernel Trick to Influence Functions
arXiv:2605.11239v1 Announce Type: new Abstract: In this paper, we present a dual representation of the influence functions, whose computational complexity scales with dataset size rather than model size. Both analytically and experimentally, we show that this representation can…
7 -
arXiv — Machine Learning research 17h ago
Support-Proximity Augmented Diffusion Estimation for Offline Black-Box Optimization
arXiv:2605.11246v1 Announce Type: new Abstract: Offline black-box optimization aims to discover novel designs with high property scores using only a static dataset, a task fundamentally challenged by the out-of-distribution (OOD) extrapolation problem. Existing approaches…
13 -
arXiv — Machine Learning research 17h ago
A Proof-of-Concept Simulation-Driven Digital Twin Framework for Decision-Aware Diabetes Modeling
arXiv:2605.11247v1 Announce Type: new Abstract: This paper presents a proof-of-concept digital twin framework for simulation-driven diabetes modeling using benchmark clinical data, synthetic temporal augmentation, and illustrative continuous glucose monitoring (CGM) analysis.…
27 -
arXiv — Machine Learning research 17h ago
Curriculum Learning-Guided Progressive Distillation in Large Language Models
arXiv:2605.11260v1 Announce Type: new Abstract: Knowledge distillation is a key technique for transferring the capabilities of large language models (LLMs) into smaller, more efficient student models. Existing distillation approaches often overlook two critical factors: the…
26 -
arXiv — Machine Learning research 17h ago
Latent Chain-of-Thought Improves Structured-Data Transformers
arXiv:2605.11262v1 Announce Type: new Abstract: Chain-of-thought and more broadly test-time compute are known to augment the expressive capabilities of language models and have led to major innovations in reasoning. Motivated by this success, this paper explores latent…
24 -
arXiv — Machine Learning research 17h ago
Localization Boosting for Growth Markets: Mitigating Cross-Locale Behavioral Bias in Learning-to-Rank
arXiv:2605.11272v1 Announce Type: new Abstract: Adobe Express is expanding internationally, but the US has a disproportionately large content supply and interaction volume. Learning-to-rank (LTR) models trained primarily on behavioral feedback inherit this imbalance: templates…
20 -
arXiv — Machine Learning research 17h ago
Beyond Similarity: Temporal Operator Attention for Time Series Analysis
arXiv:2605.11287v1 Announce Type: new Abstract: A persistent paradox in time-series forecasting is that structurally simple MLP and linear models often outperform high-capacity Transformers. We argue that this gap arises from a mismatch in the sequence-modeling primitive: while…
18 -
arXiv — Machine Learning research 17h ago
Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning
arXiv:2605.11289v1 Announce Type: new Abstract: Average-reward reinforcement learning requires estimating the gain and the bias, which is defined only up to an additive constant. This makes direct distributional analogues ill-posed on the real line. We introduce a quotient-space…
27 -
arXiv — Machine Learning research 17h ago
Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets
arXiv:2605.11291v1 Announce Type: new Abstract: In this paper, we provide a computable characterization of the geometry of optimal representations in Contrastive Learning (CL) when the classes are imbalanced. When classes are balanced and the representation dimension is greater…
27 -
arXiv — Machine Learning research 17h ago
Primal Generation, Dual Judgment: Self-Training from Test-Time Scaling
arXiv:2605.11299v1 Announce Type: new Abstract: Code generation is typically trained in the primal space of programs: a model produces a candidate solution and receives sparse execution feedback, often a single pass/fail bit. Test-time scaling enriches the inference procedure by…
32 -
arXiv — Machine Learning research 17h ago
A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse
arXiv:2605.11302v1 Announce Type: new Abstract: We study language generation in the limit under a global preference ordering on strings, as introduced by Kleinberg and Wei. As in [arXiv:2504.14370, arXiv:2511.05295], we aim for \emph{breadth}, but impose an additional…
20 -
arXiv — Machine Learning research 17h ago
Couple to Control: Joint Initial Noise Design in Diffusion Models
arXiv:2605.11311v1 Announce Type: new Abstract: Diffusion models typically generate image batches from independent Gaussian initial noises. We argue that this independence assumption is only one choice within a broader class of valid joint noise designs. Instead, one can specify…
11 -
arXiv — Machine Learning research 17h ago
Error whitening: Why Gauss-Newton outperforms Newton
arXiv:2605.11316v1 Announce Type: new Abstract: The Gauss-Newton matrix is widely viewed as a positive semidefinite approximation of the Hessian, yet mounting empirical evidence shows that Gauss-Newton descent outperforms Newton's method. We adopt a function space perspective to…
5 -
arXiv — Machine Learning research 17h ago
$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search
arXiv:2605.11324v1 Announce Type: new Abstract: We study the fixed-budget max-min action identification problem in depth-2 max-min trees, an important special case of Monte Carlo Tree Search. A learner sequentially allocates $T$ samples to leaves and then recommends a subtree…
17 -
arXiv — Machine Learning research 17h ago
Neural Statistical Functions
arXiv:2605.11327v1 Announce Type: new Abstract: Classical deep learning typically operates on individual cases. Despite its success, real-world usage often requires repeated inference to estimate statistical quantities for complex decision-making tasks involving uncertainty or…
24 -
arXiv — Machine Learning research 17h ago
Epistemic Uncertainty for Test-Time Discovery
arXiv:2605.11328v1 Announce Type: new Abstract: Automated scientific discovery using large language models relies on identifying genuinely novel solutions. Standard reinforcement learning penalizes high-variance mutations, which leads the policy to prioritize familiar patterns.…
31 -
arXiv — Machine Learning research 17h ago
Physics-Informed Teacher-Student Ensemble Learning for Traffic State Estimation with a Varying Speed Limit Scenario
arXiv:2605.11346v1 Announce Type: new Abstract: Physics-informed deep learning (PIDL) neural networks have shown their capability as a useful instrument for transportation practitioners in utilizing the underlying relationship between the state variables for traffic state…
11 -
arXiv — Machine Learning research 17h ago
Gradient-Free Noise Optimization for Reward Alignment in Generative Models
arXiv:2605.11347v1 Announce Type: new Abstract: Existing reward alignment methods for diffusion and flow models rely on multi-step stochastic trajectories, making them difficult to extend to deterministic generators. A natural alternative is noise-space optimization, but…
38 -
arXiv — Machine Learning research 17h ago
gym-invmgmt: An Open Benchmarking Framework for Inventory Management Methods
arXiv:2605.11355v1 Announce Type: new Abstract: Inventory-policy comparisons are often difficult to interpret because performance depends on the evaluation contract as much as on the policy itself. Differences in topology, demand regime, information access, feasibility…
32 -
arXiv — Machine Learning research 17h ago
The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives
arXiv:2605.11361v1 Announce Type: new Abstract: Inference-time reward alignment asks how to turn a pre-trained diffusion model with base law $p$ into a sampler that favors a reward $r$ while remaining close to $p$. Since there is no canonical distributional distance for this…
27 -
arXiv — Machine Learning research 17h ago
Causal Fairness for Survival Analysis
arXiv:2605.11362v1 Announce Type: new Abstract: In the data-driven era, large-scale datasets are routinely collected and analyzed using machine learning (ML) and artificial intelligence (AI) to inform decisions in high-stakes domains such as healthcare, employment, and criminal…
31 -
arXiv — Machine Learning research 17h ago
LPDP: Inference-Time Reward Control for Variable-Length DNA Generation with Edit Flows
arXiv:2605.11368v1 Announce Type: new Abstract: We study the application of recent Edit Flows for inference-time reward control for DNA sequence generation. Unlike most reward-guided DNA generation frameworks, which operate on fixed-length sequence spaces, Edit Flows have a…
6 -
arXiv — Machine Learning research 17h ago
TRACE: Temporal Routing with Autoregressive Cross-channel Experts for EEG Representation Learning
arXiv:2605.11380v1 Announce Type: new Abstract: Learning transferable representations for electroencephalography (EEG) remains challenging because EEG signals are inherently multi-channel and non-stationary. Channels observed at the same time provide coupled measurements of…
25 -
arXiv — Machine Learning research 17h ago
Behavioral Mode Discovery for Fine-tuning Multimodal Generative Policies
arXiv:2605.11387v1 Announce Type: new Abstract: We address the problem of fine-tuning pre-trained generative policies with reinforcement learning (RL) while preserving the multimodality of their action distributions. Existing methods for RL fine-tuning of generative policies…
17 -
arXiv — Machine Learning research 17h ago
MuonQ: Enhancing Low-Bit Muon Quantization via Directional Fidelity Optimization
arXiv:2605.11396v1 Announce Type: new Abstract: The Muon optimizer has emerged as a compelling alternative to Adam for training large language models, achieving remarkable computational savings through gradient orthogonalization. However, Muon's optimizer state is more sensitive…
21 -
arXiv — Machine Learning research 17h ago
More Than Meets the Eye: A Semantics-Aware Traffic Augmentation Framework for Generalizable Website Fingerprinting
arXiv:2605.11402v1 Announce Type: new Abstract: Deep learning-based website fingerprinting has emerged as an effective technique for inferring the websites users visit. Although existing methods achieve strong performance on closed-world datasets, they often fail to generalize…
23 -
arXiv — Machine Learning research 17h ago
20/20 Vision Language Models: A Prescription for Better VLMs through Data Curation Alone
arXiv:2605.11405v1 Announce Type: new Abstract: Data curation has shifted the quality-compute frontier for language-model and contrastive image-text pretraining, but its role for vision-language models (VLMs) is far less established. We ask how far data curation alone can take…
33 -
arXiv — Machine Learning research 17h ago
A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length
arXiv:2605.11406v1 Announce Type: new Abstract: Existing granular-ball classification methods are often driven by handcrafted quality measures, neighborhood rules, or heuristic splitting and stopping criteria, which may reduce the transparency of local construction decisions and…
6 -
arXiv — Machine Learning research 17h ago
Generative Diffusion Prior Distillation for Long-Context Knowledge Transfer
arXiv:2605.11414v1 Announce Type: new Abstract: While traditional time-series classifiers assume full sequences at inference, practical constraints (latency and cost) often limit inputs to partial prefixes. The absence of class-discriminative patterns in partial data can…
29 -
arXiv — Machine Learning research 17h ago
FastUMAP: Scalable Dimensionality Reduction via Bipartite Landmark Sampling
arXiv:2605.11428v1 Announce Type: new Abstract: Exploratory analysis of high-dimensional data rarely stops at a single embedding. In practice, analysts rerun dimensionality reduction after changing preprocessing, subsets, or hyperparameters, and standard nonlinear methods can…
26 -
arXiv — Machine Learning research 17h ago
Deep Minds and Shallow Probes
arXiv:2605.11448v1 Announce Type: new Abstract: Neural representations are not unique objects. Even when two systems realize the same downstream computation, their hidden coordinates may differ by reparameterization. A probe family intended to reveal structure already present in…
18 -
arXiv — Machine Learning research 17h ago
Beyond Prediction: Interval Neural Networks for Uncertainty-Aware System Identification
arXiv:2605.11460v1 Announce Type: new Abstract: System identification (SysID) is critical for modeling dynamical systems from experimental data, yet traditional approaches often fail to capture nonlinear behaviors. While deep learning offers powerful tools for modeling such…
20 -
arXiv — Machine Learning research 17h ago
Drop the Act: Probe-Filtered RL for Faithful Chain-of-Thought Reasoning
arXiv:2605.11467v1 Announce Type: new Abstract: Reasoning models post-hoc rationalize answers they have already committed to internally, producing chains of *reasoning theater*: deliberative-looking steps that contribute nothing to correctness. This wastes inference tokens,…
7 -
arXiv — Machine Learning research 17h ago
Robust Multi-Agent Path Finding under Observation Attacks: A Principled Adversarial-Plus-Smoothing Training Recipe
arXiv:2605.11469v1 Announce Type: new Abstract: Decentralized multi-agent path finding (MAPF) routes a team of agents on a shared grid, each acting from its own local view. The standard solution trains one shared neural policy with Proximal Policy Optimization (PPO), a popular…
20 -
arXiv — Machine Learning research 17h ago
On the Approximation Complexity of Matrix Product Operator Born Machines
arXiv:2605.11471v1 Announce Type: new Abstract: Matrix product operator Born machines (MPO-BMs) are tractable tensor-network models for probabilistic modeling, but their efficient approximation capability remains unclear. We characterize this boundary from both negative and…
35 -
arXiv — Machine Learning research 17h ago
Efficient Adjoint Matching for Fine-tuning Diffusion Models
arXiv:2605.11480v1 Announce Type: new Abstract: Reward fine-tuning has become a common approach for aligning pretrained diffusion and flow models with human preferences in text-to-image generation. Among reward-gradient-based methods, Adjoint Matching (AM) provides a principled…
30 -
arXiv — Machine Learning research 17h ago
Adaptive Calibration in Non-Stationary Environments
arXiv:2605.11490v1 Announce Type: new Abstract: Making calibrated online predictions is a central challenge in modern AI systems. Much of the existing literature focuses on fully adversarial environments where outcomes may be arbitrary, leading to conservative algorithms that…
9 -
arXiv — Machine Learning research 17h ago
Understanding and Preventing Entropy Collapse in RLVR with On-Policy Entropy Flow Optimization
arXiv:2605.11491v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has become an effective paradigm for improving the reasoning ability of large language models. However, widely used RLVR algorithms, such as GRPO, often suffer from entropy…
12 -
arXiv — Machine Learning research 17h ago
CTFusion: A CTF-based Benchmark for LLM Agent Evaluation
arXiv:2605.11504v1 Announce Type: new Abstract: Recent advances in Large Language Models (LLMs) have enabled agentic systems for complex, multi-step tasks; cybersecurity is emerging as a prominent application. To evaluate such agents, researchers widely adopt Capture The Flag…
23 -
arXiv — Machine Learning research 17h ago
EqOD: Symmetry-Informed Stability Selection for PDE Identification
arXiv:2605.11524v1 Announce Type: new Abstract: Data-driven identification of partial differential equations (PDEs) relies on sparse regression over a candidate library of differential operators, where larger libraries inflate false positives under observation noise and smaller…
26 -
arXiv — Machine Learning research 1d ago
Reinforcement learning for inverse structural design and rapid laser cutting of kirigami prototypes
arXiv:2605.08098v1 Announce Type: new Abstract: Kirigami is an increasingly useful fabrication method to produce shape-programmable metamaterial structures. However, inverse design remains difficult because deployment…
12 -
arXiv — Machine Learning research 1d ago
Path-Based Gradient Boosting for Graph-Level Prediction
arXiv:2605.08102v1 Announce Type: new Abstract: We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input…
20 -
arXiv — Machine Learning research 1d ago
Distributional Reinforcement Learning via the Cram\'er Distance
arXiv:2605.08104v1 Announce Type: new Abstract: This paper explores the application of the Soft Actor-Critic (SAC) algorithm within a Distributional Reinforcement Learning setting and introduces an implementation of…
15 -
arXiv — Machine Learning research 1d ago
Geometry-free prediction of inertial lift forces in microfluidic devices using deep learning
arXiv:2605.08109v1 Announce Type: new Abstract: Inertial microfluidic devices (IMDs) offer low-cost, high-throughput alternative techniques for many traditional particle- (or cell-) manipulation tasks, but simulating…
19 -
arXiv — Machine Learning research 1d ago
BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models
arXiv:2605.08110v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has become the standard for fine-tuning large pre-trained models at reduced computational cost. However, its low-rank point-estimate updates…
6 -
arXiv — Machine Learning research 1d ago
TTCD:Transformer Integrated Temporal Causal Discovery from Non-Stationary Time Series Data
arXiv:2605.08111v1 Announce Type: new Abstract: The widespread availability of complex time series data in various domains such as environmental science, epidemiology, and economics demands robust causal discovery…
35 -
arXiv — Machine Learning research 1d ago
Do Foundation Model Embeddings Improve Cross-Country Crop Yield Generalisation? A Leave-One-Country-Out Evaluation in Sub-Saharan Africa
arXiv:2605.08113v1 Announce Type: new Abstract: Accurate predictions of smallholder maize yields across national boundaries are critical for food security planning in sub-Saharan Africa, yet most published benchmarks…
17 -
arXiv — Machine Learning research 1d ago
Statistical Inference and Quality Measures of KV Cache Quantisations Inspired by TurboQuant
arXiv:2605.08114v1 Announce Type: new Abstract: We analyse three KV cache quantization schemes under a fair bit budget: \textbf{KV} (scalar MSE baseline), \textbf{KQV} (WHT + MSE on $K$; WHT + MSE + QJL on $V$), and…
27 -
arXiv — Machine Learning research 1d ago
The Safety-Aware Denoiser for Text Diffusion Models
arXiv:2605.08116v1 Announce Type: new Abstract: Recent work on text diffusion models offers a promising alternative to autoregressive generation, but controlling their safety remains underexplored. Existing safety…
9 -
arXiv — Machine Learning research 1d ago
Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking
arXiv:2605.08119v1 Announce Type: new Abstract: Tian (2025) proves a repulsion theorem (Theorem 6) for the matrix $ B = (\widetilde{F}^\top \widetilde{F} + \eta I)^{-1} $ during the interactive feature-learning stage of…
31 -
arXiv — Machine Learning research 1d ago
Block-Wise Differentiable Sinkhorn Attention: Tail-Refinement Gradients with a Gap-Aware Dustbin Bridge
arXiv:2605.08123v1 Announce Type: new Abstract: We study long-context balanced entropic optimal transport (OT) attention on TPU hardware through a stopped-base, fixed-depth tail-refinement surrogate. After a stopped…
32 -
arXiv — Machine Learning research 1d ago
Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models
arXiv:2605.08128v1 Announce Type: new Abstract: Gene Regulatory Network (GRN) inference is essential for understanding complex cellular mechanisms, rendered tractable through single-cell transcriptomic data. With the…
19 -
arXiv — Machine Learning research 1d ago
Additive Atomic Forests for Symbolic Function and Antiderivative Discovery
arXiv:2605.08130v1 Announce Type: new Abstract: We present a framework for the simultaneous symbolic recovery of a function and its antiderivative from data. The framework rests on three ideas. First, a derivative…
27 -
arXiv — Machine Learning research 1d ago
Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization
arXiv:2605.08131v1 Announce Type: new Abstract: Inverse reinforcement learning (IRL) learns a reward function and a corresponding policy that best fit the demonstration data of an expert. However, in the current IRL…
18 -
arXiv — Machine Learning research 1d ago
DARE: Diffusion Language Model Activation Reuse for Efficient Inference
arXiv:2605.08134v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to auto-regressive (AR) models, offering greater expressive capacity and potential for…
36 -
arXiv — Machine Learning research 1d ago
Dendritic Neural Networks with Equilibrium Propagation
arXiv:2605.08135v1 Announce Type: new Abstract: Equilibrium propagation (EP) is a biologically plausible alternative to backpropagation (BP), but its effectiveness can degrade in deeper and more challenging learning…
26 -
arXiv — Machine Learning research 1d ago
Weight Pruning Amplifies Bias: A Multi-Method Study of Compressed LLMs for Edge AI
arXiv:2605.08137v1 Announce Type: new Abstract: Weight pruning is widely advocated for deploying Large Language Models on resource-constrained IoT and edge devices, yet its impact on model fairness remains poorly…
6 -
arXiv — Machine Learning research 1d ago
DataArc-SynData-Toolkit: A Unified Closed-Loop Framework for Multi-Path, Multimodal, and Multilingual Data Synthesis
arXiv:2605.08138v1 Announce Type: new Abstract: Synthetic data has emerged as a crucial solution to the data scarcity bottleneck in large language models (LLMs), particularly for specialized domains and low-resource…
10 -
arXiv — Machine Learning research 1d ago
Reasoning emerges from constrained inference manifolds in large language models
arXiv:2605.08142v1 Announce Type: new Abstract: Reasoning in large language models is predominantly evaluated through labeled benchmarks, conflating task performance with the quality of internal inference. Here we study…
15