arXiv — Machine Learning

500 articles archived · Visit source ↗ · RSS

arXiv — Machine Learning research 4d ago

Zero-Shot Size Transfer for Neural ODEs on Sparse Random Graphs: Graphon Limits and Adjoint Convergence

arXiv:2606.26662v1 Announce Type: new Abstract: Graph Neural Differential Equations (GNDEs) model continuous-time graph dynamics by parameterizing Neural ODE velocity fields with Graph Neural Networks. Their local, size-independent filters suggest a zero-shot size-transfer…

24
arXiv — Machine Learning research 4d ago

PersistentKV: Page-Aware Decode Scheduling for Long-Context LLM Serving on Commodity GPUs

arXiv:2606.26666v1 Announce Type: new Abstract: Autoregressive large language model (LLM) serving is increasingly limited by key-value (KV) cache movement rather than dense matrix multiplication. Modern paged-attention systems reduce KV-cache fragmentation and mature kernels…

20
arXiv — Machine Learning research 4d ago

Algorithmic Foundations of Deep Learning: Complexity-Theoretic Rates and a Characterization of Universal Approximation

arXiv:2606.26705v1 Announce Type: new Abstract: Feedforward neural network (NN) expressivity is typically studied by emulating optimal basis-expansion schemes. While powerful, this perspective is incomplete: it primarily captures complexity through regularity, and therefore does…

37
arXiv — Machine Learning research 4d ago

HyperDFlash: MHC-Aligned Block Speculative Decoding with Gated Residual Reduction

arXiv:2606.26744v1 Announce Type: new Abstract: We present HyperDFlash, a block-parallel speculative decoding framework tailored to the novel multi-hyper-connection (MHC) architecture proposed by DeepSeek-V4. Despite the strong initial-token drafting performance of the native…

10
arXiv — Machine Learning research 4d ago

Structure Before Collapse: Transient semantic geometry in next-token prediction

arXiv:2606.26749v1 Announce Type: new Abstract: Neural Collapse predicts that balanced one-hot classification pushes model representations to be equally far from each other; a symmetric configuration that depends only on the output label and ignores any semantic similarity in…

29
arXiv — Machine Learning research 4d ago

Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication

arXiv:2606.26757v1 Announce Type: new Abstract: Edible insects offer an efficient source of alternative protein, requiring less land, water and emitting less greenhouse gas than conventional livestock. However, their successful integration into the food supply chain demands…

22
arXiv — Machine Learning research 4d ago

Escaping Iterative Parameter-Space Noise: Differentially Private Learning with a Hypernetwork

arXiv:2606.26772v1 Announce Type: new Abstract: Differentially private (DP) training of neural networks is often hindered by the large amount of noise required by gradient-based methods such as DP-SGD, which repeatedly inject high-dimensional noise in parameter space throughout…

20
arXiv — Machine Learning research 4d ago

Reproducibility Study of "AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models"

arXiv:2606.26783v1 Announce Type: new Abstract: Fang et al. (2025) introduced a null-space constrained projection, named AlphaEdit, for locate-then-edit knowledge editing methods, theoretically guaranteeing that edits do not disrupt previously preserved knowledge, and reports…

20
arXiv — Machine Learning research 4d ago

AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing

arXiv:2606.26787v1 Announce Type: new Abstract: Traditional dynamic pricing models in large-scale e-commerce suffer from limited interpretability, poor utilization of unstructured information, and misalignment with long-term business objectives such as cumulative Gross…

26
arXiv — Machine Learning research 4d ago

Reasoning Quality Emerges Early: Data Curation for Reasoning Models

arXiv:2606.26797v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) on a small, high-quality set of long reasoning traces is an effective approach for eliciting strong reasoning capabilities in Large Language Models (LLMs). However, existing methods for curating…

14
arXiv — Machine Learning research 4d ago

Quantization in Federated Learning: Methods, Challenges and Future Directions

arXiv:2606.26822v1 Announce Type: new Abstract: Federated Learning (FL) has become a foundational paradigm for privacy-preserving distributed intelligence, yet its scalability remains fundamentally constrained by communication bottlenecks, device heterogeneity, and the…

20
arXiv — Machine Learning research 4d ago

Asymptotically Optimal Learning for Parametric Prophet Inequalities

arXiv:2606.26893v1 Announce Type: new Abstract: We study learning in prophet inequalities with i.i.d. rewards drawn from an exponential-type parametric family with an unknown parameter $\theta$, a class that includes exponential, Pareto, and bounded-support power-family…

32
arXiv — Machine Learning research 4d ago

GEOALIGN: Geometric Rollout Curation for Robust LLM Reinforcement Learning

arXiv:2606.26917v1 Announce Type: new Abstract: Online reinforcement learning is widely used to align large language models (LLMs) with reward signals, yet training can be unstable under noisy or misspecified rewards. We identify a failure mode we call directional inconsistency:…

26
arXiv — Machine Learning research 4d ago

Decision-Aligned Evaluation of Uncertainty Quantification

arXiv:2606.26990v1 Announce Type: new Abstract: Uncertainty estimates in machine learning are typically evaluated using generic metrics such as the negative log-likelihood and expected calibration error, yet good performance on such metrics does not necessarily imply high…

13
arXiv — Machine Learning research 4d ago

Uncertainty quantification via conformal prediction in data assimilation

arXiv:2606.27001v1 Announce Type: new Abstract: Quantifying the evolution of uncertainty is critical to both probabilistic forecasting and data assimilation in numerical weather prediction. In this study, we investigate the applicability of conformal prediction (CP), a recent…

30
arXiv — Machine Learning research 4d ago

A Generalization Theory for JEPA-Based World Models

arXiv:2606.27014v1 Announce Type: new Abstract: Joint Embedding Predictive Architectures (JEPAs) have recently emerged as a promising paradigm for world modeling by learning predictive dynamics in a latent space rather than generating future observations at the input level.…

5
arXiv — Machine Learning research 4d ago

Just how sure are you? Improving Verbalized Uncertainty Calibration in Medical VQA

arXiv:2606.27023v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) applied to Medical Visual Question Answering (VQA) tend to produce overconfident outputs regardless of actual correctness, and existing verbalized confidence calibration methods, developed…

15
arXiv — Machine Learning research 4d ago

Symplectic Neural Networks for learning Generalized Hamiltonians

arXiv:2606.27029v1 Announce Type: new Abstract: Hamiltonian Neural Networks (HNNs) integrate physical priors into neural models by learning a system's Hamiltonian, improving generalization and sample efficiency. Identifying the system Hamiltonian from noisy observations of state…

9
arXiv — Machine Learning research 4d ago

State Representation Matters in Deep Reinforcement Learning: Application to Energy Trading

arXiv:2606.27032v1 Announce Type: new Abstract: Energy trading decisions depend not only on current market prices, but also on expected future market conditions, and operational constraints. This makes the state representation given to a reinforcement learning agent an important…

5
arXiv — Machine Learning research 4d ago

Finding Stationary Points by Comparisons

arXiv:2606.27082v1 Announce Type: new Abstract: We study the problem of finding stationary points of non-convex functions when access to the objective is provided only through a comparison oracle that, given two points, outputs which has the larger function value. For a twice…

17
arXiv — Machine Learning research 4d ago

Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning

arXiv:2606.27095v1 Announce Type: new Abstract: Cold-start exemplar-free class-incremental learning requires learning a growing set of classes without replay, external pretraining, or a large initial task. Existing cold-start methods typically either train the backbone…

19
arXiv — Machine Learning research 4d ago

Transformer-Based Classification of Bacterial Raman Spectra with LOOCV

arXiv:2606.27096v1 Announce Type: new Abstract: Transformer-based models have recently attracted increasing attention for Raman spectral classification. In this study, a transformer-based approach was systematically evaluated using a nested leave-one-replicate-out…

31
arXiv — Machine Learning research 4d ago

Heavy-Ball Q-Learning with Residual Weighting Correction

arXiv:2606.27112v1 Announce Type: new Abstract: This paper proposes a corrected heavy-ball Q-learning method for reinforcement learning (RL) and establishes its convergence. It also identifies conditions under which the method is theoretically guaranteed to converge faster than…

31
arXiv — Machine Learning research 4d ago

Cross-Head Attention Uplift Network with Inverse Propensity Score under Unobserved Confounding

arXiv:2606.27114v1 Announce Type: new Abstract: Uplift modeling, crucial for estimating individual treatment effects (ITE), faces dual challenges: flexibly leveraging inter-group similarity to enhance discriminative power and debiasing under unobserved confounding scenarios. In…

19
arXiv — Machine Learning research 4d ago

Kolmogorov Arnold networks (KAN) for aerodynamic prediction: a comparison with MLPs and GNNs

arXiv:2606.27126v1 Announce Type: new Abstract: Kolmogorov Arnold networks (KAN) have recently been introduced as a (deep) neural network architecture whose trainable parameters adapt the activation functions, instead of the coefficients of the affine transformations at the core…

27
arXiv — Machine Learning research 4d ago

fTNN: a tensor neural network for fractional PDEs

arXiv:2606.27140v1 Announce Type: new Abstract: We develop the fTNN, a deterministic tensor neural network subspace method for problems involving the fractional Laplacian on bounded domains, taking the fractional Poisson equation and time-dependent fractional advection-diffusion…

21
arXiv — Machine Learning research 4d ago

Stochastic Gradient Optimization with Model-Assisted Sampling

arXiv:2606.27171v1 Announce Type: new Abstract: This work addresses the problem of variance in stochastic gradient estimation for machine learning optimization. Deep learning relies on mini-batch methods such as stochastic gradient descent, which approximate full gradients but…

34
arXiv — Machine Learning research 4d ago

RecallRisk-BERT: A Multi-Task Framework for Post-Report Medical Device Recall Triage

arXiv:2606.27174v1 Announce Type: new Abstract: Medical device recalls are a critical regulatory mechanism for protecting patient safety. The growing volume of FDA recall records presents challenges in post-report recall triage, severity assessment, and root-cause…

24
arXiv — Machine Learning research 4d ago

Automating Potential-based Reward Shaping with Vision Language Model Guidance

arXiv:2606.27180v1 Announce Type: new Abstract: Sparse rewards are inherently challenging for reinforcement learning agents as they lack intermediate feedback to guide exploration and to correctly attribute the sparse success rewards to relevant parts of the trajectory. Naive…

36
arXiv — Machine Learning research 4d ago

Explaining Temporal Graph Neural Networks via Feature-induced Information Flow

arXiv:2606.27201v1 Announce Type: new Abstract: Event-based Temporal Graph Neural Networks (ETGNNs) have demonstrated strong performance across a wide range of applications, including social network analysis, epidemic tracing, recommender systems, and political event…

5
arXiv — Machine Learning research 4d ago

Graph Neural Networks Applications Across Domains: All Insights You Need

arXiv:2606.27202v1 Announce Type: new Abstract: Graph neural networks have moved from a niche representation-learning technique to the default model class wherever data carry relational structure. The interesting question is no longer whether message passing helps on a given…

17
arXiv — Machine Learning research 4d ago

The Geometry of Updates: Fisher Alignment at Vocabulary Scale

arXiv:2606.27242v1 Announce Type: new Abstract: Training-free source selection for LLM families with shared vocabularies arises in scientific string domains such as SMILES, protein, and genomic sequences, where candidate corpora share a tokenizer but differ in prediction…

38
arXiv — Machine Learning research 4d ago

Effective Covariance Dynamics in Solvable High-Dimensional GANs

arXiv:2606.27246v1 Announce Type: new Abstract: We study a solvable high-dimensional model of generative adversarial network (GAN) training in which a linear generator learns a low-dimensional subspace from data with structured latent covariance. Prior solvable GAN analyses…

33
arXiv — Machine Learning research 4d ago

RSPC: A Benchmark for Modeling Stress and Psychiatric Conditions in Digitally Mediated Relationships using Psychiatrist Annotations

arXiv:2606.27247v1 Announce Type: new Abstract: In NLP, mental health conditions are often modeled as isolated phenomena, without interpersonal context. We use Reddit posts about long-distance relationships to capture both mental health distress and associated relational…

24
arXiv — Machine Learning research 4d ago

BetXplain: An Explanation-Annotated Dataset for Detecting Manipulative Betting Advertisements on Social Media

arXiv:2606.27274v1 Announce Type: new Abstract: The promotion of betting applications on social media platforms has increased significantly in recent years. Many of these advertisements use persuasive techniques that may mislead users, encourage risky behavior, and potentially…

37
arXiv — Machine Learning research 4d ago

How Good Can Linear Models Be for Time-Series Forecasting?

arXiv:2606.27282v1 Announce Type: new Abstract: Time-series forecasting research has been moving steadily toward larger architectures, from specialized transformers to general-purpose foundation models, on the assumption that capacity is what unlocks accuracy. We take the…

19
arXiv — Machine Learning research 4d ago

Recovering Governing Equations from Solution Data: Identifiability Bounds for Linear and Nonlinear ODEs

arXiv:2606.27285v1 Announce Type: new Abstract: Learning governing equations from observed solution data is a fundamental challenge in scientific machine learning…

15
arXiv — Machine Learning research 4d ago

Designing Reward Signals for Portable Query Generation: A Case Study in Industrial Semantic Job Search

arXiv:2606.27291v1 Announce Type: new Abstract: Job-search platforms rely on low-bandwidth query interfaces that often fail to capture the high-dimensional complexity of candidate profiles. We present an end-to-end RLAIF (Reinforcement Learning from AI Feedback) framework to…

10
arXiv — Machine Learning research 4d ago

A Multi-Fidelity Convolutional Autoencoder-Transfer Learning Framework for Guided-Wave-Based Damage Diagnosis Using Large Simulated and Limited Experimental Datasets

arXiv:2606.27304v1 Announce Type: new Abstract: Guided wave-based structural health monitoring (GWSHM) with onboard transducers offers significant potential for the early diagnosis of damage in engineering structures. However, the practical deployment of deep learning models is…

4
arXiv — Machine Learning research 4d ago

Blackwell Approachability and Gradient Equilibrium are Equivalent

arXiv:2606.27315v1 Announce Type: new Abstract: Gradient equilibrium (GEQ) is a recently introduced online optimization framework that generalizes first-order stationarity from offline optimization and abstracts problems like online conformal prediction. While GEQ has curious…

20
arXiv — Machine Learning research 4d ago

Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders

arXiv:2606.27321v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) have become a leading tool for interpreting the representations of vision foundation models, decomposing their polysemantic activations into a larger set of sparse, more monosemantic features. The Top-$k$…

22
arXiv — Machine Learning research 4d ago

Hallucination in World Models is Predictable and Preventable

arXiv:2606.27326v1 Announce Type: new Abstract: Modern generative world models render increasingly realistic action-controllable futures, yet they frequently hallucinate: rollouts remain visually fluent while drifting from the ground-truth dynamics. We hypothesize that…

19
arXiv — Machine Learning research 4d ago

Error-Conditioned Neural Solvers

arXiv:2606.27354v1 Announce Type: new Abstract: Neural surrogate models offer fast approximate mappings from PDE parameters to solutions, but they typically treat solving as a purely statistical task: once trained, they struggle to correct their own constraint violations and…

25
arXiv — Machine Learning research 4d ago

Autoregressive Boltzmann Generators

arXiv:2606.27361v1 Announce Type: new Abstract: Efficient sampling of molecular systems at thermodynamic equilibrium is a hallmark challenge in statistical physics. This challenge has driven the development of Boltzmann Generators (BGs), which allow rapid generation of…

7
arXiv — Machine Learning research 4d ago

Reinforcement Learning without Ground-Truth Solutions can Improve LLMs

arXiv:2606.27369v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) for training LLMs typically rely on ground-truth answers to assign rewards, limiting their applicability to tasks where the ground-truth solution is unknown. We introduce a…

19
arXiv — Machine Learning research 4d ago

Context Recycling for Long-Horizon LLM Inference

arXiv:2606.26105v1 Announce Type: cross Abstract: Large language models (LLMs) exhibit strong capabilities in short-context reasoning but degrade in performance over long conversational horizons due to context window limitations and inefficient token usage. We introduce…

27
arXiv — Machine Learning research 4d ago

The Open Source Economic Index of AI Adoption and Capability

arXiv:2606.26118v1 Announce Type: cross Abstract: We work towards measuring both AI adoption and the capability of AI to perform discrete labor tasks across various occupations. To measure adoption, we develop an open-source economic index that uses publicly available user-LLM…

5
arXiv — Machine Learning research 4d ago

Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM

arXiv:2606.26120v1 Announce Type: cross Abstract: Diffusion Large Language Models (dLLMs) offer a promising alternative to autoregressive models, excelling in text generation tasks due to their bidirectional attention mechanisms. However, their computational complexity scales on…

15
arXiv — Machine Learning research 4d ago

Dot-Flik: A Scalable Edge AI Architecture for Distributed Insect Monitoring

arXiv:2606.26121v1 Announce Type: cross Abstract: Global insect population declines necessitate scalable, continuous monitoring systems, yet existing vision-based solutions remain constrained by high hardware costs, energy demands, and reliance on centralized processing or cloud…

11
arXiv — Machine Learning research 4d ago

Code evolution for link prediction in complex networks

arXiv:2606.26132v1 Announce Type: cross Abstract: The problem of predicting links in complex networks appears in different disciplines and has led to a variety of ingenious human-designed methods. We use this rich program space to explore the performance and behavior of…

8

Zero-Shot Size Transfer for Neural ODEs on Sparse Random Graphs: Graphon Limits and Adjoint Convergence

PersistentKV: Page-Aware Decode Scheduling for Long-Context LLM Serving on Commodity GPUs

Algorithmic Foundations of Deep Learning: Complexity-Theoretic Rates and a Characterization of Universal Approximation

HyperDFlash: MHC-Aligned Block Speculative Decoding with Gated Residual Reduction

Structure Before Collapse: Transient semantic geometry in next-token prediction

Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication

Escaping Iterative Parameter-Space Noise: Differentially Private Learning with a Hypernetwork

Reproducibility Study of "AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models"

AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing

Reasoning Quality Emerges Early: Data Curation for Reasoning Models

Quantization in Federated Learning: Methods, Challenges and Future Directions

Asymptotically Optimal Learning for Parametric Prophet Inequalities

GEOALIGN: Geometric Rollout Curation for Robust LLM Reinforcement Learning

Decision-Aligned Evaluation of Uncertainty Quantification

Uncertainty quantification via conformal prediction in data assimilation

A Generalization Theory for JEPA-Based World Models

Just how sure are you? Improving Verbalized Uncertainty Calibration in Medical VQA

Symplectic Neural Networks for learning Generalized Hamiltonians

State Representation Matters in Deep Reinforcement Learning: Application to Energy Trading

Finding Stationary Points by Comparisons

Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning

Transformer-Based Classification of Bacterial Raman Spectra with LOOCV

Heavy-Ball Q-Learning with Residual Weighting Correction

Cross-Head Attention Uplift Network with Inverse Propensity Score under Unobserved Confounding

Kolmogorov Arnold networks (KAN) for aerodynamic prediction: a comparison with MLPs and GNNs

fTNN: a tensor neural network for fractional PDEs

Stochastic Gradient Optimization with Model-Assisted Sampling

RecallRisk-BERT: A Multi-Task Framework for Post-Report Medical Device Recall Triage

Automating Potential-based Reward Shaping with Vision Language Model Guidance

Explaining Temporal Graph Neural Networks via Feature-induced Information Flow

Graph Neural Networks Applications Across Domains: All Insights You Need

The Geometry of Updates: Fisher Alignment at Vocabulary Scale

Effective Covariance Dynamics in Solvable High-Dimensional GANs

RSPC: A Benchmark for Modeling Stress and Psychiatric Conditions in Digitally Mediated Relationships using Psychiatrist Annotations

BetXplain: An Explanation-Annotated Dataset for Detecting Manipulative Betting Advertisements on Social Media

How Good Can Linear Models Be for Time-Series Forecasting?

Recovering Governing Equations from Solution Data: Identifiability Bounds for Linear and Nonlinear ODEs

Designing Reward Signals for Portable Query Generation: A Case Study in Industrial Semantic Job Search

A Multi-Fidelity Convolutional Autoencoder-Transfer Learning Framework for Guided-Wave-Based Damage Diagnosis Using Large Simulated and Limited Experimental Datasets

Blackwell Approachability and Gradient Equilibrium are Equivalent

Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders

Hallucination in World Models is Predictable and Preventable

Error-Conditioned Neural Solvers

Autoregressive Boltzmann Generators

Reinforcement Learning without Ground-Truth Solutions can Improve LLMs

Context Recycling for Long-Horizon LLM Inference

The Open Source Economic Index of AI Adoption and Capability

Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM

Dot-Flik: A Scalable Edge AI Architecture for Distributed Insect Monitoring

Code evolution for link prediction in complex networks