arXiv — Machine Learning
500 articles archived · Visit source ↗ · RSS
-
arXiv — Machine Learning research 4d ago
Zero-Shot Size Transfer for Neural ODEs on Sparse Random Graphs: Graphon Limits and Adjoint Convergence
arXiv:2606.26662v1 Announce Type: new Abstract: Graph Neural Differential Equations (GNDEs) model continuous-time graph dynamics by parameterizing Neural ODE velocity fields with Graph Neural Networks. Their local, size-independent filters suggest a zero-shot size-transfer…
24 -
arXiv — Machine Learning research 4d ago
PersistentKV: Page-Aware Decode Scheduling for Long-Context LLM Serving on Commodity GPUs
arXiv:2606.26666v1 Announce Type: new Abstract: Autoregressive large language model (LLM) serving is increasingly limited by key-value (KV) cache movement rather than dense matrix multiplication. Modern paged-attention systems reduce KV-cache fragmentation and mature kernels…
20 -
-
arXiv — Machine Learning research 4d ago
HyperDFlash: MHC-Aligned Block Speculative Decoding with Gated Residual Reduction
arXiv:2606.26744v1 Announce Type: new Abstract: We present HyperDFlash, a block-parallel speculative decoding framework tailored to the novel multi-hyper-connection (MHC) architecture proposed by DeepSeek-V4. Despite the strong initial-token drafting performance of the native…
10 -
arXiv — Machine Learning research 4d ago
Structure Before Collapse: Transient semantic geometry in next-token prediction
arXiv:2606.26749v1 Announce Type: new Abstract: Neural Collapse predicts that balanced one-hot classification pushes model representations to be equally far from each other; a symmetric configuration that depends only on the output label and ignores any semantic similarity in…
29 -
arXiv — Machine Learning research 4d ago
Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication
arXiv:2606.26757v1 Announce Type: new Abstract: Edible insects offer an efficient source of alternative protein, requiring less land, water and emitting less greenhouse gas than conventional livestock. However, their successful integration into the food supply chain demands…
22 -
arXiv — Machine Learning research 4d ago
Escaping Iterative Parameter-Space Noise: Differentially Private Learning with a Hypernetwork
arXiv:2606.26772v1 Announce Type: new Abstract: Differentially private (DP) training of neural networks is often hindered by the large amount of noise required by gradient-based methods such as DP-SGD, which repeatedly inject high-dimensional noise in parameter space throughout…
20 -
arXiv — Machine Learning research 4d ago
Reproducibility Study of "AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models"
arXiv:2606.26783v1 Announce Type: new Abstract: Fang et al. (2025) introduced a null-space constrained projection, named AlphaEdit, for locate-then-edit knowledge editing methods, theoretically guaranteeing that edits do not disrupt previously preserved knowledge, and reports…
20 -
arXiv — Machine Learning research 4d ago
AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing
arXiv:2606.26787v1 Announce Type: new Abstract: Traditional dynamic pricing models in large-scale e-commerce suffer from limited interpretability, poor utilization of unstructured information, and misalignment with long-term business objectives such as cumulative Gross…
26 -
arXiv — Machine Learning research 4d ago
Reasoning Quality Emerges Early: Data Curation for Reasoning Models
arXiv:2606.26797v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) on a small, high-quality set of long reasoning traces is an effective approach for eliciting strong reasoning capabilities in Large Language Models (LLMs). However, existing methods for curating…
14 -
arXiv — Machine Learning research 4d ago
Quantization in Federated Learning: Methods, Challenges and Future Directions
arXiv:2606.26822v1 Announce Type: new Abstract: Federated Learning (FL) has become a foundational paradigm for privacy-preserving distributed intelligence, yet its scalability remains fundamentally constrained by communication bottlenecks, device heterogeneity, and the…
20 -
arXiv — Machine Learning research 4d ago
Asymptotically Optimal Learning for Parametric Prophet Inequalities
arXiv:2606.26893v1 Announce Type: new Abstract: We study learning in prophet inequalities with i.i.d. rewards drawn from an exponential-type parametric family with an unknown parameter $\theta$, a class that includes exponential, Pareto, and bounded-support power-family…
32 -
arXiv — Machine Learning research 4d ago
GEOALIGN: Geometric Rollout Curation for Robust LLM Reinforcement Learning
arXiv:2606.26917v1 Announce Type: new Abstract: Online reinforcement learning is widely used to align large language models (LLMs) with reward signals, yet training can be unstable under noisy or misspecified rewards. We identify a failure mode we call directional inconsistency:…
26 -
arXiv — Machine Learning research 4d ago
Decision-Aligned Evaluation of Uncertainty Quantification
arXiv:2606.26990v1 Announce Type: new Abstract: Uncertainty estimates in machine learning are typically evaluated using generic metrics such as the negative log-likelihood and expected calibration error, yet good performance on such metrics does not necessarily imply high…
13 -
arXiv — Machine Learning research 4d ago
Uncertainty quantification via conformal prediction in data assimilation
arXiv:2606.27001v1 Announce Type: new Abstract: Quantifying the evolution of uncertainty is critical to both probabilistic forecasting and data assimilation in numerical weather prediction. In this study, we investigate the applicability of conformal prediction (CP), a recent…
30 -
arXiv — Machine Learning research 4d ago
A Generalization Theory for JEPA-Based World Models
arXiv:2606.27014v1 Announce Type: new Abstract: Joint Embedding Predictive Architectures (JEPAs) have recently emerged as a promising paradigm for world modeling by learning predictive dynamics in a latent space rather than generating future observations at the input level.…
5 -
arXiv — Machine Learning research 4d ago
Just how sure are you? Improving Verbalized Uncertainty Calibration in Medical VQA
arXiv:2606.27023v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) applied to Medical Visual Question Answering (VQA) tend to produce overconfident outputs regardless of actual correctness, and existing verbalized confidence calibration methods, developed…
15 -
arXiv — Machine Learning research 4d ago
Symplectic Neural Networks for learning Generalized Hamiltonians
arXiv:2606.27029v1 Announce Type: new Abstract: Hamiltonian Neural Networks (HNNs) integrate physical priors into neural models by learning a system's Hamiltonian, improving generalization and sample efficiency. Identifying the system Hamiltonian from noisy observations of state…
9 -
arXiv — Machine Learning research 4d ago
State Representation Matters in Deep Reinforcement Learning: Application to Energy Trading
arXiv:2606.27032v1 Announce Type: new Abstract: Energy trading decisions depend not only on current market prices, but also on expected future market conditions, and operational constraints. This makes the state representation given to a reinforcement learning agent an important…
5 -
arXiv — Machine Learning research 4d ago
Finding Stationary Points by Comparisons
arXiv:2606.27082v1 Announce Type: new Abstract: We study the problem of finding stationary points of non-convex functions when access to the objective is provided only through a comparison oracle that, given two points, outputs which has the larger function value. For a twice…
17 -
arXiv — Machine Learning research 4d ago
Data-Free Reservoir Features for Efficient Long-Horizon Cold-Start Continual Learning
arXiv:2606.27095v1 Announce Type: new Abstract: Cold-start exemplar-free class-incremental learning requires learning a growing set of classes without replay, external pretraining, or a large initial task. Existing cold-start methods typically either train the backbone…
19 -
arXiv — Machine Learning research 4d ago
Transformer-Based Classification of Bacterial Raman Spectra with LOOCV
arXiv:2606.27096v1 Announce Type: new Abstract: Transformer-based models have recently attracted increasing attention for Raman spectral classification. In this study, a transformer-based approach was systematically evaluated using a nested leave-one-replicate-out…
31 -
arXiv — Machine Learning research 4d ago
Heavy-Ball Q-Learning with Residual Weighting Correction
arXiv:2606.27112v1 Announce Type: new Abstract: This paper proposes a corrected heavy-ball Q-learning method for reinforcement learning (RL) and establishes its convergence. It also identifies conditions under which the method is theoretically guaranteed to converge faster than…
31 -
arXiv — Machine Learning research 4d ago
Cross-Head Attention Uplift Network with Inverse Propensity Score under Unobserved Confounding
arXiv:2606.27114v1 Announce Type: new Abstract: Uplift modeling, crucial for estimating individual treatment effects (ITE), faces dual challenges: flexibly leveraging inter-group similarity to enhance discriminative power and debiasing under unobserved confounding scenarios. In…
19 -
arXiv — Machine Learning research 4d ago
Kolmogorov Arnold networks (KAN) for aerodynamic prediction: a comparison with MLPs and GNNs
arXiv:2606.27126v1 Announce Type: new Abstract: Kolmogorov Arnold networks (KAN) have recently been introduced as a (deep) neural network architecture whose trainable parameters adapt the activation functions, instead of the coefficients of the affine transformations at the core…
27 -
arXiv — Machine Learning research 4d ago
fTNN: a tensor neural network for fractional PDEs
arXiv:2606.27140v1 Announce Type: new Abstract: We develop the fTNN, a deterministic tensor neural network subspace method for problems involving the fractional Laplacian on bounded domains, taking the fractional Poisson equation and time-dependent fractional advection-diffusion…
21 -
arXiv — Machine Learning research 4d ago
Stochastic Gradient Optimization with Model-Assisted Sampling
arXiv:2606.27171v1 Announce Type: new Abstract: This work addresses the problem of variance in stochastic gradient estimation for machine learning optimization. Deep learning relies on mini-batch methods such as stochastic gradient descent, which approximate full gradients but…
34 -
arXiv — Machine Learning research 4d ago
RecallRisk-BERT: A Multi-Task Framework for Post-Report Medical Device Recall Triage
arXiv:2606.27174v1 Announce Type: new Abstract: Medical device recalls are a critical regulatory mechanism for protecting patient safety. The growing volume of FDA recall records presents challenges in post-report recall triage, severity assessment, and root-cause…
24 -
arXiv — Machine Learning research 4d ago
Automating Potential-based Reward Shaping with Vision Language Model Guidance
arXiv:2606.27180v1 Announce Type: new Abstract: Sparse rewards are inherently challenging for reinforcement learning agents as they lack intermediate feedback to guide exploration and to correctly attribute the sparse success rewards to relevant parts of the trajectory. Naive…
36 -
arXiv — Machine Learning research 4d ago
Explaining Temporal Graph Neural Networks via Feature-induced Information Flow
arXiv:2606.27201v1 Announce Type: new Abstract: Event-based Temporal Graph Neural Networks (ETGNNs) have demonstrated strong performance across a wide range of applications, including social network analysis, epidemic tracing, recommender systems, and political event…
5 -
arXiv — Machine Learning research 4d ago
Graph Neural Networks Applications Across Domains: All Insights You Need
arXiv:2606.27202v1 Announce Type: new Abstract: Graph neural networks have moved from a niche representation-learning technique to the default model class wherever data carry relational structure. The interesting question is no longer whether message passing helps on a given…
17 -
arXiv — Machine Learning research 4d ago
The Geometry of Updates: Fisher Alignment at Vocabulary Scale
arXiv:2606.27242v1 Announce Type: new Abstract: Training-free source selection for LLM families with shared vocabularies arises in scientific string domains such as SMILES, protein, and genomic sequences, where candidate corpora share a tokenizer but differ in prediction…
38 -
arXiv — Machine Learning research 4d ago
Effective Covariance Dynamics in Solvable High-Dimensional GANs
arXiv:2606.27246v1 Announce Type: new Abstract: We study a solvable high-dimensional model of generative adversarial network (GAN) training in which a linear generator learns a low-dimensional subspace from data with structured latent covariance. Prior solvable GAN analyses…
33 -
-
arXiv — Machine Learning research 4d ago
BetXplain: An Explanation-Annotated Dataset for Detecting Manipulative Betting Advertisements on Social Media
arXiv:2606.27274v1 Announce Type: new Abstract: The promotion of betting applications on social media platforms has increased significantly in recent years. Many of these advertisements use persuasive techniques that may mislead users, encourage risky behavior, and potentially…
37 -
arXiv — Machine Learning research 4d ago
How Good Can Linear Models Be for Time-Series Forecasting?
arXiv:2606.27282v1 Announce Type: new Abstract: Time-series forecasting research has been moving steadily toward larger architectures, from specialized transformers to general-purpose foundation models, on the assumption that capacity is what unlocks accuracy. We take the…
19 -
arXiv — Machine Learning research 4d ago
Recovering Governing Equations from Solution Data: Identifiability Bounds for Linear and Nonlinear ODEs
arXiv:2606.27285v1 Announce Type: new Abstract: Learning governing equations from observed solution data is a fundamental challenge in scientific machine learning…
15 -
arXiv — Machine Learning research 4d ago
Designing Reward Signals for Portable Query Generation: A Case Study in Industrial Semantic Job Search
arXiv:2606.27291v1 Announce Type: new Abstract: Job-search platforms rely on low-bandwidth query interfaces that often fail to capture the high-dimensional complexity of candidate profiles. We present an end-to-end RLAIF (Reinforcement Learning from AI Feedback) framework to…
10 -
-
arXiv — Machine Learning research 4d ago
Blackwell Approachability and Gradient Equilibrium are Equivalent
arXiv:2606.27315v1 Announce Type: new Abstract: Gradient equilibrium (GEQ) is a recently introduced online optimization framework that generalizes first-order stationarity from offline optimization and abstracts problems like online conformal prediction. While GEQ has curious…
20 -
arXiv — Machine Learning research 4d ago
Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders
arXiv:2606.27321v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) have become a leading tool for interpreting the representations of vision foundation models, decomposing their polysemantic activations into a larger set of sparse, more monosemantic features. The Top-$k$…
22 -
arXiv — Machine Learning research 4d ago
Hallucination in World Models is Predictable and Preventable
arXiv:2606.27326v1 Announce Type: new Abstract: Modern generative world models render increasingly realistic action-controllable futures, yet they frequently hallucinate: rollouts remain visually fluent while drifting from the ground-truth dynamics. We hypothesize that…
19 -
arXiv — Machine Learning research 4d ago
Error-Conditioned Neural Solvers
arXiv:2606.27354v1 Announce Type: new Abstract: Neural surrogate models offer fast approximate mappings from PDE parameters to solutions, but they typically treat solving as a purely statistical task: once trained, they struggle to correct their own constraint violations and…
25 -
arXiv — Machine Learning research 4d ago
Autoregressive Boltzmann Generators
arXiv:2606.27361v1 Announce Type: new Abstract: Efficient sampling of molecular systems at thermodynamic equilibrium is a hallmark challenge in statistical physics. This challenge has driven the development of Boltzmann Generators (BGs), which allow rapid generation of…
7 -
arXiv — Machine Learning research 4d ago
Reinforcement Learning without Ground-Truth Solutions can Improve LLMs
arXiv:2606.27369v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) for training LLMs typically rely on ground-truth answers to assign rewards, limiting their applicability to tasks where the ground-truth solution is unknown. We introduce a…
19 -
arXiv — Machine Learning research 4d ago
Context Recycling for Long-Horizon LLM Inference
arXiv:2606.26105v1 Announce Type: cross Abstract: Large language models (LLMs) exhibit strong capabilities in short-context reasoning but degrade in performance over long conversational horizons due to context window limitations and inefficient token usage. We introduce…
27 -
arXiv — Machine Learning research 4d ago
The Open Source Economic Index of AI Adoption and Capability
arXiv:2606.26118v1 Announce Type: cross Abstract: We work towards measuring both AI adoption and the capability of AI to perform discrete labor tasks across various occupations. To measure adoption, we develop an open-source economic index that uses publicly available user-LLM…
5 -
arXiv — Machine Learning research 4d ago
Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM
arXiv:2606.26120v1 Announce Type: cross Abstract: Diffusion Large Language Models (dLLMs) offer a promising alternative to autoregressive models, excelling in text generation tasks due to their bidirectional attention mechanisms. However, their computational complexity scales on…
15 -
arXiv — Machine Learning research 4d ago
Dot-Flik: A Scalable Edge AI Architecture for Distributed Insect Monitoring
arXiv:2606.26121v1 Announce Type: cross Abstract: Global insect population declines necessitate scalable, continuous monitoring systems, yet existing vision-based solutions remain constrained by high hardware costs, energy demands, and reliance on centralized processing or cloud…
11 -
arXiv — Machine Learning research 4d ago
Code evolution for link prediction in complex networks
arXiv:2606.26132v1 Announce Type: cross Abstract: The problem of predicting links in complex networks appears in different disciplines and has led to a variety of ingenious human-designed methods. We use this rich program space to explore the performance and behavior of…
8