arXiv — Machine Learning

arXiv — Machine Learning research 5d ago

Dense Supervision Is Not Enough: The Readout Blind Spot in Looped Language Models

arXiv:2606.24898v1 Announce Type: new Abstract: Looped language models turn hidden states into runtime state: each state is decoded for prediction and fed back into future computation. This creates a basic supervision question: which state variables does cross-entropy actually…

From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms

arXiv:2606.24899v1 Announce Type: new Abstract: AI-assisted mathematics is often evaluated on solving predefined problems. In practice, however, many important advances begin earlier, when a vague research intuition is transformed into a concrete problem, a promising route, and…

On-Device Neural Architecture Search

arXiv:2606.24900v1 Announce Type: new Abstract: This paper proposes a new approach to near-sensor computing, in which a lightweight Neural Architecture Search (NAS) is performed directly on the deployment device to find the best tiny neural architecture for analyzing the…

LLM Evolution as an Industry-Scale Ecosystem: A Lifecycle Perspective on Continual Learning

arXiv:2606.24901v1 Announce Type: new Abstract: Continual learning capability is critical for Industrial LLMs, as deployed models must be continuously updated to meet evolving requirements and environments, rather than repeatedly retrained from scratch. However, most existing…

A Spectral Phase Diagram for Binary Few-Shot Classification: Intrinsic Dimensionality, Geometric Saturation, and Representational Diagnosis

arXiv:2606.24903v1 Announce Type: new Abstract: Deciding when to stop collecting labeled examples is a fundamental but undertheorized problem in applied machine learning. The saturation index $S(K) = \operatorname{erank}(\widehat{\Sigma}_W^{(K)}) / K$ measures the ratio of the…

When Do Conservation Laws Survive Learned Representations? Certified Horizons for Latent World Models

arXiv:2606.24945v1 Announce Type: new Abstract: We ask a representation-learning question about physical world models: when does a conservation law remain certifiable after a model learns a latent representation? A certified horizon bounds -- in advance, from measurable model…

Conformal Orbit-Valid Trust Horizons for Equivariant World Models

arXiv:2606.24946v1 Announce Type: new Abstract: Learned world models are useful only over horizons on which their rollout error remains controlled. We study trust-horizon certification for latent world models with known group symmetries. Given a one-step latent residual and a…

Supervised Reinforcement Learning for the Coordination of Distributed Energy Resources

arXiv:2606.24947v1 Announce Type: new Abstract: The increasing integration of distributed energy resources (DERs) is crucial for power system decarbonization, yet unlocking DERs' flexibility is challenged by their inherent uncertainties and modelling complexity. As traditional…

Holographic Memory for Zero-Shot Compositional Reasoning in Knowledge Graphs: A Mechanistic Study of Where and Why It Fails

arXiv:2606.24948v1 Announce Type: new Abstract: Knowledge graph embedding (KGE) models predict single-hop links well but have no mechanism for zero-shot compositional queries: multi-hop questions whose relation chains never appeared during training. Holographic Reduced…

MacroLens: A Multi-Task Benchmark for Contextual Financial Reasoning under Macroeconomic Scenarios

arXiv:2606.24950v1 Announce Type: new Abstract: Financial decision-making is contextual: forecasting prices, valuing companies, and assessing event exposure weigh price history, accounting fundamentals, macroeconomic regime, and contemporaneous text. A benchmark over these four…

How Complexity Contributes to Learning Opacity in Machine Learning

arXiv:2606.24953v1 Announce Type: new Abstract: Machine learning (ML) algorithms are known to be opaque. We do not know the reasons for their predictions. The learning process leading to the prediction function is also opaque. We do not fully understand the time evolution of the…

Digital Twin-Driven Adaptive Sim-to-Real Alignment via Reinforcement Learning for Vibration-Based Bearing Health Monitoring Under Data Scarcity

arXiv:2606.24954v1 Announce Type: new Abstract: Vibration-based health monitoring of rotating machinery requires reliable fault diagnosis under operational data constraints, yet condition assessment remains challenged by structural scarcity of fault events and heterogeneous…

Towards Continuous Power Forecasting: Practical Continual Learning for Real-World Energy Systems in Nonstationary Time Series

arXiv:2606.24955v1 Announce Type: new Abstract: Power forecasting models deployed in real-world energy markets must operate under nonstationary conditions, where data distributions continually evolve due to weather variability, infrastructure upgrades, and changing consumption…

Convex--Concave Quadratic Spectral Filtering for Graph Neural Networks

arXiv:2606.24956v1 Announce Type: new Abstract: Spectral graph neural networks (GNNs) interpret message passing as frequency-selective filtering. While low-order spectral filters are efficient, their limited selectivity often leads to weak attenuation outside the passband,…

Swarm-Inspired Generation of Collective Behaviors in Graph Dynamical Systems

arXiv:2606.24958v1 Announce Type: new Abstract: Collective behavior arises when locally interacting units produce coordinated global organization, from synchronization in dynamical systems to task-relevant information flow on graphs. The central challenge is not only to explain…

Reliable Conformal Prediction for Ordinal Classification Using the Ranked Probability Score

arXiv:2606.24959v1 Announce Type: new Abstract: Ordinal classification (OC) arises in high-stakes domains such as medicine and finance, where uncertainty quantification must account for the severity of ordinal errors. Conformal prediction (CP) provides distribution-free…

Enhancing Clinician Decision-Making via Uncertainty-Aware Multi-Expert Fusion for Stroke Rehabilitation

arXiv:2606.24960v1 Announce Type: new Abstract: Tailoring stroke rehabilitation requires assessing how movements are organized, not merely if they succeed. Currently, this assessment is a rate-limiting bottleneck. Instruments like the Action Research Arm Test (ARAT) compress…

Towards Scalable Multi-Task Reinforcement Learning with Large Decision Models

arXiv:2606.24962v1 Announce Type: new Abstract: Recent progress in large-scale sequence modeling has shown that a single model can learn useful representations across highly diverse data distributions. Inspired by these advances, we investigate whether a unified transformer…

Evidence for feature-specific error correction in LLMs

arXiv:2606.24964v1 Announce Type: new Abstract: Understanding the features of large language models (LLMs) is a central goal of interpretability. LLMs are commonly assumed to use superposition to represent more features than they have dimensions. They may not only represent…

Learning Dynamical Systems from Multiple Sparse Datasets: A Hierarchical Bayesian Modeling Approach

arXiv:2606.24966v1 Announce Type: new Abstract: Estimating parameters of dynamical systems from sparse, noisy, and irregularly sampled data is often severely ill-conditioned. When multiple related datasets are available, they provide additional information if the shared…

What Do Language Priors Contribute to Darcy-Flow Inversion? A Mechanistic Audit

arXiv:2606.24967v1 Announce Type: new Abstract: In ill-posed inverse problems, the recovered solution depends as much on the prior as on the data, yet much of the engineering knowledge that could serve as that prior is recorded qualitatively rather than in formal mathematical…

Training Dynamics of Neural Software Defect Predictors under Coupled Data-Quality Issues

arXiv:2606.24968v1 Announce Type: new Abstract: Context: Software defect prediction supports maintenance decisions such as testing prioritization, release-risk assessment, and quality monitoring. However, metric-based SDP datasets often contain coupled data-quality issues,…

Frequency Domain Reservoir Computing

arXiv:2606.24969v1 Announce Type: new Abstract: While the quadratic sequence-length bottleneck of transformers has fueled a resurgence in recurrent models, effectively capturing complex dynamics requires architectures that balance efficient training with highly expressive latent…

Don't Go Breaking My LLM: The Impact of Pruning Attention Layers on Explanation Faithfulness and Confidence Calibration

arXiv:2606.24970v1 Announce Type: new Abstract: Pruning Large Language Models (LLMs) reduces memory and inference costs by removing parts of the network, producing smaller models that retain most of their accuracy. As attention layers are the most resource-intensive parts of…

Quantifying Explainable AI-introduced signal noise on ECG data with Spectral Entropy

arXiv:2606.24974v1 Announce Type: new Abstract: Explainability techniques are used to assess the output of various deep learning models. This is especially true in healthcare, where models need to be trusted and decisions justified. Explainability (XAI) tools use heuristics…

Why Do Accumulated Transformations Extrapolate?

arXiv:2606.24975v1 Announce Type: new Abstract: PaTH Attention showed that replacing RoPE's position-indexed rotations with accumulated data-dependent Householder reflections yields strong length extrapolation, though performance degrades at extreme context lengths. We ask…

Auto-Configured Explainable Graph Neural Networks for Multi-Site Pollution Prediction

arXiv:2606.24978v1 Announce Type: new Abstract: Accurate particulate matter (PM) prediction is crucial for mitigating air pollution. Graph Neural Networks (GNNs) effectively model spatiotemporal dependencies, but predefined graphs limit adaptability, and some datasets complicate…

CKM-Driven Communication-Aware UAV Intelligent Trajectory Optimization for Urban Inspection

arXiv:2606.24979v1 Announce Type: new Abstract: Unmanned aerial vehicles (UAVs) are increasingly employed in urban inspection tasks, where reliable communication is critical but challenging due to the severe spatial channel heterogeneity. To address the issue, in this paper, we…

Closed-Loop Graph Algorithm Execution with Small Language Models: Step Accuracy and Rollout Reliability

arXiv:2606.24980v1 Announce Type: new Abstract: Small language models offer an efficient alternative to large-scale systems, but their ability to execute structured algorithms over multiple dependent decisions remains poorly understood. We study graph algorithm execution as a…

A Single Stepsize Suffices for Unprojected Linear TD(0): Simultaneous Robust and Fast Rates via Polyak--Ruppert Averaging

arXiv:2606.24981v1 Announce Type: new Abstract: We study linear TD(0) under Markovian sampling, where data are generated along a single trajectory. We provide high-probability guarantees for a plain unprojected TD(0) algorithm with Polyak-Ruppert (PR) averaging, using a single…

Latent Block-Diffusion Temporal Point Processes: A Semi-Autoregressive Framework for Asynchronous Event Sequence Generation

arXiv:2606.24982v1 Announce Type: new Abstract: Modeling and sampling from the underlying distribution of asynchronous event sequences are crucial in various real-world applications, including social networks, medical diagnosis, and financial transactions. Existing…

Learning Diachronic Representations of Ancient Greek Letterforms

arXiv:2606.24984v1 Announce Type: new Abstract: Learning representations that remain robust across centuries of variation in handwriting is a key challenge in diachronic representation learning. Taking one of the longest continuously used writing systems, ancient Greek, as a…

Retrieval-Augmented Personalization with Foundation Models for Wearable Stress Detection

arXiv:2606.24985v1 Announce Type: new Abstract: Personalization in wearable-based stress detection remains challenging due to substantial inter-individual variability in physiological and behavioral responses. While traditional approaches rely on user-specific fine-tuning or…

When Multi-Sensor Fusion Fails to Generalize: Cattle Posture Classification Under Animal-Level and Temporal Distribution Shift

arXiv:2606.24986v1 Announce Type: new Abstract: Automated cattle posture-classification systems frequently report near-perfect accuracy, yet their robustness under realistic deployment conditions remains largely unknown. In particular, it is unclear whether multimodal sensor…

Low-Cost High-Order Singular Value Decomposition for Tensor-Based Reconstruction from Sparse Sensor Measurements: Urban Flow and Air-Quality Applications

arXiv:2606.24989v1 Announce Type: new Abstract: Urban flow and air-quality simulations generate high-dimensional datasets describing velocity and pollutant transport across multiple spatial, temporal, and physical-variable dimensions. Reconstructing these fields from sparse…

Uncertainty-aware reinforcement learning for chemical language models

arXiv:2606.24990v1 Announce Type: new Abstract: Reinforcement Learning (RL) has become a powerful paradigm for de novo molecular design, enabling Chemical Language Models (CLMs) to navigate and explore the chemical space while optimizing specific desired properties. However, the…

The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order

arXiv:2606.24993v1 Announce Type: new Abstract: Sequential learning is order-dependent: from Pile-style next-token domain adaptation to instruction-SFT and DPO, N candidate sources induce N! possible curricula. We show that the local order effect is governed by a computable…

ExTra: Exploratory Trajectory Optimization for Language Model Reinforcement Learning

arXiv:2606.24994v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) for language-model reasoning can fail at both extremes of task difficulty: easy prompts often produce all-correct, low-diversity rollout groups with little gradient signal,…

Are Tabular Foundation Models Robust to Realistic Query Distribution Shifts in Microbiome Data?

arXiv:2606.24995v1 Announce Type: new Abstract: Tabular foundation models (TFMs) achieve strong performance on microbiome abundance data, yet their robustness under realistic distribution shift remains poorly characterized. We introduce a benchmark that evaluates the robustness…

From Forecasting Leaderboards to Deployment Decisions: A Fail-Closed Certification Protocol

arXiv:2606.24996v1 Announce Type: new Abstract: Forecasting leaderboards rank models by predictive quality, but their winners are often read as deployment-ready top-1 advice. That reading can fail when forecasts are passed through a fixed decision interface, such as an alert…

What's in an Earth Embedding? An Explainability Analysis of Location Encoders

arXiv:2606.24997v1 Announce Type: new Abstract: Geographic implicit neural representations (INRs) learn to map any coordinate on Earth to a location embedding, implicitly encoding geospatial data into the weights of a neural network. Location embeddings are widely used off the…

Internal Data Repetition Destroys Language Models

arXiv:2606.24998v1 Announce Type: new Abstract: Language models are running out of high-quality training data, and even aggressively deduplicated corpora retain some amount of repetition. Earlier controlled studies predated Chinchilla-style scaling laws and could only measure…

A Zeroth-Order Deep Learning Method for Fully Nonlinear Parabolic Partial Differential Equations with Unknown Coefficients

arXiv:2606.24999v1 Announce Type: new Abstract: High-dimensional partial differential equations (PDEs) with unknown coefficients arise widely in scientific machine learning, including continuous-time reinforcement learning, yet solving them efficiently in a data-driven way…

Geo-Strat-RL: Learning Geological Event Reasoning from Verifiable Tasks

arXiv:2606.25000v1 Announce Type: new Abstract: To evaluate whether vision-language models can reason about geological histories, it is necessary to construct observations for which the underlying process history is known. Furthermore, reasoning over geological histories is not…

Erased, but Not Gone: Output Forgetting Is Not True Forgetting

arXiv:2606.25001v1 Announce Type: new Abstract: Machine unlearning (MU) is commonly judged by output forgetting, such as low forget-set accuracy or reduced logit-level membership inference. But if output-level success can coexist with retraining-inconsistent residuals in…

TRACER: Training-Free Closed-Loop Structured Inference for Traffic Accident Reconstruction

arXiv:2606.25002v1 Announce Type: new Abstract: Traffic accident reconstruction is a forensic inverse problem that requires recovering physically consistent motion from sparse and heterogeneous evidence. Existing learning-based approaches predominantly optimize for semantic…

Adaptive Joint Compression and Synchronisation in Federated Split Learning for IoT Rainfall Prediction

arXiv:2606.25003v1 Announce Type: new Abstract: Federated split learning (FSL) enables collaborative training across bandwidth-constrained IoT devices, but repeated activation and gradient exchange creates a communication bottleneck. Prior work optimises either activation…

Certification of Machine Learning Models via Directional Sharpness

arXiv:2606.25004v1 Announce Type: new Abstract: In machine learning, model certification has been identified as an important method for gaining assurance about a model's trustworthiness and quality. A model's quality is largely determined by its ability to generalize, i.e., to…

Scalable Peptide Design via Memory-Efficient Equivariant Transformer

arXiv:2606.25006v1 Announce Type: new Abstract: Target-specific peptide design requires sequence and structure co-design under full atom geometric constraints. Latent generative frameworks offer an effective route for this problem by compressing fine grained atomic structures…

Multi-Stream Temporal Fusion for Financial Fraud Detection

arXiv:2606.25007v1 Announce Type: new Abstract: Financial fraud detection in digital banking requires reasoning over multiple heterogeneous event streams -- transactions, login sessions, risk signals -- that individually appear benign but collectively reveal fraudulent patterns.…

