arXiv — Machine Learning
-
arXiv — Machine Learning research 5d ago
Dense Supervision Is Not Enough: The Readout Blind Spot in Looped Language Models
arXiv:2606.24898v1 Announce Type: new Abstract: Looped language models turn hidden states into runtime state: each state is decoded for prediction and fed back into future computation. This creates a basic supervision question: which state variables does cross-entropy actually…
37 -
arXiv — Machine Learning research 5d ago
From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms
arXiv:2606.24899v1 Announce Type: new Abstract: AI-assisted mathematics is often evaluated on solving predefined problems. In practice, however, many important advances begin earlier, when a vague research intuition is transformed into a concrete problem, a promising route, and…
37 -
arXiv — Machine Learning research 5d ago
On-Device Neural Architecture Search
arXiv:2606.24900v1 Announce Type: new Abstract: This paper proposes a new approach to near-sensor computing, in which a lightweight Neural Architecture Search (NAS) is performed directly on the deployment device to find the best tiny neural architecture for analyzing the…
26 -
arXiv — Machine Learning research 5d ago
LLM Evolution as an Industry-Scale Ecosystem: A Lifecycle Perspective on Continual Learning
arXiv:2606.24901v1 Announce Type: new Abstract: Continual learning capability is critical for Industrial LLMs, as deployed models must be continuously updated to meet evolving requirements and environments, rather than repeatedly retrained from scratch. However, most existing…
6 -
arXiv — Machine Learning research 5d ago
When Do Conservation Laws Survive Learned Representations? Certified Horizons for Latent World Models
arXiv:2606.24945v1 Announce Type: new Abstract: We ask a representation-learning question about physical world models: when does a conservation law remain certifiable after a model learns a latent representation? A certified horizon bounds -- in advance, from measurable model…
37 -
arXiv — Machine Learning research 5d ago
Conformal Orbit-Valid Trust Horizons for Equivariant World Models
arXiv:2606.24946v1 Announce Type: new Abstract: Learned world models are useful only over horizons on which their rollout error remains controlled. We study trust-horizon certification for latent world models with known group symmetries. Given a one-step latent residual and a…
37 -
arXiv — Machine Learning research 5d ago
Supervised Reinforcement Learning for the Coordination of Distributed Energy Resources
arXiv:2606.24947v1 Announce Type: new Abstract: The increasing integration of distributed energy resources (DERs) is crucial for power system decarbonization, yet unlocking DERs' flexibility is challenged by their inherent uncertainties and modelling complexity. As traditional…
27 -
arXiv — Machine Learning research 5d ago
Holographic Memory for Zero-Shot Compositional Reasoning in Knowledge Graphs: A Mechanistic Study of Where and Why It Fails
arXiv:2606.24948v1 Announce Type: new Abstract: Knowledge graph embedding (KGE) models predict single-hop links well but have no mechanism for zero-shot compositional queries: multi-hop questions whose relation chains never appeared during training. Holographic Reduced…
31 -
arXiv — Machine Learning research 5d ago
MacroLens: A Multi-Task Benchmark for Contextual Financial Reasoning under Macroeconomic Scenarios
arXiv:2606.24950v1 Announce Type: new Abstract: Financial decision-making is contextual: forecasting prices, valuing companies, and assessing event exposure weigh price history, accounting fundamentals, macroeconomic regime, and contemporaneous text. A benchmark over these four…
25 -
arXiv — Machine Learning research 5d ago
How Complexity Contributes to Learning Opacity in Machine Learning
arXiv:2606.24953v1 Announce Type: new Abstract: Machine learning (ML) algorithms are known to be opaque. We do not know the reasons for their predictions. The learning process leading to the prediction function is also opaque. We do not fully understand the time evolution of the…
22 -
arXiv — Machine Learning research 5d ago
Convex--Concave Quadratic Spectral Filtering for Graph Neural Networks
arXiv:2606.24956v1 Announce Type: new Abstract: Spectral graph neural networks (GNNs) interpret message passing as frequency-selective filtering. While low-order spectral filters are efficient, their limited selectivity often leads to weak attenuation outside the passband,…
28 -
arXiv — Machine Learning research 5d ago
Swarm-Inspired Generation of Collective Behaviors in Graph Dynamical Systems
arXiv:2606.24958v1 Announce Type: new Abstract: Collective behavior arises when locally interacting units produce coordinated global organization, from synchronization in dynamical systems to task-relevant information flow on graphs. The central challenge is not only to explain…
9 -
arXiv — Machine Learning research 5d ago
Reliable Conformal Prediction for Ordinal Classification Using the Ranked Probability Score
arXiv:2606.24959v1 Announce Type: new Abstract: Ordinal classification (OC) arises in high-stakes domains such as medicine and finance, where uncertainty quantification must account for the severity of ordinal errors. Conformal prediction (CP) provides distribution-free…
22 -
arXiv — Machine Learning research 5d ago
Enhancing Clinician Decision-Making via Uncertainty-Aware Multi-Expert Fusion for Stroke Rehabilitation
arXiv:2606.24960v1 Announce Type: new Abstract: Tailoring stroke rehabilitation requires assessing how movements are organized, not merely if they succeed. Currently, this assessment is a rate-limiting bottleneck. Instruments like the Action Research Arm Test (ARAT) compress…
20 -
arXiv — Machine Learning research 5d ago
Towards Scalable Multi-Task Reinforcement Learning with Large Decision Models
arXiv:2606.24962v1 Announce Type: new Abstract: Recent progress in large-scale sequence modeling has shown that a single model can learn useful representations across highly diverse data distributions. Inspired by these advances, we investigate whether a unified transformer…
21 -
arXiv — Machine Learning research 5d ago
Evidence for feature-specific error correction in LLMs
arXiv:2606.24964v1 Announce Type: new Abstract: Understanding the features of large language models (LLMs) is a central goal of interpretability. LLMs are commonly assumed to use superposition to represent more features than they have dimensions. They may not only represent…
20 -
arXiv — Machine Learning research 5d ago
Learning Dynamical Systems from Multiple Sparse Datasets: A Hierarchical Bayesian Modeling Approach
arXiv:2606.24966v1 Announce Type: new Abstract: Estimating parameters of dynamical systems from sparse, noisy, and irregularly sampled data is often severely ill-conditioned. When multiple related datasets are available, they provide additional information if the shared…
30 -
arXiv — Machine Learning research 5d ago
What Do Language Priors Contribute to Darcy-Flow Inversion? A Mechanistic Audit
arXiv:2606.24967v1 Announce Type: new Abstract: In ill-posed inverse problems, the recovered solution depends as much on the prior as on the data, yet much of the engineering knowledge that could serve as that prior is recorded qualitatively rather than in formal mathematical…
24 -
arXiv — Machine Learning research 5d ago
Training Dynamics of Neural Software Defect Predictors under Coupled Data-Quality Issues
arXiv:2606.24968v1 Announce Type: new Abstract: Context: Software defect prediction supports maintenance decisions such as testing prioritization, release-risk assessment, and quality monitoring. However, metric-based SDP datasets often contain coupled data-quality issues,…
6 -
arXiv — Machine Learning research 5d ago
Frequency Domain Reservoir Computing
arXiv:2606.24969v1 Announce Type: new Abstract: While the quadratic sequence-length bottleneck of transformers has fueled a resurgence in recurrent models, effectively capturing complex dynamics requires architectures that balance efficient training with highly expressive latent…
7 -
arXiv — Machine Learning research 5d ago
Don't Go Breaking My LLM: The Impact of Pruning Attention Layers on Explanation Faithfulness and Confidence Calibration
arXiv:2606.24970v1 Announce Type: new Abstract: Pruning Large Language Models (LLMs) reduces memory and inference costs by removing parts of the network, producing smaller models that retain most of their accuracy. As attention layers are the most resource-intensive parts of…
32 -
arXiv — Machine Learning research 5d ago
Quantifying Explainable AI-introduced signal noise on ECG data with Spectral Entropy
arXiv:2606.24974v1 Announce Type: new Abstract: Explainability techniques are used to assess the output of various deep learning models. This is especially true in healthcare, where models need to be trusted and decisions justified. Explainability (XAI) tools use heuristics…
22 -
arXiv — Machine Learning research 5d ago
Why Do Accumulated Transformations Extrapolate?
arXiv:2606.24975v1 Announce Type: new Abstract: PaTH Attention showed that replacing RoPE's position-indexed rotations with accumulated data-dependent Householder reflections yields strong length extrapolation, though performance degrades at extreme context lengths. We ask…
22 -
arXiv — Machine Learning research 5d ago
Auto-Configured Explainable Graph Neural Networks for Multi-Site Pollution Prediction
arXiv:2606.24978v1 Announce Type: new Abstract: Accurate particulate matter (PM) prediction is crucial for mitigating air pollution. Graph Neural Networks (GNNs) effectively model spatiotemporal dependencies, but predefined graphs limit adaptability, and some datasets complicate…
25 -
arXiv — Machine Learning research 5d ago
CKM-Driven Communication-Aware UAV Intelligent Trajectory Optimization for Urban Inspection
arXiv:2606.24979v1 Announce Type: new Abstract: Unmanned aerial vehicles (UAVs) are increasingly employed in urban inspection tasks, where reliable communication is critical but challenging due to the severe spatial channel heterogeneity. To address the issue, in this paper, we…
19 -
arXiv — Machine Learning research 5d ago
Closed-Loop Graph Algorithm Execution with Small Language Models: Step Accuracy and Rollout Reliability
arXiv:2606.24980v1 Announce Type: new Abstract: Small language models offer an efficient alternative to large-scale systems, but their ability to execute structured algorithms over multiple dependent decisions remains poorly understood. We study graph algorithm execution as a…
24 -
arXiv — Machine Learning research 5d ago
Latent Block-Diffusion Temporal Point Processes: A Semi-Autoregressive Framework for Asynchronous Event Sequence Generation
arXiv:2606.24982v1 Announce Type: new Abstract: Modeling and sampling from the underlying distribution of asynchronous event sequences are crucial in various real-world applications, including social networks, medical diagnosis, and financial transactions. Existing…
35 -
arXiv — Machine Learning research 5d ago
Learning Diachronic Representations of Ancient Greek Letterforms
arXiv:2606.24984v1 Announce Type: new Abstract: Learning representations that remain robust across centuries of variation in handwriting is a key challenge in diachronic representation learning. Taking one of the longest continuously used writing systems, ancient Greek, as a…
27 -
arXiv — Machine Learning research 5d ago
Retrieval-Augmented Personalization with Foundation Models for Wearable Stress Detection
arXiv:2606.24985v1 Announce Type: new Abstract: Personalization in wearable-based stress detection remains challenging due to substantial inter-individual variability in physiological and behavioral responses. While traditional approaches rely on user-specific fine-tuning or…
5 -
arXiv — Machine Learning research 5d ago
Uncertainty-aware reinforcement learning for chemical language models
arXiv:2606.24990v1 Announce Type: new Abstract: Reinforcement Learning (RL) has become a powerful paradigm for de novo molecular design, enabling Chemical Language Models (CLMs) to navigate and explore the chemical space while optimizing specific desired properties. However, the…
26 -
arXiv — Machine Learning research 5d ago
The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order
arXiv:2606.24993v1 Announce Type: new Abstract: Sequential learning is order-dependent: from Pile-style next-token domain adaptation to instruction-SFT and DPO, N candidate sources induce N! possible curricula. We show that the local order effect is governed by a computable…
7 -
arXiv — Machine Learning research 5d ago
ExTra: Exploratory Trajectory Optimization for Language Model Reinforcement Learning
arXiv:2606.24994v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) for language-model reasoning can fail at both extremes of task difficulty: easy prompts often produce all-correct, low-diversity rollout groups with little gradient signal,…
25 -
arXiv — Machine Learning research 5d ago
Are Tabular Foundation Models Robust to Realistic Query Distribution Shifts in Microbiome Data?
arXiv:2606.24995v1 Announce Type: new Abstract: Tabular foundation models (TFMs) achieve strong performance on microbiome abundance data, yet their robustness under realistic distribution shift remains poorly characterized. We introduce a benchmark that evaluates the robustness…
22 -
arXiv — Machine Learning research 5d ago
From Forecasting Leaderboards to Deployment Decisions: A Fail-Closed Certification Protocol
arXiv:2606.24996v1 Announce Type: new Abstract: Forecasting leaderboards rank models by predictive quality, but their winners are often read as deployment-ready top-1 advice. That reading can fail when forecasts are passed through a fixed decision interface, such as an alert…
23 -
arXiv — Machine Learning research 5d ago
What's in an Earth Embedding? An Explainability Analysis of Location Encoders
arXiv:2606.24997v1 Announce Type: new Abstract: Geographic implicit neural representations (INRs) learn to map any coordinate on Earth to a location embedding, implicitly encoding geospatial data into the weights of a neural network. Location embeddings are widely used off the…
15 -
arXiv — Machine Learning research 5d ago
Internal Data Repetition Destroys Language Models
arXiv:2606.24998v1 Announce Type: new Abstract: Language models are running out of high-quality training data, and even aggressively deduplicated corpora retain some amount of repetition. Earlier controlled studies predated Chinchilla-style scaling laws and could only measure…
5 -
arXiv — Machine Learning research 5d ago
Geo-Strat-RL: Learning Geological Event Reasoning from Verifiable Tasks
arXiv:2606.25000v1 Announce Type: new Abstract: To evaluate whether vision-language models can reason about geological histories, it is necessary to construct observations for which the underlying process history is known. Furthermore, reasoning over geological histories is not…
6 -
arXiv — Machine Learning research 5d ago
Erased, but Not Gone: Output Forgetting Is Not True Forgetting
arXiv:2606.25001v1 Announce Type: new Abstract: Machine unlearning (MU) is commonly judged by output forgetting, such as low forget-set accuracy or reduced logit-level membership inference. But if output-level success can coexist with retraining-inconsistent residuals in…
26 -
arXiv — Machine Learning research 5d ago
TRACER: Training-Free Closed-Loop Structured Inference for Traffic Accident Reconstruction
arXiv:2606.25002v1 Announce Type: new Abstract: Traffic accident reconstruction is a forensic inverse problem that requires recovering physically consistent motion from sparse and heterogeneous evidence. Existing learning-based approaches predominantly optimize for semantic…
23 -
arXiv — Machine Learning research 5d ago
Adaptive Joint Compression and Synchronisation in Federated Split Learning for IoT Rainfall Prediction
arXiv:2606.25003v1 Announce Type: new Abstract: Federated split learning (FSL) enables collaborative training across bandwidth-constrained IoT devices, but repeated activation and gradient exchange creates a communication bottleneck. Prior work optimises either activation…
6 -
arXiv — Machine Learning research 5d ago
Certification of Machine Learning Models via Directional Sharpness
arXiv:2606.25004v1 Announce Type: new Abstract: In machine learning, model certification has been identified as an important method for gaining assurance about a model's trustworthiness and quality. A model's quality is largely determined by its ability to generalize, i.e., to…
13 -
arXiv — Machine Learning research 5d ago
Scalable Peptide Design via Memory-Efficient Equivariant Transformer
arXiv:2606.25006v1 Announce Type: new Abstract: Target-specific peptide design requires sequence and structure co-design under full atom geometric constraints. Latent generative frameworks offer an effective route for this problem by compressing fine grained atomic structures…
13 -
arXiv — Machine Learning research 5d ago
Multi-Stream Temporal Fusion for Financial Fraud Detection
arXiv:2606.25007v1 Announce Type: new Abstract: Financial fraud detection in digital banking requires reasoning over multiple heterogeneous event streams -- transactions, login sessions, risk signals -- that individually appear benign but collectively reveal fraudulent patterns.…
14