arXiv — Machine Learning
500 articles archived · Visit source ↗ · RSS
-
arXiv — Machine Learning research 4d ago
Physics-guided Convolutional Neural Network for Domain Growth Prediction in Systems with Conserved Kinetics
arXiv:2606.26128v1 Announce Type: new Abstract: The spatiotemporal evolution of many physical, chemical, and biological systems is described by nonlinear partial differential equations (PDEs). Recently, deep neural network-based surrogate models have gained increasing interest…
16 -
-
-
arXiv — Machine Learning research 4d ago
Neural Architecture Search for Generative Adversarial Networks: A Comprehensive Review and Critical Analysis
arXiv:2606.26169v1 Announce Type: new Abstract: Neural Architecture Search (NAS) has emerged as a pivotal technique in optimizing the design of Generative Adversarial Networks (GANs), automating the search for effective architectures while addressing the challenges inherent in…
15 -
arXiv — Machine Learning research 4d ago
KG-TRACE: A Neuro-Symbolic Framework for Mechanistic Grounding in Antimicrobial Resistance Prediction
arXiv:2606.26179v1 Announce Type: new Abstract: While WGS-based AMR prediction has reached high accuracy, existing models lack a mechanism to ground neural attributions in established biological pathways. We present KG-TRACE, a novel neuro-symbolic framework that integrates the…
36 -
arXiv — Machine Learning research 4d ago
Necessary but Not Sufficient: Temperature Control and Reproducibility in LLM-as-Judge Safety Evaluations
arXiv:2606.26185v1 Announce Type: new Abstract: LLM-as-judge ("grader") components are now standard in evaluation harnesses, including safety evaluations where a pass/fail verdict may gate downstream deployment decisions. A widespread assumption is that setting the grader's…
4 -
arXiv — Machine Learning research 4d ago
Clue-Guided Money Laundering Group Discovery
arXiv:2606.26189v1 Announce Type: new Abstract: Money Laundering Group Discovery (MLGD) aims to identify hidden criminal groups and recover their complete structures in large-scale financial networks. Existing graph anomaly detection methods mainly produce node-level risk…
17 -
arXiv — Machine Learning research 4d ago
Federated Hash Projected Latent Factor Learning
arXiv:2606.26192v1 Announce Type: new Abstract: Hash Learning (HL) is an efficient representation learning approach that maps real-valued data into compact binary representations. Traditional HL methods typically require users to upload personal data to a central server, which…
13 -
arXiv — Machine Learning research 4d ago
Statistical and Structural Approaches to Algorithmic Fairness
arXiv:2606.26200v1 Announce Type: new Abstract: Modern machine learning systems have outgrown their origins as isolated predictive constructs, evolving into complex socio-technical architectures that actively mediate human opportunity. As algorithms increasingly determine access…
29 -
arXiv — Machine Learning research 4d ago
Topology-Informed Neural Networks for Flood Detection in Optical and Synthetic Aperture Radar Imagery
arXiv:2606.26204v1 Announce Type: new Abstract: Floods frequently impact regions around the world. Rapid and accurate flood detection is crucial for emergency response and timely mitigation of human and economic loss. The expanding availability of satellite data and advances in…
16 -
arXiv — Machine Learning research 4d ago
A General Framework for Learning Algebraic Properties from Cayley Graphs using Graph Neural Networks
arXiv:2606.26212v1 Announce Type: new Abstract: A Graph Neural Network (GNN) framework for predicting the solvability of finite groups from their Cayley graph representations was introduced in [1]. In the present work, we generalize this approach and develop a…
18 -
arXiv — Machine Learning research 4d ago
Fast LeWorldModel
arXiv:2606.26217v1 Announce Type: new Abstract: Joint-Embedding Predictive Architectures (JEPAs), including recent LeWorldModel (LeWM), have become a promising foundation for reconstruction-free visual world models. For visual planning, however, LeWM evaluates candidate action…
32 -
arXiv — Machine Learning research 4d ago
Dataset Usage Inference without Shadow Models or Held-out Data
arXiv:2606.26257v1 Announce Type: new Abstract: How much of my data was used to train a machine learning model? Dataset Usage Inference (DUI) aims to answer this by estimating what fraction of a dataset contributed to a model's training. However, existing DUI methods rely on…
27 -
arXiv — Machine Learning research 4d ago
Equivariance and Augmentation for Bayesian Neural Networks
arXiv:2606.26273v1 Announce Type: new Abstract: Symmetries are important for many deep learning tasks, ranging from applications in the sciences to medical imaging. However, there is an ongoing debate about whether to impose symmetry constraints on the neural network…
33 -
arXiv — Machine Learning research 4d ago
SSM Adapters via Hankel Reduced-order Modeling: Injection Site Determines Task Suitability in Long-Context Fine-Tuning
arXiv:2606.26290v1 Announce Type: new Abstract: While parameter-efficient fine-tuning (PEFT) typically targets attention projectors, its efficacy for tasks requiring sequential state accumulation remains under-explored. We examine if PEFT for such tasks can benefit from state…
18 -
arXiv — Machine Learning research 4d ago
The Red Queen G\"odel Machine: Co-Evolving Agents and Their Evaluators
arXiv:2606.26294v1 Announce Type: new Abstract: Self-improving agents are state-of-the-art (SOTA) on agentic coding benchmarks and have recently been extended to general domains. However, their search methods generally assume a stationary evaluation criterion: a fixed verifier,…
25 -
arXiv — Machine Learning research 4d ago
High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence
arXiv:2606.26316v1 Announce Type: new Abstract: We study first-order methods for smooth objectives satisfying the Polyak-\L{}ojasiewicz (PL) condition when gradient samples are generated by an exogenous Markov chain. In the light-tailed setting, prior uniform-in-time…
7 -
arXiv — Machine Learning research 4d ago
EVOM: Agentic Meta-Evolution of Actor-Critic Architectures for Reinforcement Learning
arXiv:2606.26327v1 Announce Type: new Abstract: In actor-critic reinforcement learning, network architectures are typically manually designed. Automating this design is challenging because each candidate must be trained before evaluation, and the design space is open-ended. To…
29 -
arXiv — Machine Learning research 4d ago
Mesh-RL: Coupled subgrid reinforcement learning
arXiv:2606.26333v1 Announce Type: new Abstract: Reinforcement learning in large or sparse-reward environments suffers from slow temporal-difference reward propagation, as value information spreads only locally across the state space. We propose Mesh-RL, a spatial…
27 -
arXiv — Machine Learning research 4d ago
EMA-FS: Accelerating GBDT Training via Gain-Informed Feature Screening
arXiv:2606.26337v1 Announce Type: new Abstract: Gradient Boosted Decision Trees (GBDT), exemplified by LightGBM, spend a dominant fraction of training time -- typically 65-70% -- constructing per-feature histograms. Existing approaches such as random feature subsampling…
23 -
arXiv — Machine Learning research 4d ago
Does Aurora Encode Atmospheric Structure? Latent Regime Analysis and Attribution
arXiv:2606.26361v1 Announce Type: new Abstract: ML foundation models are able to emulate atmospheric dynamics accurately and efficiently but operate as opaque ``black boxes''. We investigate the internal representations of the Aurora model using spatially pooled PCA and…
35 -
arXiv — Machine Learning research 4d ago
SOLAR: AI-Powered Speed-of-Light Performance Analysis
arXiv:2606.26383v1 Announce Type: new Abstract: How fast could a deep-learning model run on target hardware, and how far is today's implementation from that limit? These questions are central to software, hardware, and algorithm optimizations. Speed-of-Light (SOL) analysis…
11 -
arXiv — Machine Learning research 4d ago
At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization
arXiv:2606.26396v1 Announce Type: new Abstract: Pre-trained transformers have demonstrated remarkable generalization abilities, at times extending beyond the scope of their training data. Yet, real-world deployments often face unexpected or adversarial data that diverges from…
34 -
arXiv — Machine Learning research 4d ago
Deterministic Pareto-Optimal Policy Synthesis for Multi-Objective Reinforcement Learning
arXiv:2606.26397v1 Announce Type: new Abstract: Real-world decision-making often requires balancing multiple conflicting objectives, a challenge that standard Reinforcement Learning (RL) frequently addresses by aggregating rewards into a single scalar signal. While effective for…
14 -
-
arXiv — Machine Learning research 4d ago
Otter Weather: Skillful and Computationally Efficient Medium-Range Weather Forecasting
arXiv:2606.26421v1 Announce Type: new Abstract: State-of-the-art medium-range AI weather models can outperform traditional Numerical Weather Prediction (NWP) but require massive training budgets. This restricts usage for under-resourced groups and severely limits fast model…
4 -
arXiv — Machine Learning research 4d ago
Rethinking Training & Inference for Forecasting: Linking Winner-Take-All back to GMMs
arXiv:2606.26424v1 Announce Type: new Abstract: Trajectory forecasting for autonomous driving has advanced rapidly, yet representative models often produce uninformative posteriors over forecast modes, causing problems for mode pruning. We trace this to a modeling-training…
37 -
arXiv — Machine Learning research 4d ago
DualEval: Joint Model-Item Calibration for Unified LLM Evaluation
arXiv:2606.26429v1 Announce Type: new Abstract: Current LLM evaluation relies on two complementary but often disconnected signals: static benchmarks with objective correctness labels and arena-style preference data that better reflect open-ended user interactions. We introduce…
24 -
arXiv — Machine Learning research 4d ago
Embedding Foundation Model Predictions in Discrete-Choice Models with Structural Guarantees
arXiv:2606.26432v1 Announce Type: new Abstract: Tabular foundation models achieve strong accuracy on choice prediction tasks, but their predictions often violate the economic logic those tasks require: raising a price can increase predicted demand, implied willingness-to-pay…
36 -
arXiv — Machine Learning research 4d ago
Optimizing CUDA like a Human: Micro-Profiling Tools as Expert Surrogates for LLM-Based GPU Kernel Optimization
arXiv:2606.26453v1 Announce Type: new Abstract: We present KernelPro, a closed-loop multi-agent system that automatically generates, profiles, and iteratively optimizes GPU kernel code by integrating large language model (LLM) code generation with hardware profiler feedback and…
21 -
arXiv — Machine Learning research 4d ago
Finding the Time to Think: Learning Planning Budgets in Real-Time RL
arXiv:2606.26463v1 Announce Type: new Abstract: Deliberating takes time. In real-time settings, that time is not free. Standard reinforcement learning (RL) sidesteps this as the environment waits indefinitely for the agent's decision. Instead, we study real-time RL environments…
38 -
arXiv — Machine Learning research 4d ago
A Causal Foundation Model for Structure and Outcome Prediction
arXiv:2606.26467v1 Announce Type: new Abstract: We introduce TabPFN-CFM, a causal foundation model that can handle multiple causal problems. TabPFN-CFM predicts both causal structure and outcomes from observational data, supports queries on all three levels of Pearl's Causal…
9 -
arXiv — Machine Learning research 4d ago
Epiphany-Aware KV Cache Eviction Without the Attention Matrix
arXiv:2606.26472v1 Announce Type: new Abstract: As reasoning models emit chains of thought tens of thousands of tokens long, KV cache increasingly becomes a deployment bottleneck. Existing cache eviction methods rank tokens by attention weight, which is a noisy importance proxy…
21 -
arXiv — Machine Learning research 4d ago
When Does Quality-Aware Multimodal Fusion Matter? A Leakage-Safe Diagnostic for Decision-Level Dependence
arXiv:2606.26473v1 Announce Type: new Abstract: Many multimodal systems estimate the reliability of each modality and weight their contributions to the final prediction. However, it remains unclear whether these scores influence model decisions or merely correlate with…
20 -
arXiv — Machine Learning research 4d ago
Localizing RL-Induced Tool Use to a Single Crosscoder Feature
arXiv:2606.26474v1 Announce Type: new Abstract: Fine-tuning through RL reshapes the internal representations of language models to enable agentic behaviors such as tool use, yet the mechanistic basis of these changes remains poorly understood. While RL substantially improves…
4 -
-
arXiv — Machine Learning research 4d ago
What Survives When You Compress a Recursive Reasoner for the Edge?
arXiv:2606.26488v1 Announce Type: new Abstract: Recursive reasoning models can solve complex structured tasks with only a few million parameters by repeatedly updating a latent state. Deploying these models on edge hardware requires significant compression, but unlike…
30 -
arXiv — Machine Learning research 4d ago
Learning Probabilistic Filters with Strictly Proper Scoring Rules
arXiv:2606.26497v1 Announce Type: new Abstract: Bayesian filtering of partially and noisily observed dynamical systems seeks to infer the evolving conditional distribution of the state of a dynamical system, given observations, in an online fashion. This Bayesian filtering…
7 -
arXiv — Machine Learning research 4d ago
Multipath Adaptive Gated Bottleneck Latent ODE with Raman Data Fusion for Cell Culture Process Forecasting
arXiv:2606.26520v1 Announce Type: new Abstract: Mammalian cell-culture processes underpin the manufacture of many biopharmaceuticals, yet keeping a run on track is hard: critical process parameters drift over days, and an off-specification trend is often confirmed too late to…
4 -
arXiv — Machine Learning research 4d ago
Theory-Scale Auto-Formalization of Logics for Computer Science
arXiv:2606.26525v1 Announce Type: new Abstract: Auto-formalization is critical for scalable formal verification, but existing progress largely focuses on isolated statements, while theory-scale auto-formalization, which coherently translates hundreds of interdependent…
8 -
arXiv — Machine Learning research 4d ago
Sample-efficient Transfer Reinforcement Learning via Adaptive Reward Shaping and Policy-Ratio Reweighting Strategy
arXiv:2606.26527v1 Announce Type: new Abstract: Transfer learning improves policy learning efficiency by reusing knowledge from source tasks, providing a feasible paradigm for safe and efficient autonomous highway lane changing decision-making. Existing methods frequently…
25 -
arXiv — Machine Learning research 4d ago
CascadeFormer: Depth-Tapered Transformers Motivated by Gradient Fan-in Asymmetry
arXiv:2606.26538v1 Announce Type: new Abstract: Deep Transformers are composed of uniformly stacked residual blocks, yet their deepest layers often add little value. We present two efficiency methods that exploit this asymmetry. CascadeFormer tapers width with depth to match the…
31 -
-
arXiv — Machine Learning research 4d ago
Revisiting Action Factorization for Complex Action Spaces
arXiv:2606.26574v1 Announce Type: new Abstract: Many real-world control problems involve hybrid discrete-continuous action spaces. For example, steering and signaling in autonomous driving, and aiming and firing in robotics or video-games. Despite real-world hybrid factorization…
10 -
arXiv — Machine Learning research 4d ago
SharQ: Bridging Activation Sparsity and FP4 Quantization for LLM Inference
arXiv:2606.26587v1 Announce Type: new Abstract: Low-bit floating-point formats and semi-structured sparsity are increasingly supported by modern accelerators, yet combining them for LLM activation compression remains challenging: activations contain input-dependent outliers that…
29 -
-
arXiv — Machine Learning research 4d ago
Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling
arXiv:2606.26617v1 Announce Type: new Abstract: Scaling laws describe how learning performance varies with model size, data size, and compute. While recent theoretical work has established scaling laws for sketched linear regression, much less is understood for contrastive…
25 -
arXiv — Machine Learning research 4d ago
Discovering Millions of Interpretable Features with Sparse Autoencoders
arXiv:2606.26620v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) have emerged as a powerful tool for decomposing superposed language model representations into sparse and interpretable features. However, training SAEs is computationally expensive, and available…
5 -
arXiv — Machine Learning research 4d ago
From Weights to Features: SAE-Guided Activation Regularization for LLM Continual Learning
arXiv:2606.26629v1 Announce Type: new Abstract: Weight-space regularization methods such as Elastic Weight Consolidation (EWC) are the standard approach to catastrophic forgetting in continual learning. However, those methods tend to underperform when applied to large language…
15 -
arXiv — Machine Learning research 4d ago
Target-Aware Bandit Allocation for Scalable Surrogate Optimization in Chemical Space
arXiv:2606.26657v1 Announce Type: new Abstract: Identifying high-utility candidates from massive discrete spaces under expensive evaluations is a recurring challenge across the sciences, with structure-based drug discovery as a prominent example. While surrogate-based…
20