arXiv — Machine Learning

500 articles archived · Visit source ↗ · RSS

arXiv — Machine Learning research 4d ago

Physics-guided Convolutional Neural Network for Domain Growth Prediction in Systems with Conserved Kinetics

arXiv:2606.26128v1 Announce Type: new Abstract: The spatiotemporal evolution of many physical, chemical, and biological systems is described by nonlinear partial differential equations (PDEs). Recently, deep neural network-based surrogate models have gained increasing interest…

16
arXiv — Machine Learning research 4d ago

\chisao{}: A GPU-Native Parallel Optimizer for Multimodal Black-Box Functions via Convergence-Anticonvergence Oscillation

arXiv:2606.26164v1 Announce Type: new Abstract: Finding all modes of a multimodal black-box function is a fundamental challenge in optimization, Bayesian inference, and scientific computing. Existing approaches -- basin-hopping, CMA-ES, multistart gradient descent -- operate…

26
arXiv — Machine Learning research 4d ago

Implementation of reinforcement learning in chemical reaction networks: application to phototaxis as curiosity-driven exploration

arXiv:2606.26168v1 Announce Type: new Abstract: Living systems navigate environments using noisy and incomplete sensory signals. In unicellular algae, phototaxis is often modeled as a mechanistic run--tumble process driven by stimulus--response rules. However, such descriptions…

30
arXiv — Machine Learning research 4d ago

Neural Architecture Search for Generative Adversarial Networks: A Comprehensive Review and Critical Analysis

arXiv:2606.26169v1 Announce Type: new Abstract: Neural Architecture Search (NAS) has emerged as a pivotal technique in optimizing the design of Generative Adversarial Networks (GANs), automating the search for effective architectures while addressing the challenges inherent in…

15
arXiv — Machine Learning research 4d ago

KG-TRACE: A Neuro-Symbolic Framework for Mechanistic Grounding in Antimicrobial Resistance Prediction

arXiv:2606.26179v1 Announce Type: new Abstract: While WGS-based AMR prediction has reached high accuracy, existing models lack a mechanism to ground neural attributions in established biological pathways. We present KG-TRACE, a novel neuro-symbolic framework that integrates the…

36
arXiv — Machine Learning research 4d ago

Necessary but Not Sufficient: Temperature Control and Reproducibility in LLM-as-Judge Safety Evaluations

arXiv:2606.26185v1 Announce Type: new Abstract: LLM-as-judge ("grader") components are now standard in evaluation harnesses, including safety evaluations where a pass/fail verdict may gate downstream deployment decisions. A widespread assumption is that setting the grader's…

4
arXiv — Machine Learning research 4d ago

Clue-Guided Money Laundering Group Discovery

arXiv:2606.26189v1 Announce Type: new Abstract: Money Laundering Group Discovery (MLGD) aims to identify hidden criminal groups and recover their complete structures in large-scale financial networks. Existing graph anomaly detection methods mainly produce node-level risk…

17
arXiv — Machine Learning research 4d ago

Federated Hash Projected Latent Factor Learning

arXiv:2606.26192v1 Announce Type: new Abstract: Hash Learning (HL) is an efficient representation learning approach that maps real-valued data into compact binary representations. Traditional HL methods typically require users to upload personal data to a central server, which…

13
arXiv — Machine Learning research 4d ago

Statistical and Structural Approaches to Algorithmic Fairness

arXiv:2606.26200v1 Announce Type: new Abstract: Modern machine learning systems have outgrown their origins as isolated predictive constructs, evolving into complex socio-technical architectures that actively mediate human opportunity. As algorithms increasingly determine access…

29
arXiv — Machine Learning research 4d ago

Topology-Informed Neural Networks for Flood Detection in Optical and Synthetic Aperture Radar Imagery

arXiv:2606.26204v1 Announce Type: new Abstract: Floods frequently impact regions around the world. Rapid and accurate flood detection is crucial for emergency response and timely mitigation of human and economic loss. The expanding availability of satellite data and advances in…

16
arXiv — Machine Learning research 4d ago

A General Framework for Learning Algebraic Properties from Cayley Graphs using Graph Neural Networks

arXiv:2606.26212v1 Announce Type: new Abstract: A Graph Neural Network (GNN) framework for predicting the solvability of finite groups from their Cayley graph representations was introduced in [1]. In the present work, we generalize this approach and develop a…

18
arXiv — Machine Learning research 4d ago

Fast LeWorldModel

arXiv:2606.26217v1 Announce Type: new Abstract: Joint-Embedding Predictive Architectures (JEPAs), including recent LeWorldModel (LeWM), have become a promising foundation for reconstruction-free visual world models. For visual planning, however, LeWM evaluates candidate action…

32
arXiv — Machine Learning research 4d ago

Dataset Usage Inference without Shadow Models or Held-out Data

arXiv:2606.26257v1 Announce Type: new Abstract: How much of my data was used to train a machine learning model? Dataset Usage Inference (DUI) aims to answer this by estimating what fraction of a dataset contributed to a model's training. However, existing DUI methods rely on…

27
arXiv — Machine Learning research 4d ago

Equivariance and Augmentation for Bayesian Neural Networks

arXiv:2606.26273v1 Announce Type: new Abstract: Symmetries are important for many deep learning tasks, ranging from applications in the sciences to medical imaging. However, there is an ongoing debate about whether to impose symmetry constraints on the neural network…

33
arXiv — Machine Learning research 4d ago

SSM Adapters via Hankel Reduced-order Modeling: Injection Site Determines Task Suitability in Long-Context Fine-Tuning

arXiv:2606.26290v1 Announce Type: new Abstract: While parameter-efficient fine-tuning (PEFT) typically targets attention projectors, its efficacy for tasks requiring sequential state accumulation remains under-explored. We examine if PEFT for such tasks can benefit from state…

18
arXiv — Machine Learning research 4d ago

The Red Queen G\"odel Machine: Co-Evolving Agents and Their Evaluators

arXiv:2606.26294v1 Announce Type: new Abstract: Self-improving agents are state-of-the-art (SOTA) on agentic coding benchmarks and have recently been extended to general domains. However, their search methods generally assume a stationary evaluation criterion: a fixed verifier,…

25
arXiv — Machine Learning research 4d ago

High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

arXiv:2606.26316v1 Announce Type: new Abstract: We study first-order methods for smooth objectives satisfying the Polyak-\L{}ojasiewicz (PL) condition when gradient samples are generated by an exogenous Markov chain. In the light-tailed setting, prior uniform-in-time…

7
arXiv — Machine Learning research 4d ago

EVOM: Agentic Meta-Evolution of Actor-Critic Architectures for Reinforcement Learning

arXiv:2606.26327v1 Announce Type: new Abstract: In actor-critic reinforcement learning, network architectures are typically manually designed. Automating this design is challenging because each candidate must be trained before evaluation, and the design space is open-ended. To…

29
arXiv — Machine Learning research 4d ago

Mesh-RL: Coupled subgrid reinforcement learning

arXiv:2606.26333v1 Announce Type: new Abstract: Reinforcement learning in large or sparse-reward environments suffers from slow temporal-difference reward propagation, as value information spreads only locally across the state space. We propose Mesh-RL, a spatial…

27
arXiv — Machine Learning research 4d ago

EMA-FS: Accelerating GBDT Training via Gain-Informed Feature Screening

arXiv:2606.26337v1 Announce Type: new Abstract: Gradient Boosted Decision Trees (GBDT), exemplified by LightGBM, spend a dominant fraction of training time -- typically 65-70% -- constructing per-feature histograms. Existing approaches such as random feature subsampling…

23
arXiv — Machine Learning research 4d ago

Does Aurora Encode Atmospheric Structure? Latent Regime Analysis and Attribution

arXiv:2606.26361v1 Announce Type: new Abstract: ML foundation models are able to emulate atmospheric dynamics accurately and efficiently but operate as opaque ``black boxes''. We investigate the internal representations of the Aurora model using spatially pooled PCA and…

35
arXiv — Machine Learning research 4d ago

SOLAR: AI-Powered Speed-of-Light Performance Analysis

arXiv:2606.26383v1 Announce Type: new Abstract: How fast could a deep-learning model run on target hardware, and how far is today's implementation from that limit? These questions are central to software, hardware, and algorithm optimizations. Speed-of-Light (SOL) analysis…

11
arXiv — Machine Learning research 4d ago

At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization

arXiv:2606.26396v1 Announce Type: new Abstract: Pre-trained transformers have demonstrated remarkable generalization abilities, at times extending beyond the scope of their training data. Yet, real-world deployments often face unexpected or adversarial data that diverges from…

34
arXiv — Machine Learning research 4d ago

Deterministic Pareto-Optimal Policy Synthesis for Multi-Objective Reinforcement Learning

arXiv:2606.26397v1 Announce Type: new Abstract: Real-world decision-making often requires balancing multiple conflicting objectives, a challenge that standard Reinforcement Learning (RL) frequently addresses by aggregating rewards into a single scalar signal. While effective for…

14
arXiv — Machine Learning research 4d ago

Beyond Feedforward Networks: Reentry Neural Systems as the Fundamental Basis of Subjecthood and Intrinsic Safety of Next-Generation AGI

arXiv:2606.26406v1 Announce Type: new Abstract: We propose a complete architectural blueprint for safe artificial general intelligence based on a closed reentry loop (D I cycle). In contrast to feedforward networks, which are directed acyclic graphs (C=0, S=0) incapable of…

37
arXiv — Machine Learning research 4d ago

Otter Weather: Skillful and Computationally Efficient Medium-Range Weather Forecasting

arXiv:2606.26421v1 Announce Type: new Abstract: State-of-the-art medium-range AI weather models can outperform traditional Numerical Weather Prediction (NWP) but require massive training budgets. This restricts usage for under-resourced groups and severely limits fast model…

4
arXiv — Machine Learning research 4d ago

Rethinking Training & Inference for Forecasting: Linking Winner-Take-All back to GMMs

arXiv:2606.26424v1 Announce Type: new Abstract: Trajectory forecasting for autonomous driving has advanced rapidly, yet representative models often produce uninformative posteriors over forecast modes, causing problems for mode pruning. We trace this to a modeling-training…

37
arXiv — Machine Learning research 4d ago

DualEval: Joint Model-Item Calibration for Unified LLM Evaluation

arXiv:2606.26429v1 Announce Type: new Abstract: Current LLM evaluation relies on two complementary but often disconnected signals: static benchmarks with objective correctness labels and arena-style preference data that better reflect open-ended user interactions. We introduce…

24
arXiv — Machine Learning research 4d ago

Embedding Foundation Model Predictions in Discrete-Choice Models with Structural Guarantees

arXiv:2606.26432v1 Announce Type: new Abstract: Tabular foundation models achieve strong accuracy on choice prediction tasks, but their predictions often violate the economic logic those tasks require: raising a price can increase predicted demand, implied willingness-to-pay…

36
arXiv — Machine Learning research 4d ago

Optimizing CUDA like a Human: Micro-Profiling Tools as Expert Surrogates for LLM-Based GPU Kernel Optimization

arXiv:2606.26453v1 Announce Type: new Abstract: We present KernelPro, a closed-loop multi-agent system that automatically generates, profiles, and iteratively optimizes GPU kernel code by integrating large language model (LLM) code generation with hardware profiler feedback and…

21
arXiv — Machine Learning research 4d ago

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

arXiv:2606.26463v1 Announce Type: new Abstract: Deliberating takes time. In real-time settings, that time is not free. Standard reinforcement learning (RL) sidesteps this as the environment waits indefinitely for the agent's decision. Instead, we study real-time RL environments…

38
arXiv — Machine Learning research 4d ago

A Causal Foundation Model for Structure and Outcome Prediction

arXiv:2606.26467v1 Announce Type: new Abstract: We introduce TabPFN-CFM, a causal foundation model that can handle multiple causal problems. TabPFN-CFM predicts both causal structure and outcomes from observational data, supports queries on all three levels of Pearl's Causal…

9
arXiv — Machine Learning research 4d ago

Epiphany-Aware KV Cache Eviction Without the Attention Matrix

arXiv:2606.26472v1 Announce Type: new Abstract: As reasoning models emit chains of thought tens of thousands of tokens long, KV cache increasingly becomes a deployment bottleneck. Existing cache eviction methods rank tokens by attention weight, which is a noisy importance proxy…

21
arXiv — Machine Learning research 4d ago

When Does Quality-Aware Multimodal Fusion Matter? A Leakage-Safe Diagnostic for Decision-Level Dependence

arXiv:2606.26473v1 Announce Type: new Abstract: Many multimodal systems estimate the reliability of each modality and weight their contributions to the final prediction. However, it remains unclear whether these scores influence model decisions or merely correlate with…

20
arXiv — Machine Learning research 4d ago

Localizing RL-Induced Tool Use to a Single Crosscoder Feature

arXiv:2606.26474v1 Announce Type: new Abstract: Fine-tuning through RL reshapes the internal representations of language models to enable agentic behaviors such as tool use, yet the mechanistic basis of these changes remains poorly understood. While RL substantially improves…

4
arXiv — Machine Learning research 4d ago

Retrieval-Warmed Energy-Based Reasoning: A Five-Arm Ablation Methodology for Diffusion-as-Inference on Structured Reasoning Tasks

arXiv:2606.26476v1 Announce Type: new Abstract: Warm-started diffusion samplers accelerate iterative inference, but it is rarely clear which part of the pipeline carries the gain. We study \textbf{retrieval-warmed energy-based reasoning (RW-EBR)} -- an IRED energy-based…

9
arXiv — Machine Learning research 4d ago

What Survives When You Compress a Recursive Reasoner for the Edge?

arXiv:2606.26488v1 Announce Type: new Abstract: Recursive reasoning models can solve complex structured tasks with only a few million parameters by repeatedly updating a latent state. Deploying these models on edge hardware requires significant compression, but unlike…

30
arXiv — Machine Learning research 4d ago

Learning Probabilistic Filters with Strictly Proper Scoring Rules

arXiv:2606.26497v1 Announce Type: new Abstract: Bayesian filtering of partially and noisily observed dynamical systems seeks to infer the evolving conditional distribution of the state of a dynamical system, given observations, in an online fashion. This Bayesian filtering…

7
arXiv — Machine Learning research 4d ago

Multipath Adaptive Gated Bottleneck Latent ODE with Raman Data Fusion for Cell Culture Process Forecasting

arXiv:2606.26520v1 Announce Type: new Abstract: Mammalian cell-culture processes underpin the manufacture of many biopharmaceuticals, yet keeping a run on track is hard: critical process parameters drift over days, and an off-specification trend is often confirmed too late to…

4
arXiv — Machine Learning research 4d ago

Theory-Scale Auto-Formalization of Logics for Computer Science

arXiv:2606.26525v1 Announce Type: new Abstract: Auto-formalization is critical for scalable formal verification, but existing progress largely focuses on isolated statements, while theory-scale auto-formalization, which coherently translates hundreds of interdependent…

8
arXiv — Machine Learning research 4d ago

Sample-efficient Transfer Reinforcement Learning via Adaptive Reward Shaping and Policy-Ratio Reweighting Strategy

arXiv:2606.26527v1 Announce Type: new Abstract: Transfer learning improves policy learning efficiency by reusing knowledge from source tasks, providing a feasible paradigm for safe and efficient autonomous highway lane changing decision-making. Existing methods frequently…

25
arXiv — Machine Learning research 4d ago

CascadeFormer: Depth-Tapered Transformers Motivated by Gradient Fan-in Asymmetry

arXiv:2606.26538v1 Announce Type: new Abstract: Deep Transformers are composed of uniformly stacked residual blocks, yet their deepest layers often add little value. We present two efficiency methods that exploit this asymmetry. CascadeFormer tapers width with depth to match the…

31
arXiv — Machine Learning research 4d ago

Can Large Language Models Reliably Code Qualitative Humanitarian Data? A Benchmark Study Against Human Expert Adjudication

arXiv:2606.26541v1 Announce Type: new Abstract: Data from affected populations are crucial for informing humanitarian response, but their value depends on timely and consistent interpretation of nuanced accounts of need. Humanitarian organizations often lack the staff, time, and…

4
arXiv — Machine Learning research 4d ago

Revisiting Action Factorization for Complex Action Spaces

arXiv:2606.26574v1 Announce Type: new Abstract: Many real-world control problems involve hybrid discrete-continuous action spaces. For example, steering and signaling in autonomous driving, and aiming and firing in robotics or video-games. Despite real-world hybrid factorization…

10
arXiv — Machine Learning research 4d ago

SharQ: Bridging Activation Sparsity and FP4 Quantization for LLM Inference

arXiv:2606.26587v1 Announce Type: new Abstract: Low-bit floating-point formats and semi-structured sparsity are increasingly supported by modern accelerators, yet combining them for LLM activation compression remains challenging: activations contain input-dependent outliers that…

29
arXiv — Machine Learning research 4d ago

Empirical Software Engineering TerraProbe: A Layered-Oracle Framework for Detecting Deceptive Fixes in LLM-Assisted Terraform

arXiv:2606.26590v1 Announce Type: new Abstract: Security misconfigurations in Terraform Infrastructure-as-Code are a growing risk in cloud deployments, and large language models are increasingly used as automated repair agents. Existing evaluations often treat a repair as…

5
arXiv — Machine Learning research 4d ago

Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling

arXiv:2606.26617v1 Announce Type: new Abstract: Scaling laws describe how learning performance varies with model size, data size, and compute. While recent theoretical work has established scaling laws for sketched linear regression, much less is understood for contrastive…

25
arXiv — Machine Learning research 4d ago

Discovering Millions of Interpretable Features with Sparse Autoencoders

arXiv:2606.26620v1 Announce Type: new Abstract: Sparse autoencoders (SAEs) have emerged as a powerful tool for decomposing superposed language model representations into sparse and interpretable features. However, training SAEs is computationally expensive, and available…

5
arXiv — Machine Learning research 4d ago

From Weights to Features: SAE-Guided Activation Regularization for LLM Continual Learning

arXiv:2606.26629v1 Announce Type: new Abstract: Weight-space regularization methods such as Elastic Weight Consolidation (EWC) are the standard approach to catastrophic forgetting in continual learning. However, those methods tend to underperform when applied to large language…

15
arXiv — Machine Learning research 4d ago

Target-Aware Bandit Allocation for Scalable Surrogate Optimization in Chemical Space

arXiv:2606.26657v1 Announce Type: new Abstract: Identifying high-utility candidates from massive discrete spaces under expensive evaluations is a recurring challenge across the sciences, with structure-based drug discovery as a prominent example. While surrogate-based…

20

Physics-guided Convolutional Neural Network for Domain Growth Prediction in Systems with Conserved Kinetics

\chisao{}: A GPU-Native Parallel Optimizer for Multimodal Black-Box Functions via Convergence-Anticonvergence Oscillation

Implementation of reinforcement learning in chemical reaction networks: application to phototaxis as curiosity-driven exploration

Neural Architecture Search for Generative Adversarial Networks: A Comprehensive Review and Critical Analysis

KG-TRACE: A Neuro-Symbolic Framework for Mechanistic Grounding in Antimicrobial Resistance Prediction

Necessary but Not Sufficient: Temperature Control and Reproducibility in LLM-as-Judge Safety Evaluations

Clue-Guided Money Laundering Group Discovery

Federated Hash Projected Latent Factor Learning

Statistical and Structural Approaches to Algorithmic Fairness

Topology-Informed Neural Networks for Flood Detection in Optical and Synthetic Aperture Radar Imagery

A General Framework for Learning Algebraic Properties from Cayley Graphs using Graph Neural Networks

Fast LeWorldModel

Dataset Usage Inference without Shadow Models or Held-out Data

Equivariance and Augmentation for Bayesian Neural Networks

SSM Adapters via Hankel Reduced-order Modeling: Injection Site Determines Task Suitability in Long-Context Fine-Tuning

The Red Queen G\"odel Machine: Co-Evolving Agents and Their Evaluators

High-Probability PL-SGD with Markovian Noise: Optimal Mixing and Tail Dependence

EVOM: Agentic Meta-Evolution of Actor-Critic Architectures for Reinforcement Learning

Mesh-RL: Coupled subgrid reinforcement learning

EMA-FS: Accelerating GBDT Training via Gain-Informed Feature Screening

Does Aurora Encode Atmospheric Structure? Latent Regime Analysis and Attribution

SOLAR: AI-Powered Speed-of-Light Performance Analysis

At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization

Deterministic Pareto-Optimal Policy Synthesis for Multi-Objective Reinforcement Learning

Beyond Feedforward Networks: Reentry Neural Systems as the Fundamental Basis of Subjecthood and Intrinsic Safety of Next-Generation AGI

Otter Weather: Skillful and Computationally Efficient Medium-Range Weather Forecasting

Rethinking Training & Inference for Forecasting: Linking Winner-Take-All back to GMMs

DualEval: Joint Model-Item Calibration for Unified LLM Evaluation

Embedding Foundation Model Predictions in Discrete-Choice Models with Structural Guarantees

Optimizing CUDA like a Human: Micro-Profiling Tools as Expert Surrogates for LLM-Based GPU Kernel Optimization

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

A Causal Foundation Model for Structure and Outcome Prediction

Epiphany-Aware KV Cache Eviction Without the Attention Matrix

When Does Quality-Aware Multimodal Fusion Matter? A Leakage-Safe Diagnostic for Decision-Level Dependence

Localizing RL-Induced Tool Use to a Single Crosscoder Feature

Retrieval-Warmed Energy-Based Reasoning: A Five-Arm Ablation Methodology for Diffusion-as-Inference on Structured Reasoning Tasks

What Survives When You Compress a Recursive Reasoner for the Edge?

Learning Probabilistic Filters with Strictly Proper Scoring Rules

Multipath Adaptive Gated Bottleneck Latent ODE with Raman Data Fusion for Cell Culture Process Forecasting

Theory-Scale Auto-Formalization of Logics for Computer Science

Sample-efficient Transfer Reinforcement Learning via Adaptive Reward Shaping and Policy-Ratio Reweighting Strategy

CascadeFormer: Depth-Tapered Transformers Motivated by Gradient Fan-in Asymmetry

Can Large Language Models Reliably Code Qualitative Humanitarian Data? A Benchmark Study Against Human Expert Adjudication

Revisiting Action Factorization for Complex Action Spaces

SharQ: Bridging Activation Sparsity and FP4 Quantization for LLM Inference

Empirical Software Engineering TerraProbe: A Layered-Oracle Framework for Detecting Deceptive Fixes in LLM-Assisted Terraform

Sketched Linear Contrastive Learning: Approximation, Optimization, and Statistical Scaling

Discovering Millions of Interpretable Features with Sparse Autoencoders

From Weights to Features: SAE-Guided Activation Regularization for LLM Continual Learning

Target-Aware Bandit Allocation for Scalable Surrogate Optimization in Chemical Space