arXiv — Machine Learning
500 articles archived
arXiv — Machine Learning research 6d ago
A Fair Evaluation of Graph Foundation Models for Node Property Prediction
arXiv:2606.24509v1 Announce Type: new Abstract: Due to the wide use of graph-structured data in different fields of industry and science, the development of Graph Foundation Models (GFMs) has recently attracted a lot of attention. While many different types of models are called…
33 -
arXiv — Machine Learning research 6d ago
Reasoning as Attractor Dynamics: Latent Memory Retrieval via Gibbs-Weighted Energy Minimization
arXiv:2606.24543v1 Announce Type: new Abstract: Large Language Models (LLMs) are traditionally viewed as autoregressive generators. However, from the perspective of collective computation, they function as high-dimensional Dense Associative Memories that store complex reasoning…
24 -
arXiv — Machine Learning research 6d ago
QC-SMOTE: Quality-Controlled SMOTE for Imbalanced Classification
arXiv:2606.24625v1 Announce Type: new Abstract: Class imbalance poses a significant challenge in classification, where existing methods such as SMOTE often generate low-quality synthetic samples in regions with noise or class overlap. We propose QC-SMOTE, a quality-controlled…
26 -
arXiv — Machine Learning research 6d ago
FlowPipe: LLM-Enhanced Conditional Generative Flow Networks for Data Preparation Pipeline Construction
arXiv:2606.24679v1 Announce Type: new Abstract: Data preparation pipelines improve data quality in machine learning by transforming raw tables into learning-ready data through sequential cleaning and feature transformation operators. However, automatically constructing such…
25 -
arXiv — Machine Learning research 6d ago
Grad Detect: Gradient-Based Hallucination Detection in LLMs
arXiv:2606.24790v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse tasks, yet they remain prone to generating hallucinations. Detecting these hallucinations is critical for deploying LLMs reliably in high-stakes…
9 -
arXiv — Machine Learning research 6d ago
Real vs. Complex Spectral Bases for Neural Operators: The Role of Green's Function Alignment
arXiv:2606.24851v1 Announce Type: new Abstract: Fourier Neural Operators (FNO) learn solution operators of partial differential equations by parameterizing global convolutions in the complex Fourier domain. For real-valued PDE solutions, the complex FFT carries representational…
20 -
arXiv — Machine Learning research 6d ago
EnerInfer: Energy-Aware On-Device LLM Inference
arXiv:2606.23001v1 Announce Type: cross Abstract: On-device LLM inference is increasingly attractive for privacy-preserving, reliable, and cost-effective deployment, yet its energy and thermal costs remain a critical bottleneck. Existing systems primarily optimize for decoding…
13 -
arXiv — Machine Learning research 6d ago
Self-Recognition Finetuning can Prevent and Reverse Emergent Misalignment
arXiv:2606.23700v1 Announce Type: cross Abstract: Emergent misalignment (EM) has been linked to the activation of misaligned persona vectors and evil character traits, suggesting that EM operates through disruption of the model's aligned character rather than direct learning of…
8 -
arXiv — Machine Learning research 6d ago
Zero-Shot Neural Priors for Generalizable Cross-Subject and Cross-Task EEG Decoding
arXiv:2606.23706v1 Announce Type: cross Abstract: The development of generalizable electroencephalography (EEG) decoding models is essential for robust brain-computer interfaces (BCI) and objective neural biomarkers in mental health. Conventional approaches have been hindered by…
29 -
arXiv — Machine Learning research 6d ago
Coordinate-Queryable Neural Field Reconstruction for EEG Spatial Super-Resolution with Unseen-Electrode Generation
arXiv:2606.23707v1 Announce Type: cross Abstract: EEG spatial super-resolution (EEGSR) in real deployments is challenged by random channel missingness, unstable electrode quality, and changing visible-channel patterns caused by bad contacts or device variability. Most existing…
36 -
arXiv — Machine Learning research 6d ago
WiFi-Based People Counting Using Beam-Steerable Antennas: A Test-bed Study
arXiv:2606.23710v1 Announce Type: cross Abstract: Ubiquitous perception through RF signals is a pivotal opportunity for future technology: it enables personalized services such as smart living, remote healthcare, automated logistics or interaction through free-space gestures.…
37 -
arXiv — Machine Learning research 6d ago
Dimensionality Reduction of QAOA Parameter Space with Kernel PCA for Max-Cut
arXiv:2606.23718v1 Announce Type: cross Abstract: The Quantum Approximate Optimization Algorithm (QAOA) is a leading variational algorithm for combinatorial optimization on near term quantum devices. As circuit depth increases, the number of optimization parameters grows, making…
35 -
arXiv — Machine Learning research 6d ago
A Hybrid Quantum-Classical Approach for Melt Pool Prediction in Laser Powder Bed Fusion
arXiv:2606.23719v1 Announce Type: cross Abstract: Laser powder bed fusion (LPBF) is a promising additive manufacturing technique that suffers from quality assurance concerns. Predicting melt pools from process parameters is crucial for assessing quality prior to manufacturing…
29 -
arXiv — Machine Learning research 6d ago
Computational references are not experiments: pre-registered validation of machine-learned sodium-cathode voltages
arXiv:2606.23725v1 Announce Type: cross Abstract: Machine-learning screens for battery materials are trained and judged almost entirely against computed reference voltages, and those references carry their own systematic errors. We report a case in which this matters…
5 -
arXiv — Machine Learning research 6d ago
Sol Video Inference Engine: Agent-Native Full-Stack Acceleration Framework for Efficient Video Generation
arXiv:2606.23743v1 Announce Type: cross Abstract: Modern video diffusion models achieve higher generation quality through scaling, but this also increases inference cost. Although many acceleration methods have been proposed, a central challenge is that the most effective…
30 -
arXiv — Machine Learning research 6d ago
JEDEL: Zero-Shot DNA-Encoded Library Design for Early-Stage Drug Discovery
arXiv:2606.23745v1 Announce Type: cross Abstract: We present JEDEL, a framework for generating synthesis-ready DNA-encoded libraries (DELs) directly from three-dimensional pharmacophore representations of active ligands. JEDEL is the first model to map pharmacophore interaction…
5 -
arXiv — Machine Learning research 6d ago
Verifiable Foundation Models for Robot Safety
arXiv:2606.23754v1 Announce Type: cross Abstract: Deploying foundation models for robot control raises a central challenge: the expressive power that enables rich, multimodal perception also makes these models opaque and difficult to analyze formally, rendering them intractable…
4 -
arXiv — Machine Learning research 6d ago
Hessian-augmented Supervised Learning for Hamilton-Jacobi-Bellman PDEs
arXiv:2606.23827v1 Announce Type: cross Abstract: A data-driven method is developed for approximating value functions in deterministic optimal control problems with nonlinear control-affine dynamics. The Pontryagin Maximum Principle optimality system is solved from multiple…
36 -
arXiv — Machine Learning research 6d ago
Do LLM Attribution Metrics Transfer? Auditing Retrieval-Augmented Generation Evaluation Across Datasets and Constructs
arXiv:2606.23915v1 Announce Type: cross Abstract: Practice often treats automatic metrics for attribution in LLM retrieval-augmented generation as interchangeable. We audit eight automatic scorers -- lexical, embedding, and BERTScore baselines alongside…
28 -
arXiv — Machine Learning research 6d ago
Flow-Corrected Thompson Sampling for Non-Stationary Contextual Bandits
arXiv:2606.23933v1 Announce Type: cross Abstract: We study non-stationary linear contextual bandits where the reward model drifts over time, rendering classical contextual bandit algorithms brittle because historical data becomes systematically biased. We propose Flow-Corrected…
27 -
arXiv — Machine Learning research 6d ago
When Retrieval Metrics Mislead: Measuring Policy Signal in Long-Horizon Tool-Use Agents
arXiv:2606.23937v1 Announce Type: cross Abstract: Exact-match retrieval recall is often used as a proxy for whether a retriever supplies useful policy context to a downstream decision model. We test this proxy for pre-action policy classification in tau-bench using Qwen2.5-3B/7B…
11 -
arXiv — Machine Learning research 6d ago
Constrained Variable Projection for Structured Problems
arXiv:2606.23939v1 Announce Type: cross Abstract: Variable projection is a classical technique for separable nonlinear least-squares problems, in which variables that enter linearly are eliminated exactly, yielding a reduced nonlinear problem. By expressing this framework as a…
31 -
arXiv — Machine Learning research 6d ago
Prediction of Viscoelastic Droplet Impact Dynamics Using a Vision Transformer-Based Approach
arXiv:2606.23940v1 Announce Type: cross Abstract: Droplet impact on solid surfaces is a complex fluid dynamics problem with applications in spray cooling, inkjet printing, and pharmaceutical processing. Although numerical simulations are widely used to investigate these…
24 -
arXiv — Machine Learning research 6d ago
Stochastic Expectation Maximization for Robust State-Space Radio Interferometric Imaging
arXiv:2606.23944v1 Announce Type: cross Abstract: State-space models provide a flexible framework for analyzing dynamical systems, yet they often rely on Gaussian assumptions that fail to capture heavy-tailed or outlier-prone measurement noise. We propose a robust estimation…
16 -
arXiv — Machine Learning research 6d ago
Does My Embedding Reflect That $A = B$? Evaluating Mathematical Equivalence in Embedding Models
arXiv:2606.23959v1 Announce Type: cross Abstract: Because mathematics is highly abstract, a single statement can take very different forms depending on what subfield it is framed in. There are many examples where breakthroughs occurred after researchers discovered that a…
25 -
arXiv — Machine Learning research 6d ago
Critique of Agent Model
arXiv:2606.23991v1 Announce Type: cross Abstract: What is an agent? What constitutes agency? With the rise of Large Language Model (LLM) systems marketed as "coding agents", "AI co-scientists", and other "agentic" tools that promise to drive up productivity, and at the same…
31 -
arXiv — Machine Learning research 6d ago
RASC+: Retrieval-Constrained LLM Adjudication for Clinical Value Set Authoring
arXiv:2606.23992v1 Announce Type: cross Abstract: Clinical value sets define the standardized terminology codes used in quality measurement, phenotyping, cohort construction, and clinical decision support. The recently introduced Retrieval-Augmented Set Completion (RASC)…
32 -
arXiv — Machine Learning research 6d ago
Low-rank Updates in Slowly Time-varying Graphs for Spatial-Temporal Signal Interpolation
arXiv:2606.24011v1 Announce Type: cross Abstract: A crucial assumption in graph signal processing (GSP) is the existence of an underlying graph that captures the pairwise similarities between nodes, allowing filters to be designed based on this graph for tasks such as denoising.…
11 -
arXiv — Machine Learning research 6d ago
CAVEWOMAN: How Large Language Models Behave Under Linguistic Input and Output Compression
arXiv:2606.24083v1 Announce Type: cross Abstract: "Talk short. Drop grammar. Save token." This caveman style is widely promoted as a way to cut inference cost, but whether it actually saves anything depends on which channel (the user's prompt or the model's response) is being…
25 -
arXiv — Machine Learning research 6d ago
PORTER: Language-Grounded Event Representations for Portable Structured EHR Foundation Models
arXiv:2606.24102v1 Announce Type: cross Abstract: Most electronic health record (EHR) foundation models encode clinical events as discrete event tokens from a fixed vocabulary and therefore cannot directly represent events containing unseen concepts or new combinations of…
35 -
arXiv — Machine Learning research 6d ago
Uniform Sampling from High-dimensional Spectral Norm Balls
arXiv:2606.24134v1 Announce Type: cross Abstract: Motivated by an application in machine learning optimization, this paper focuses on the challenges of sampling a matrix uniformly from the unit spectral norm ball. It is proven that all singular values of sampled matrices…
27 -
arXiv — Machine Learning research 6d ago
Autonomous Video Generation with Counterfactual Controllability for Self-Evolving World Models
arXiv:2606.24152v1 Announce Type: cross Abstract: Existing literature claims that video generation essentially is world modelling. On the one hand, the claim is productive because it pushes generative AI beyond static images and toward temporally extended physical scenes. On the…
15 -
arXiv — Machine Learning research 6d ago
BehaviorBench: Benchmarking Foundation Models for Behavioral Science Tasks
arXiv:2606.24162v1 Announce Type: cross Abstract: Foundation models have been increasingly applied to behavioral science domains such as psychology, sociology, and economics. While these models show promise in individual tasks such as survey response prediction and human-subject…
27 -
arXiv — Machine Learning research 6d ago
A Pāninian Foundation for Indic Language Processing
arXiv:2606.24172v1 Announce Type: cross Abstract: More than a billion people communicate in Indic languages, yet the natural language processing infrastructure serving them remains fragmented and underdeveloped. The cause is structural: the field organizes its tools and…
24 -
arXiv — Machine Learning research 6d ago
Automated Residual Plot Assessment With the R Package autovi and the Shiny Application autovi.web
arXiv:2606.24236v1 Announce Type: cross Abstract: Visual assessment of residual plots is a common approach for diagnosing linear models, but it relies on manual evaluation, which does not scale well and can lead to inconsistent decisions across analysts. The lineup protocol,…
16 -
arXiv — Machine Learning research 6d ago
Deep numerical schemes for systems of Ergodic BSDEs with applications to regime-switching forward utilities
arXiv:2606.24271v1 Announce Type: cross Abstract: In this paper, we introduce two neural-network-based numerical schemes for solving systems of coupled ergodic Backward Stochastic Differential Equations (eBSDEs), motivated by the approximation of optimal strategies within the…
27 -
arXiv — Machine Learning research 6d ago
PROTECT-90: A Fault Dataset for Power System Protection
arXiv:2606.24298v1 Announce Type: cross Abstract: The increasing interest in data-driven methods for power system protection is accompanied by a lack of standardized, publicly available high-voltage waveform datasets that enable transparent and reproducible evaluation. To…
36 -
arXiv — Machine Learning research 6d ago
Open-Vocabulary BEV Segmentation with 3D-Aware Geometric Constraints
arXiv:2606.24353v1 Announce Type: cross Abstract: Bird's-eye view (BEV) perception fuses multi-camera images into a unified top-down representation for autonomous driving. Despite recent progress, state-of-the-art methods remain confined to closed-set scenarios, making them…
6 -
arXiv — Machine Learning research 6d ago
PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models
arXiv:2606.24388v1 Announce Type: cross Abstract: We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and practical, extending existing benchmarks by…
38 -
arXiv — Machine Learning research 6d ago
RE4: Transformation-aware Imitation of Object Interactions Using Manipulation Modes
arXiv:2606.24403v1 Announce Type: cross Abstract: Object interaction tasks have been a focus of advances in imitation learning. End-to-end methods, dominated by diffusion and flow-based variants have shown leaps in performance while sacrificing interpretability. Object-centric…
23 -
arXiv — Machine Learning research 6d ago
MedPCFM: Improving Medical Point Cloud Completion by Integrating Point Transformers and Flow Matching
arXiv:2606.24433v1 Announce Type: cross Abstract: Medical point cloud completion is important for anatomical reconstruction and downstream clinical workflows, yet generative modeling in this setting remains insufficiently studied. We investigate completion through…
28 -
arXiv — Machine Learning research 6d ago
An Agnostic Machine Learning Model of Photosynthetic Habitability
arXiv:2606.24458v1 Announce Type: cross Abstract: The search for exoplanet biosignatures is guided by whether planetary environments can sustain photosynthesis. As such, the Photosynthetic Habitable Zone (PHZ) was recently proposed, as the overlap between the canonical habitable…
8 -
arXiv — Machine Learning research 6d ago
CrossPool: Efficient Multi-LLM Serving for Cold MoE Models through KV-Cache and Weight Disaggregation
arXiv:2606.24506v1 Announce Type: cross Abstract: Emerging LLM services increasingly host many sparse MoE models, yet most models receive sparse requests and remain cold. This creates a GPU memory problem: model weights are stable and model-determined, while KV-cache is…
8 -
arXiv — Machine Learning research 6d ago
EERLoss: A Novel Loss Function for Training Deep Biometric Models. A Case Study in Keystroke Dynamics
arXiv:2606.24586v1 Announce Type: cross Abstract: Deep learning approaches to biometric verification are commonly trained by optimizing indirect objectives, creating a misalignment between the optimization process and the primary evaluation metric, typically the Equal Error Rate…
19 -
arXiv — Machine Learning research 6d ago
ASALT: Adaptive State Alignment for Lateral Transfer in Multi-agent Reinforcement Learning
arXiv:2606.24601v1 Announce Type: cross Abstract: Multi-agent reinforcement learning (MARL) addresses the problem of training multiple agents that pursue collaborative, competitive, or mixed objectives. Prior work has investigated transfer learning between source and target…
29