arXiv — Machine Learning
500 articles archived
arXiv — Machine Learning research 1d ago
Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization
arXiv:2606.30699v1 Announce Type: new Abstract: Discovering governing equations directly from observational data is a key step towards interpretable scientific machine learning. Current data-driven approaches typically operate on a single dataset, inherently limiting their…
38 -
arXiv — Machine Learning research 1d ago
From Search to Synthesis: Training LLMs as Zero-Shot Workflow Generators
arXiv:2606.30704v1 Announce Type: new Abstract: Large language models (LLMs) excel across a wide range of tasks, yet their instance-specific solutions often lack the structural consistency needed for reliable deployment. Workflows that encode recurring algorithmic patterns at…
13 -
arXiv — Machine Learning research 1d ago
Why Do Few-Step Text Latents Fail When Image Latents Work? Non-Commitment at Sharp Categorical Readouts
arXiv:2606.30705v1 Announce Type: new Abstract: Deterministic few-step generation succeeds on continuous image latents but collapses to incoherent text on continuous text latents, and we show the cause is geometric rather than a training or scaling deficiency: a smooth,…
30 -
arXiv — Machine Learning research 1d ago
Hierarchical Global Attention (HGA)
arXiv:2606.30709v1 Announce Type: new Abstract: Hierarchical Global Attention (HGA) is a drop-in replacement for dense causal attention in pretrained long-context transformers. HGA preserves the original checkpoint parameters: the pretrained $W_Q$, $W_K$, $W_V$, and $W_O$…
23 -
arXiv — Machine Learning research 1d ago
ReactionAtlas: Ab origine exploration of chemical reaction networks with machine learning
arXiv:2606.30778v1 Announce Type: new Abstract: Mapping a chemical reaction network, the graph of minima and transition states (TS) and the elementary reactions connecting them, is the natural language of chemistry, from catalysis to combustion to the origin of life.…
5 -
arXiv — Machine Learning research 1d ago
Revocable Learned State via Process Sidecars
arXiv:2606.30788v1 Announce Type: new Abstract: Language models are often adapted in stages: a public skill phase, a private memory phase, and a later safety phase that learns to refuse outputs tied to the remembered entities. Revoking the memory after the safety phase is not…
17 -
arXiv — Machine Learning research 1d ago
Predictable GRPO: A Closed-Form Model of Training Dynamics
arXiv:2606.30789v1 Announce Type: new Abstract: Group Relative Policy Optimization (GRPO) has become a standard tool for improving the reasoning ability of large language models, yet its training dynamics are still described empirically: reward trajectories are fit with…
16 -
arXiv — Machine Learning research 1d ago
Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
arXiv:2606.30813v1 Announce Type: new Abstract: Deep neural networks with repeated architectural blocks, such as transformers, often exhibit structured relationships across layers that emerge during training. Motivated by this observation, we introduce Depth-wise Gradient…
25 -
arXiv — Machine Learning research 1d ago
Mind the Residual Gap: Probabilistic Downscaling under Real-World Bias
arXiv:2606.30821v1 Announce Type: new Abstract: Probabilistic downscaling is the task of modeling the conditional distribution of high-resolution fields given coarse inputs, and is a central challenge to atmospheric science, climate modeling, and other multiscale physical…
21 -
arXiv — Machine Learning research 1d ago
Partition-Guided Distance Saliency: Bridging Decision and Objective Spaces in Many-Objective Optimization
arXiv:2606.30836v1 Announce Type: new Abstract: Explainability in Many-Objective Optimization (MaO) is currently hindered by the escalating complexity of the Pareto front, which renders the relationship between high-dimensional decision variables and objective outcomes…
16 -
arXiv — Machine Learning research 1d ago
A Stationary-Distribution Theory for Triplet-Based Plateau Search in Random Forest Ensemble-Size Selection
arXiv:2606.30837v1 Announce Type: new Abstract: The number of trees is a central computational parameter in Random Forests: increasing it reduces finite-ensemble variability but increases training and prediction cost. Plateau-based tuning adapts this parameter through local…
18 -
arXiv — Machine Learning research 1d ago
Behavior Cloning is Not All You Need: The Optimality of On-Policy Distillation for Noisy Expert Feedback
arXiv:2606.30923v1 Announce Type: new Abstract: Imitation Learning is a natural framework for learning in sequential decision-making systems and has emerged as the dominant paradigm through which we understand language model training. A central puzzle is that, while in theory…
10 -
arXiv — Machine Learning research 1d ago
Personalizing Marketplace Policies with Competing Objectives and Constrained Experiments: Evidence from a Job Marketplace
arXiv:2606.30932v1 Announce Type: new Abstract: Two-sided marketplaces connect distinct user groups whose interests often conflict -- improving outcomes on one side could degrade the other side's experience. To address this challenge, we deploy an integrated framework for…
31 -
arXiv — Machine Learning research 1d ago
Quality-Aware Modulation for Diffusion Transformers
arXiv:2606.30934v1 Announce Type: new Abstract: Modern text-to-image diffusion models, such as diffusion transformers (DiT), rely on timestep or prompt embeddings to modulate the strength of the denoising process in each timestep. While this modulation communicates the current…
31 -
arXiv — Machine Learning research 1d ago
Physics-informed Conditional Normalizing Flows for Angles-only Cislunar Orbit Determination
arXiv:2606.30936v1 Announce Type: new Abstract: Generative Astrodynamics is advanced in this work by extending generative modelling to an orbit determination problem in the cislunar environment. The task is formulated as conditional density estimation, aiming to infer the…
38 -
arXiv — Machine Learning research 1d ago
Multistage Defer Trees for Hybrid Interpretability: If at First You Can't Succeed, Tree Again
arXiv:2606.30995v1 Announce Type: new Abstract: Recent work has shown that well-optimized individual decision trees can match complex black box models in some settings, primarily in noisy domains. For the remaining settings, however, complex ensembled compositions of trees often…
26 -
arXiv — Machine Learning research 1d ago
Estimating Supply Incrementality in Two-sided Marketplaces: A Causal Machine Learning Approach
arXiv:2606.30999v1 Announce Type: new Abstract: In two-sided marketplaces with heterogeneous products, it is important to understand the causal relationship between additional supply and marketplace outcomes, such as the total quantity transacted or transaction value in the…
26 -
arXiv — Machine Learning research 1d ago
Offline Reinforcement Learning for Fluid Controls: Data-based Multi-observational Policy Extraction
arXiv:2606.31025v1 Announce Type: new Abstract: Active flow control is a fundamental application in engineering. Recent advances in deep reinforcement learning have made progress in this field. However, the classical online RL approaches require extensive real-time interactions…
19 -
arXiv — Machine Learning research 1d ago
OTCache: Optimal Transport for Geometry-Aware Caching in Diffusion Models
arXiv:2606.31026v1 Announce Type: new Abstract: We propose OTCache, a training-free framework for accelerating diffusion sampling via caching schedule prediction. Existing graph-based caching methods reduce redundant computation by optimizing shortest-path objectives, but rely…
34 -
arXiv — Machine Learning research 1d ago
Teaching LLMs to Recommend and Defer in Underrepresented Epilepsy Care
arXiv:2606.31036v1 Announce Type: new Abstract: Specialist epilepsy expertise is scarce in resource-constrained settings, making LLM-based decision support attractive for frontline clinicians managing longitudinal treatment. Such systems must adapt to local prescribing practice…
12 -
arXiv — Machine Learning research 1d ago
Warp RL: Reshaping Base Policy Distributions for Dynamics Adaptation
arXiv:2606.31043v1 Announce Type: new Abstract: Residual reinforcement learning adapts a pretrained robot policy by learning an additive correction to its actions. While effective when adaptation amounts to shifting the base policy's action distribution, additive corrections…
26 -
arXiv — Machine Learning research 1d ago
Fora: From Weight-Space to Function-Space Protection in Capability-Preserving Fine-Tuning
arXiv:2606.31092v1 Announce Type: new Abstract: Full fine-tuning adapts large language models to new tasks but can erode capabilities they already possess. Existing remedies protect through proxies such as parameter distances, importance penalties, output matching, or dominant…
11 -
arXiv — Machine Learning research 1d ago
Explaining Machine Learning and Memorization with Statistical Mechanics
arXiv:2606.31110v1 Announce Type: new Abstract: Artificial neural networks (NNs) and machine learning (ML) algorithms are poorly understood from a theoretical perspective, which makes it difficult to fully realize their potential and overcome their weaknesses. For instance, ML…
8 -
arXiv — Machine Learning research 1d ago
Visualizing High-Dimensional Graph Embeddings via Informed Multi-View Projections
arXiv:2606.31119v1 Announce Type: new Abstract: Graphs are commonly visualized in 2D, where humans readily interpret spatial relationships, yet such layouts often distort higher-dimensional structure. We propose to embed graphs in high-dimensional space and search for…
38 -
arXiv — Machine Learning research 1d ago
Can Tabular In-Context Learners Generalize to Biomolecular Property Prediction?
arXiv:2606.31126v1 Announce Type: new Abstract: Predicting biomolecular properties from limited labeled data is a central bottleneck in protein engineering and small-molecule design. As strong pretrained encoders now supply rich fixed-length representations, the difficulty has…
28 -
arXiv — Machine Learning research 1d ago
A Bayesian Filtering Approach for Learning Lagrangian Dynamics from Noisy Measurements
arXiv:2606.31137v1 Announce Type: new Abstract: This paper proposes a Bayesian filtering-based approach for learning the dynamics of a physical system from partial, noisy measurements. We model the system dynamics using a Lagrangian mechanics formulation. As in Lagrangian neural…
38 -
arXiv — Machine Learning research 1d ago
PPT-Eval: A Benchmark for Computer-Use Agents on PowerPoint Tasks
arXiv:2606.31154v1 Announce Type: new Abstract: Creating and editing slides is a rich, multimodal activity that is ubiquitous in professional and educational settings, making it an ideal testbed for real-world computer-use agents. Microsoft PowerPoint is among the most widely…
25 -
arXiv — Machine Learning research 1d ago
ComplianceGate: Classifier-Gated Multi-Tier LLM Routing for Inference in Regulated Industries
arXiv:2606.31163v1 Announce Type: new Abstract: Large language models deployed in regulated industries operate under two constraints: compliance enforcement and cost efficiency. Personally identifiable information (PII) in user queries can reach model endpoints before the system…
14 -
arXiv — Machine Learning research 1d ago
AETDICE: Unified Framework and Offline Optimization for Nonlinear Multi-Objective RL
arXiv:2606.31178v1 Announce Type: new Abstract: Optimizing nonlinear preferences in multi-objective reinforcement learning (MORL) is essential for capturing complex trade-offs like risk aversion or fairness. However, such non-linearity has historically bifurcated nonlinear MORL…
31 -
arXiv — Machine Learning research 1d ago
Transformers as Bayesian In-Context Experimenters: Smoothness-Adaptive Efficient ATE Estimation
arXiv:2606.31184v1 Announce Type: new Abstract: Adaptive experiments for average treatment effects (ATE) require randomized allocations balancing valid inference with statistical efficiency. The oracle design is a covariate-dependent Neyman rule governed by unknown…
18 -
arXiv — Machine Learning research 1d ago
ISM: Self-Improving Strategy Memory for Continual Mathematical Reasoning
arXiv:2606.31191v1 Announce Type: new Abstract: We propose Intelligent Schema Memory (ISM), a self-evolving memory-augmented system that improves mathematical reasoning for a frozen LLM under continual learning with hard episodic resets. ISM maintains a compact, self-refined…
34 -
arXiv — Machine Learning research 1d ago
Probing Memorization of Tabular In-Context Learning
arXiv:2606.31208v1 Announce Type: new Abstract: Large tabular models (LTMs), i.e., tabular foundation models leveraging in-context learning (ICL), achieve state-of-the-art performance on tabular tasks. While LLMs are known to unintentionally memorize training data, the…
19 -
arXiv — Machine Learning research 1d ago
Learning Gaussian Graphical Models from a Glauber Trajectory Without Mixing
arXiv:2606.31230v1 Announce Type: new Abstract: We study the task of learning the structure of a $d$-sparse Gaussian graphical model on $n$ variables from a single trajectory of Glauber dynamics. Beyond algorithmic considerations, many applications present temporally correlated…
28 -
arXiv — Machine Learning research 1d ago
Revisiting the Volume Hypothesis
arXiv:2606.31282v1 Announce Type: new Abstract: Modern deep neural networks often contain far more parameters than needed to fit their training data, yet they achieve impressive generalization. A common explanation for this success is the implicit bias of stochastic gradient…
31 -
arXiv — Machine Learning research 1d ago
Sequential sparse Gaussian process quantile regression
arXiv:2606.31284v1 Announce Type: new Abstract: Quantile regression aims to estimate the conditional quantiles of a response variable from observed data. In a Bayesian setting, Gaussian process quantile regression provides uncertainty quantification but faces significant…
37 -
arXiv — Machine Learning research 1d ago
Probabilistic Inversion with Flow Matching
arXiv:2606.31288v1 Announce Type: new Abstract: We demonstrate the application of Flow Matching, a technique originating from generative Artificial Intelligence, to probabilistic inversion in geophysical settings, such as seismic Full-Waveform inversion. We adapt the…
7 -
arXiv — Machine Learning research 1d ago
Deep Reinforcement Learning for Spacecraft Attitude Control During Atmospheric Re-Entry
arXiv:2606.31291v1 Announce Type: new Abstract: Deep reinforcement learning has the potential to solve attitude control problems more adaptively, precisely, and robustly by handling nonlinear dynamics, uncertainties, and failure cases more effectively than traditional attitude…
11 -
arXiv — Machine Learning research 1d ago
Safe Online Learning via Smooth Safety-Structured Policy Composition
arXiv:2606.31320v1 Announce Type: new Abstract: Safe online reinforcement learning requires policies to respect safety constraints while maintaining smooth optimization dynamics. Existing approaches typically rely on either strict safety enforcement via action interventions,…
7 -
arXiv — Machine Learning research 1d ago
Expected Gain-based Escalation in Vertical Federated Learning
arXiv:2606.31331v1 Announce Type: new Abstract: Collaborative inference can improve predictive performance by integrating complementary information across agents, but applying collaborative fusion to every sample can incur unnecessary communication and computational overhead.…
17 -
arXiv — Machine Learning research 1d ago
Dualformer: Efficient Feature Extractor for Complex-valued Blind Communication Signal Analysis
arXiv:2606.31352v1 Announce Type: new Abstract: Designing effective feature extractors is critical for blind signal analysis tasks such as automatic modulation recognition (AMR), signal scheme recognition (SSR), and signal structure parsing (SSP). In this work, we…
10 -
arXiv — Machine Learning research 1d ago
Calibrating the Evaluator: Does Probability Calibration Mitigate Preference Coupling in LLM Agent Feedback Loops?
arXiv:2606.31371v1 Announce Type: new Abstract: When large language model (LLM) agents adapt their behavior through evaluator feedback, systematic evaluator biases propagate into the agent's learned strategy distribution - a phenomenon termed evaluator preference coupling. Prior…
38 -
arXiv — Machine Learning research 1d ago
Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images
arXiv:2606.31394v1 Announce Type: new Abstract: Artificial intelligence is transforming our capability to solve biological challenges. In dimensionality bottleneck regimes exacerbated by high-dimensional biological data, neural networks force distinct concepts into the lower…
14 -
arXiv — Machine Learning research 1d ago
Mixture-of-Control: State-Aware Fine-Tuning for Transformer-based Models
arXiv:2606.31397v1 Announce Type: new Abstract: State-based fine-tuning has emerged as a compelling alternative to weight-based adaptation for transformers, updating lightweight controls into states rather than model weights, offering substantial memory savings while retaining…
27 -
arXiv — Machine Learning research 1d ago
Contextual Slate GLM Bandits with Limited Adaptivity
arXiv:2606.31449v1 Announce Type: new Abstract: We investigate the contextual slate bandit problem with generalized linear rewards under limited adaptivity. At each round, the learner is presented with $N$ sets of items, where each item is represented by a $d$-dimensional…
22