arXiv — Machine Learning
500 articles archived · Visit source ↗ · RSS
-
arXiv — Machine Learning research 1d ago
Zero-Shot Quantization for Object Detectors using Off-the-Shelf Generative Models
arXiv:2606.31456v1 Announce Type: new Abstract: With an increasing number of Object Detection (OD) models being deployed on edge devices, Zero-Shot Quantization for OD (ZSQ-OD) aims to quantize these models when access to the original training data is prohibited. Existing…
34 -
arXiv — Machine Learning research 1d ago
TabPATE: Differentially Private Tabular In-Context Learning Without Public Data
arXiv:2606.31474v1 Announce Type: new Abstract: Tabular foundation models enable accurate in-context learning (ICL) from small labeled datasets, but the private records placed in context can leak through model predictions. We first show that even basic membership inference…
38 -
arXiv — Machine Learning research 1d ago
Constrained Online Convex Optimization without Slater's Condition
arXiv:2606.31480v1 Announce Type: new Abstract: We study constrained online convex optimization with adversarial losses and stochastic or adversarial constraints. For stochastic constraints, existing algorithms that achieve nearly optimal regret and constraint violation bounds…
27 -
arXiv — Machine Learning research 1d ago
Fork-Think with Confidence
arXiv:2606.31484v1 Announce Type: new Abstract: Parallel thinking has enjoyed great success for boosting LLM performance on reasoning tasks without the need for any re-training. However, existing methods follow a think-first-then-decide paradigm, i.e., they first sample multiple…
38 -
arXiv — Machine Learning research 1d ago
RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference
arXiv:2606.31519v1 Announce Type: new Abstract: Long-context Large Language Model inference is severely bottlenecked by the massive Key-Value (KV) cache, yet existing sparse attention methods often suffer from static fixed-budget (Top-k) retrieval or rely on proxy scores that…
5 -
arXiv — Machine Learning research 1d ago
On the Convergence of Self-Improving Online LLM Alignment
arXiv:2606.31524v1 Announce Type: new Abstract: The Self-Improving Alignment (SAIL) algorithm addresses distribution shift by reducing a bilevel formulation of the problem to an efficient, single-level method. Empirically, SAIL has demonstrated strong performance on this task.…
8 -
-
arXiv — Machine Learning research 1d ago
Introduction to Stochastic Differential Equations for Generative Machine Learning: A Variational Perspective
arXiv:2606.31576v1 Announce Type: new Abstract: The use of ordinary and stochastic differential equations has led to substantial progress in generative machine learning with applications to, for example, image, video and biomolecule generation. This paper provides a…
23 -
arXiv — Machine Learning research 1d ago
Robustness of neural networks to random noise perturbations of their inputs
arXiv:2606.31581v1 Announce Type: new Abstract: We investigate the problem of the robustness of a trained neural network to the perturbation of its input values. More specifically, we examine the interplay between the accuracy of the network, as measured by the mean squared…
11 -
arXiv — Machine Learning research 1d ago
Evil Spectra: How Optimisers can Amplify or Suppress Emergent Misalignment
arXiv:2606.31591v1 Announce Type: new Abstract: Emergent misalignment (EM) is a recently discovered phenomenon in LLMs where fine-tuning on a narrow misaligned task, such as writing insecure code, leads to broadly misaligned behaviour on unrelated prompts. Previous work has…
11 -
-
arXiv — Machine Learning research 1d ago
ECHO: Prune to act, trace to learn with selective turn memory in agentic RL
arXiv:2606.31650v1 Announce Type: new Abstract: Long-horizon language agents must repeatedly interact with tools, accumulate evidence, and make decisions under bounded context windows. Existing context-management methods make such rollouts feasible by truncating distant history,…
12 -
arXiv — Machine Learning research 1d ago
Improving Certified Robustness via Adversarial Distillation
arXiv:2606.31653v1 Announce Type: new Abstract: Certified training aims to produce models whose predictions can be formally verified against adversarial perturbations, typically by optimising upper bounds on the worst-case loss over an allowed perturbation set. For neural…
28 -
arXiv — Machine Learning research 1d ago
When to Truncate a Feature Ranking: A Residual-Overlap Stopping Rule for Subset Selection
arXiv:2606.31686v1 Announce Type: new Abstract: Feature rankings are widely used in supervised feature selection because they are simple, scalable and easy to interpret. Variables are first ranked by a relevance score, and a subset is then obtained by retaining the top-ranked…
17 -
arXiv — Machine Learning research 1d ago
Diffusing Blame: Task-Dependent Credit Assignment in Biologically Plausible Dual-Stream Networks
arXiv:2606.31700v1 Announce Type: new Abstract: Biological neural circuits obey Dale's principle: each neuron's synapses are uniformly excitatory or inhibitory. Artificial networks that respect this constraint must coordinate separate excitatory and inhibitory populations,…
28 -
arXiv — Machine Learning research 1d ago
Nonlinearity-Aware LoRA: Structured Gate Adaptation under Low-Rank Constraints
arXiv:2606.31717v1 Announce Type: new Abstract: Low-rank adaptation (LoRA) is commonly viewed as an update-space approximation to full fine-tuning, yet this view is incomplete for self-gated Transformer feed-forward networks. In gated FFNs, a low-rank residual can change not…
13 -
arXiv — Machine Learning research 1d ago
FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning
arXiv:2606.31742v1 Announce Type: new Abstract: Explainable AI (XAI) methods have demonstrated significant success in recent years at identifying relevant features in input data that drive deep learning model decisions, enhancing interpretability for users. However, the…
4 -
arXiv — Machine Learning research 1d ago
Addressing Over-Refusal in LLMs with Competing Rewards
arXiv:2606.31748v1 Announce Type: new Abstract: Safety training on language models often induces over-refusal: improved safety on harmful prompts at the cost of increased refusal on harmless ones. Though this trade-off can be mitigated by training models with reinforcement…
25 -
arXiv — Machine Learning research 1d ago
Policy Optimization Achieves Data-Dependent Regret Bounds in MDPs with Unknown Transitions
arXiv:2606.31769v1 Announce Type: new Abstract: We study policy optimization for online episodic tabular Markov decision processes with unknown transition kernels, aiming for best-of-both-worlds guarantees together with data-dependent regret bounds. Recent work (Dann et al.,…
8 -
arXiv — Machine Learning research 1d ago
Bridging the Gap Between Latent and Explicit Reasoning with Looped Transformers
arXiv:2606.31779v1 Announce Type: new Abstract: Language models typically reason via explicit chain-of-thought (CoT), generating intermediate steps token-by-token. Latent CoT offers an alternative: it performs multi-step reasoning in the model's hidden states, replacing decoded…
34 -
arXiv — Machine Learning research 1d ago
Relational and Sequential Conformal Inference for Energy Time Series over Graphs via Foundation Models
arXiv:2606.31804v1 Announce Type: new Abstract: Accurate energy demand forecasting is essential for the reliable operation and planning of modern sustainable energy systems. Spatial-temporal graph neural networks (STGNNs) have recently achieved strong performance in point…
7 -
arXiv — Machine Learning research 1d ago
Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in RLVR
arXiv:2606.31813v1 Announce Type: new Abstract: Low-rank adaptation (LoRA) and its variants enable parameter-efficient fine-tuning of large language models under the supervised fine-tuning (SFT) paradigm. However, their efficacy and behavior under Reinforcement learning with…
24 -
arXiv — Machine Learning research 1d ago
Low-dimensional topology of deep neural networks
arXiv:2606.31856v1 Announce Type: new Abstract: We study layered models, including feedforward networks, ResNets, and transformers, by limiting each layer to a width of $d = 3$, i.e., $\mathbb{R}^3$ as representation space. This allows us to track how a neural network changes…
17 -
arXiv — Machine Learning research 1d ago
Review Residuals: Update-Conditioned Residual Gating for Transformers
arXiv:2606.31859v1 Announce Type: new Abstract: Residual connections add every sublayer's proposed update with a fixed coefficient of one; the network never evaluates whether an update is reliable before committing it. Drawing on the human-factors principle of independent…
23 -
arXiv — Machine Learning research 1d ago
Sequential RC-TGAN: Generating Relational Time Series with Spectral Envelope Loss
arXiv:2606.31904v1 Announce Type: new Abstract: The generation of synthetic relational databases often involves modeling complex temporal dynamics, such as transaction logs or event sequences. A significant challenge in this domain is the handling of categorical time series…
26 -
arXiv — Machine Learning research 1d ago
Interface-Aware Neural Newton Preconditioning for Robust Cohesive Zone Model Simulations
arXiv:2606.31921v1 Announce Type: new Abstract: Cohesive Zone Models (CZMs) are widely used to simulate interface fracture, delamination, adhesive failure, and fiber--matrix debonding in aerospace composite structures. In implicit quasi-static finite element analyses, cohesive…
27 -
arXiv — Machine Learning research 1d ago
Making Sense of Touch from the Child's View for Contrastive Learning
arXiv:2606.31943v1 Announce Type: new Abstract: Is the sense of touch a mechanism for human babies' learning of visual concepts? If so, can we quantify its importance, and to what extent do babies rely on their sense of touch for visual learning? To approach these questions in a…
35 -
arXiv — Machine Learning research 1d ago
Signed-Permutation Coordinate Transport for RMSNorm Transformers
arXiv:2606.31963v1 Announce Type: new Abstract: Modern LLM workflows move coordinate-indexed objects across checkpoints: steering vectors, sparse autoencoders, top-$k$ neuron sets, attribution lists, and merge alignments. This is only well posed after fixing the model's…
37 -
arXiv — Machine Learning research 1d ago
Amplifying Membership Signal Through Chained Regeneration
arXiv:2606.31991v1 Announce Type: new Abstract: The tendency of large generative models to memorize training data makes sample verification critical for privacy auditing and copyright enforcement. Current membership (MIA) and dataset inference (DI) attacks often rely on one-shot…
7 -
arXiv — Machine Learning research 1d ago
Radial Suppression Accelerates Algorithmic Generalization: A Geometric Analysis of Delayed Generalization
arXiv:2606.32000v1 Announce Type: new Abstract: Why do neural networks memorize algorithmic training data long before they generalize? We present a geometric case study demonstrating that, on tasks where generalization requires discovering structured low-dimensional circuits,…
17 -
arXiv — Machine Learning research 1d ago
Surrogate Fidelity: When Can Open LLMs Explain Closed Ones?
arXiv:2606.32008v1 Announce Type: new Abstract: Mechanistic interpretability (MI) requires full access to model internals, yet the APIs for most widely deployed language models at best expose log-probabilities over output tokens. This creates a surrogate problem: when do…
28 -
arXiv — Machine Learning research 1d ago
CoMet: Context and Multiplicity Decomposition for Multimodal Uncertainty Estimation
arXiv:2606.32012v1 Announce Type: new Abstract: Uncertainty estimation has been a long-standing challenge in AI models; it amounts to "knowing what you don't know," and metacognition is notoriously difficult even for humans (cf. the Dunning-Kruger effect). Although it is still…
8 -
arXiv — Machine Learning research 1d ago
FedLAB: Traceable Semantic Codebooks for Federated Multimodal Graph Foundation Learning
arXiv:2606.32016v1 Announce Type: new Abstract: Multimodal graph foundation models aim to learn reusable knowledge from graphs enriched with text, images, attributes, and relational topology, thereby supporting diverse graph-centric and modality-centric tasks. In practice,…
14 -
arXiv — Machine Learning research 1d ago
TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning
arXiv:2606.32017v1 Announce Type: new Abstract: Agentic reinforcement learning requires assigning credit to environment-facing actions such as searches, clicks, edits, navigation commands, and object interactions. Standard GRPO uses the final verifier outcome as a uniform…
13 -
arXiv — Machine Learning research 1d ago
SemRF: A Semantic Reference Frame for Residual-Stream Dynamics in Language Models
arXiv:2606.32022v1 Announce Type: new Abstract: Residual-stream analysis asks how language-model computation evolves across depth, but intermediate decoding requires comparable readout coordinates across layers. If embedding anchors and unembedding readout disagree on the chosen…
23 -
arXiv — Machine Learning research 1d ago
AdaJEPA: An Adaptive Latent World Model
arXiv:2606.32026v1 Announce Type: new Abstract: Latent world models enable planning from high-dimensional observations by predicting future states in a compact latent space. However, these models are typically kept frozen at test time: when their predictions become inaccurate,…
24 -
arXiv — Machine Learning research 1d ago
QVal: Cheaply Evaluating Dense Supervision Signals for Long-Horizon LLM Agents
arXiv:2606.32034v1 Announce Type: new Abstract: LLM agents increasingly act over long horizons, where a single trajectory can contain hundreds or thousands of actions. In these settings, outcome-only rewards provide too sparse guidance, failing to inform the model about the…
36 -
arXiv — Machine Learning research 1d ago
Seven-dimensional Trajectory Reconstruction for VAMOS++
arXiv:2503.18959v1 Announce Type: cross Abstract: The VAMOS++ magnetic spectrometer is characterized by a large angular and momentum acceptance and highly non-linear ion optics properties requiring the use of software ion trajectory reconstruction methods to measure the ion…
31 -
-
arXiv — Machine Learning research 1d ago
MediEncoder: Nonlinear Representation Learning for High-Dimensional Causal Mediation Analysis
arXiv:2606.30648v1 Announce Type: cross Abstract: Causal mediation analysis decomposes a treatment effect into indirect pathways through mediators and direct pathways not operating through them. Modern biomedical studies often involve high-dimensional covariates and mediators…
10 -
arXiv — Machine Learning research 1d ago
Can Physician Expertise Improve Machine Learning Identification of Delirium?
arXiv:2606.30651v1 Announce Type: cross Abstract: Delirium is common in hospitalized patients and is often missed in routine care. We present a user-centered interactive machine learning (UC-iML) framework for delirium detection support that combines physician-guided feature…
30 -
arXiv — Machine Learning research 1d ago
Estimating the Effect of Timing on Coupon Effectiveness
arXiv:2606.30664v1 Announce Type: cross Abstract: The coupon incentive is one of the most common tools marketers use to court users to engage with a business at various stages of the customer life cycle. A variety of factors can affect the effectiveness of a coupon incentive on…
7 -
arXiv — Machine Learning research 1d ago
Explainable Artificial Intelligence For The Detection and Characterisation of Stage B Heart Failure
arXiv:2606.30665v1 Announce Type: cross Abstract: Stage B heart failure is characterized by asymptomatic structural or functional cardiac abnormalities. Identifying individuals at this stage is clinically important, as early detection may enable targeted interventions to prevent…
20 -
arXiv — Machine Learning research 1d ago
Listening Between the Lines: Joint Learning of ASR Embeddings and LLM-Augmented Linguistics for Dementia Detection
arXiv:2606.30675v1 Announce Type: cross Abstract: Early detection of dementia through speech analysis offers a non-invasive screening alternative, but capturing both acoustic and linguistic biomarkers remains challenging. We propose a multimodal framework leveraging Whisper for…
28 -
-
arXiv — Machine Learning research 1d ago
Locker-based Truck-Drone Routing with Integrated Considerations of Pickups, Deliveries, and No-Fly Zones
arXiv:2606.30680v1 Announce Type: cross Abstract: Truck-drone delivery is an emerging last-mile logistics mode combining the long-haul capacity of trucks with the flexible service capability of drones. In locker-based operations, smart lockers serve not only as temporary parcel…
22 -
arXiv — Machine Learning research 1d ago
A Coherence Law for Trainability in Noisy Equivariant Quantum Neural Networks
arXiv:2606.30688v1 Announce Type: cross Abstract: Symmetry provides a quantum neural network structure, but on its own it does not keep the network trainable once noise is present. We ask which physical quantity decides whether the gradients of an equivariant circuit survive…
22 -
arXiv — Machine Learning research 1d ago
ViTL: Temporal Logic-Guided Zero-Shot Natural Language Navigation via Vision-Language Models
arXiv:2606.30696v1 Announce Type: cross Abstract: Enabling robots to follow natural language commands to complete zero-shot long-horizon tasks remains challenging. It requires extracting implicit temporal and logical constraints from natural language commands and executing…
4 -
arXiv — Machine Learning research 1d ago
BEST-RQ-2: Contextualize-Then-Predict, a Two-Step Approach for Self-Supervised Audio Representations
arXiv:2606.30700v1 Announce Type: cross Abstract: Self-supervised learning enables audio representations that transfer across domains and tasks. We present BEST-RQ-2, an evolution of BEST-RQ that retains frozen randomprojection-based discrete targets while introducing a two-step…
19 -
arXiv — Machine Learning research 1d ago
Diffusion-warm sampling of the XY model enables fast thermalization at scale
arXiv:2606.30773v1 Announce Type: cross Abstract: We introduce a novel technique for scalable sampling of spin-system states with continuous symmetries using diffusion models. By applying our approach to the XY model, a fundamental continuous-spin model in condensed matter…
7