Tag

Research papers

500 articles archived under #paper · RSS

arXiv — Machine Learning research 4h ago

Accelerating Discrete Diffusion Models with Parallel-In-Time Sampling

arXiv:2607.00773v1 Announce Type: new Abstract: Discrete diffusion models are widely used for learning and generating discrete distributions. As the generation process is inherently sequential, the acceleration of sampling is of significant importance. In this work, we…

25
arXiv — Machine Learning research 4h ago

Which Metric Reflects the Spelling Rate Accuracy in Event-Related Potential-Based Brain-Computer Interfaces?

arXiv:2607.00794v1 Announce Type: new Abstract: For predictive models, the often-reported performance metrics are the loss and accuracy. In synchronous Brain- Computer Interface (BCI) systems, these metrics are informative for most BCI paradigms; however, for Event-Related…

36
arXiv — Machine Learning research 4h ago

Task-Relevant Representation Decoupling for Visual Reinforcement Learning Generalization

arXiv:2607.00796v1 Announce Type: new Abstract: Visual Reinforcement Learning (VRL) has achieved considerable success in solving control tasks. However, generalizing learned policies to new environments remains a major challenge, as agents often overfit to task-irrelevant…

32
arXiv — Machine Learning research 4h ago

Local Motion Matters: A Deconstruct-Recompose Paradigm for Reinforcement Learning Pre-training from Videos

arXiv:2607.00808v1 Announce Type: new Abstract: Pre-training on large-scale videos to improve reinforcement learning efficiency is promising yet remains challenging. Existing methods typically treat the agent as an indivisible entity, modeling motion patterns globally. Such…

8
arXiv — Machine Learning research 4h ago

From Pixels to Temporal Correlations: Learning Informative Representations for Reinforcement Learning Pre-training

arXiv:2607.00811v1 Announce Type: new Abstract: Unsupervised pre-training on large-scale datasets has demonstrated significant potential for improving the sample efficiency and performance of Reinforcement Learning (RL). Given the large-scale action-free internet videos,…

13
arXiv — Machine Learning research 4h ago

Spectroscopy Analysis with Machine Learning Regression for the Quantification of Carbon and Nitrogen Contents in Inceptisol and Oxisol Soil Types: Comparing Different Preprocessing and Validation methods as well as Feature Importance

arXiv:2607.00834v1 Announce Type: new Abstract: Near-Infrared (NIR) spectroscopy has emerged as a promising alternative to traditional soil analysis methods, offering advantages such as speed, low cost, and non-destructive testing. This work proposes a machine learning (ML)…

9
arXiv — Machine Learning research 4h ago

Constrained Bayesian Optimisation with Multiple Information Sources

arXiv:2607.00865v1 Announce Type: new Abstract: Bayesian Optimisation (BO) under unknown constraints is particularly challenging when feasible regions are small. In such settings, existing methods that typically rely solely on evaluations of the true objective and constraints…

32
arXiv — Machine Learning research 4h ago

Beyond Activation Alignment:The Alignment-Diversity Tradeoff in Task-Aware LLM Quantization

arXiv:2607.00908v1 Announce Type: new Abstract: Mixed-precision quantization (MPQ) has become a key technique for deploying large language models under stringent memory and compute constraints. We first identify a phenomenon that we term the Perplexity Illusion: layers ranked as…

7
arXiv — Machine Learning research 4h ago

Valdi: Value Diffusion World Models

arXiv:2607.00917v1 Announce Type: new Abstract: World models can enable Model Predictive Control (MPC), but this requires dynamics prediction that is both fast enough for online use and expressive enough to represent uncertain futures. Diffusion models offer a natural mechanism…

18
arXiv — Machine Learning research 4h ago

Human-Machine Collaboration on Generative Meta-Learning: Model and Algorithm

arXiv:2607.00926v1 Announce Type: new Abstract: Generalizing machine learning models to environments that differ from their training distribution remains a critical hurdle, particularly when data from the target domain is entirely or partially unavailable. We propose Generative…

33
arXiv — Machine Learning research 4h ago

Explainable AI for Cancer Drug Response Prediction: Beyond Univariate Feature Attributions

arXiv:2607.00931v1 Announce Type: new Abstract: Predicting cancer drug response from transcriptomic profiles is a cornerstone of precision oncology, yet the scientific value of machine learning models hinges not solely on predictive accuracy, but also on their capacity to…

30
arXiv — Machine Learning research 4h ago

Diffeomorphic Optimization

arXiv:2607.00947v1 Announce Type: new Abstract: Generative models learn data distributions that reside on a low-dimensional manifold within a higher-dimensional ambient space. Optimizing differentiable objectives on this manifold is challenging: the ambient loss landscape is…

16
arXiv — Machine Learning research 4h ago

Aionoscope: Debugging Latent-State Accessibility in Time-Series Representations

arXiv:2607.00956v1 Announce Type: new Abstract: Time-series models are often evaluated by what they can forecast or classify, but those scores do not show whether their representations preserve the process state a user may want to inspect: event timing, phase, amplitude,…

7
arXiv — Machine Learning research 4h ago

LeNEPA: No-Augmentation Next-Latent Prediction for Time-Series Representation Learning

arXiv:2607.00958v1 Announce Type: new Abstract: Time series are central to modern data mining applications, from industrial telemetry and server metrics to finance and physiology, yet time-series self-supervised learning often depends on view and augmentation choices that encode…

14
arXiv — Machine Learning research 4h ago

Automatic Detection of Stress from Speech in the Trier Social Stress Test

arXiv:2607.00986v1 Announce Type: new Abstract: Automatically detecting stress in speech provides an unobtrusive way to gain insights relevant to behavioral research or clinical assessment. This study investigates the automatic differentiation between a stressful and…

12
arXiv — Machine Learning research 4h ago

Generative Model Proposal based Particle Filtering for Data Assimilation

arXiv:2607.01012v1 Announce Type: new Abstract: Data assimilation models state dynamics conditioned on sequential observations, and has wide-ranging scientific applications. In the filtering setting, the goal is to model the posterior over the current state given all…

21
arXiv — Machine Learning research 4h ago

Seahorse: A Unified Benchmarking Framework for Spatiotemporal Event Modeling

arXiv:2607.01022v1 Announce Type: new Abstract: Spatiotemporal point processes (STPPs) model event data in continuous time and space, with applications in mobility, epidemiology, and public safety. Recent neural STPPs span expressive intensity models, conditional density models,…

14
arXiv — Machine Learning research 4h ago

The Model Organism Lottery: Model Organism Interpretability Strongly Depends on Training Methodology

arXiv:2607.01033v1 Announce Type: new Abstract: Model organisms (MOs) - language models trained to exhibit undesired or unnatural behaviours - are frequently used as testbeds for evaluating white-box interpretability techniques. Current MOs are typically constructed via post-hoc…

10
arXiv — Machine Learning research 4h ago

GSRQ: Gain-Shape Residual Quantization for Sub-1-bit KV Cache

arXiv:2607.01065v1 Announce Type: new Abstract: The deployment of Large Language Models (LLMs) with extended context windows is increasingly constrained by the linear growth of Key-Value (KV) cache memory. Vector Quantization (VQ), particularly Residual Quantization (RQ), is a…

36
arXiv — Machine Learning research 4h ago

Balancing Expressivity and Learnability in Quantum Kernel Bandit Optimization

arXiv:2607.01080v1 Announce Type: new Abstract: We investigate Gaussian process (GP) bandit optimization with quantum kernels, assuming the mean reward function lies in the reproducing kernel Hilbert space (RKHS) induced by the quantum kernel. This setting is motivated by…

6
arXiv — Machine Learning research 4h ago

When Context Compensates for Sparse Event History: AlphaEarth for Spatio-Temporal Point-Process Forecasting

arXiv:2607.01082v1 Announce Type: new Abstract: Spatio-temporal point-process models must often generalise across space when local event histories are sparse. We study whether exogenous spatial context can compensate in such regimes. Using a fixed log-Gaussian Cox process…

32
arXiv — Machine Learning research 4h ago

Staleness-Learning Rate Scaling Laws for Asynchronous RLHF

arXiv:2607.01083v1 Announce Type: new Abstract: High-throughput RLHF systems often decouple rollout generation from policy optimization, leading to the use of stale rollouts during learner updates. In this work, we study the effect of such staleness in asynchronous GRPO. We make…

23
arXiv — NLP / Computation & Language research 4h ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

arXiv:2607.01104v1 Announce Type: cross Abstract: In Large Language Model (LLM) training, data mixing plays a pivotal role in determining model performance. Recent methods optimize mixture weights via proxy models, but they rely on the assumption of static data distributions. As…

31
arXiv — Machine Learning research 4h ago

SynLaD: Latent Diffusion for Generating Synthesizable Molecules Conditioned on 3D Pharmacophore Profiles

arXiv:2607.01105v1 Announce Type: new Abstract: We present SynLaD, a latent diffusion framework for small-molecule generation that unifies ligand-based drug design objectives (what to make) with synthetic accessibility (how to make it). Current models typically optimize one…

10
arXiv — Machine Learning research 4h ago

Muon as a Residual Connection

arXiv:2607.01124v1 Announce Type: new Abstract: Muon has recently emerged as one of the most effective optimizers for training large neural networks, yet its empirical success has been explained from several different perspectives. In this paper, we propose a simple mechanistic…

6
arXiv — Machine Learning research 4h ago

ZO-Act: Efficient Zeroth-Order Fine-Tuning via One-Shot Activation-Informed Low-Rank Subspaces

arXiv:2607.01125v1 Announce Type: new Abstract: Zeroth-order (ZO) optimization enables fine-tuning large language models when backpropagation is unavailable or memory-prohibitive, but existing methods often perturb full model weights or randomly constructed low-dimensional…

4
arXiv — Machine Learning research 4h ago

GAIA: Geometry-Adaptive Operator Learning for Forward and Inverse Problems

arXiv:2607.01128v1 Announce Type: new Abstract: Operator learning for partial differential equations (PDEs) on arbitrary geometries builds fast neural surrogates for large-scale simulation. Although recent geometry-adaptive neural operators have made substantial progress, they…

17
arXiv — Machine Learning research 4h ago

Sequentially-Controlled Interactive Multi-Particle Flow-Maps for Online Feedback-Driven Search

arXiv:2607.01144v1 Announce Type: new Abstract: While generative models have enabled training-free reward alignment, current methods typically excel in local exploration within narrow regions of the underlying distribution. These approaches struggle when preferences are unknown…

16
arXiv — Machine Learning research 4h ago

A Lightweight Self-Supervised Learning Framework for Multivariate Time Series using Hierarchical-JEPA on ECG Data

arXiv:2607.01145v1 Announce Type: new Abstract: Data analysis in the medical domain often encounters scenarios involving a limited target dataset and a large, unannotated dataset with a general distribution. Under such circumstances, self-supervised learning (SSL) methods are…

31
arXiv — Machine Learning research 4h ago

Efficient Compression of Structured and Unstructured Volumes via Learned 3D Gaussian Representation

arXiv:2607.01164v1 Announce Type: new Abstract: Recent work has shown that implicit neural representations (INRs) can be trained to effectively compress structured and unstructured volume data, allowing for direct data querying with a reduced memory footprint. However, as…

36
arXiv — Machine Learning research 4h ago

Decision-Aware Training for Sample-Based Generative Models

arXiv:2607.01171v1 Announce Type: new Abstract: Sample-based generative models are increasingly used for probabilistic forecasting in high-stakes decision settings, yet their training objectives are blind to the decision maker's cost structure. These models are commonly trained…

5
arXiv — NLP / Computation & Language research 4h ago

QuasiMoTTo: Quasi-Monte Carlo Test-Time Scaling

arXiv:2607.01179v1 Announce Type: cross Abstract: Scaling inference compute, by generating many parallel attempts per problem, is a costly but reliable lever for improving language model capabilities. By default these attempts are generated independently, wasting inference…

36
arXiv — NLP / Computation & Language research 4h ago

Right in the Right Way: LM Training with Verifiable Rewards and Human Demonstrations

arXiv:2607.01181v1 Announce Type: cross Abstract: RL with verifiable rewards (RLVR) has emerged as a powerful paradigm for training LMs on tasks with well-defined success metrics, such as code generation and mathematical reasoning. However, current RLVR methods optimize only…

25
arXiv — Machine Learning research 4h ago

Neural Certificate Pricing for Combinatorial Optimization Problems

arXiv:2607.01185v1 Announce Type: new Abstract: Combinatorial optimization (CO) problems are difficult because certifiable discrete structure induces exponential search. One needs to search over the set exponentially many candidates to certify optimality, however, the structural…

23
arXiv — Machine Learning research 4h ago

Quantum vs. Classical Machine Learning: A Unified Empirical Comparison

arXiv:2607.01197v1 Announce Type: new Abstract: Quantum computing has emerged as a promising computational paradigm for machine learning (ML), with the potential to offer computational advantages over classical approaches. At this stage, the evidence supporting the performance…

25
arXiv — Machine Learning research 4h ago

TiRex-2: Generalizing TiRex to Multivariate Data and Streaming

arXiv:2607.01204v1 Announce Type: new Abstract: We introduce TiRex-2, a recurrent xLSTM-based time series foundation model that generalizes the univariate TiRex to multivariate forecasting with both past and future covariates. Real-world forecasting is inherently sequential:…

17
arXiv — Machine Learning research 4h ago

Language-Critique Imitation Learning from Suboptimal Demonstrations

arXiv:2607.01225v1 Announce Type: new Abstract: Prior work on imitation learning from suboptimal demonstrations typically relies on compressed supervision signals such as confidence estimates, discriminator scores, or importance weights. These scalar signals are inherently…

32
arXiv — NLP / Computation & Language research 4h ago

Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training

arXiv:2607.01232v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become a central component of post-training large language models (LLMs), yet little is understood about how RL adaptation is distributed across transformer layers. Existing approaches typically…

7
arXiv — Machine Learning research 4h ago

Why Advanced Encoders Lag on Sparse Retrieval? The Answer and an Approach to Bridging Vocabulary Gaps

arXiv:2607.00004v1 Announce Type: cross Abstract: While advanced foundation models like ModernBERT significantly outperform older architectures in dense retrieval, they surprisingly lag behind the aging BERT-base baseline in learned sparse retrieval (LSR). We identify the root…

16
arXiv — Machine Learning research 4h ago

Urban Deceleration Behavior Modes Under Scene Context: An Early-Kinematic Classifier from Argoverse 2 Multi-Agent Trajectories

arXiv:2607.00027v1 Announce Type: cross Abstract: Urban deceleration is one of the most empirically studied yet least taxonomically organized behaviors in car-following research. Recent perception-equipped autonomous-vehicle datasets enable trajectory-anchored mode discovery. We…

18
arXiv — Machine Learning research 4h ago

Spatio-Temporal Gaussian Process for Building Terrain-Incorporating Wind Power Curves

arXiv:2607.00051v1 Announce Type: cross Abstract: Accurate modeling of wind turbine power curves is crucial for optimal wind farm operation. Nearly all existing power curve models focus on temporal variables such as wind speed and temperature while overlooking the influence of…

16
arXiv — NLP / Computation & Language research 4h ago

Harnessing the Latent Space: From Steering Vectors to Model Calibrators for Control and Trust

arXiv:2607.00083v1 Announce Type: new Abstract: Language models have changed from unreliable text generators to highly-capable large models with trillions of parameters. Capability increases come hand-in-hand with increases in scale, making understanding the internal…

36
arXiv — NLP / Computation & Language research 4h ago

CogTax: A Four-Level Cognitive Taxonomy for Command-Line Computing Education

arXiv:2607.00140v1 Announce Type: cross Abstract: As computing education expands beyond traditional programming into operational domains such as systems administration and command-line environments, existing pedagogical frameworks struggle to capture a dimension that is critical…

12
arXiv — Machine Learning research 4h ago

A Mechanism-Driven Theory of Phase Transitions in Active Learning

arXiv:2607.00144v1 Announce Type: cross Abstract: Active learning (AL) performance is known to be budget-dependent, yet regimes are typically defined by heuristic label counts that fail to generalize across datasets or architectures. We characterize AL dynamics by reframing…

12
arXiv — Machine Learning research 4h ago

Steal the Patch Size: Adversarially Manipulate Vision-Language Models

arXiv:2607.00174v1 Announce Type: cross Abstract: We present a black-box model-stealing attack that recovers private vision-tokenizer configurations of deployed vision-language models (VLMs), including the visual patch size and input preprocessing pipeline. The key idea is a…

34
arXiv — Machine Learning research 4h ago

HydraCollab: Adaptive Collaborative-Perception for Distributed Autonomous Systems

arXiv:2607.00191v1 Announce Type: cross Abstract: Collaborative-perception enables multi-robot systems to enhance situational awareness by sharing perceptual information. Existing collaborative-perception systems face an inherent trade-off between communication bandwidth…

22
arXiv — Machine Learning research 4h ago

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent

arXiv:2607.00207v1 Announce Type: cross Abstract: We develop a framework for analyzing the learning dynamics of $\ell_2$-adversarial training of single-index models on Gaussian mixtures in the high-dimensional limit under streaming stochastic gradient descent (SGD). We derive…

35
arXiv — NLP / Computation & Language research 4h ago

SLIM-RL: Risk-Budgeted Random-Masking RL for Diffusion LLMs Without Trajectory Slicing

arXiv:2607.00208v1 Announce Type: new Abstract: Reinforcement learning for diffusion large language models (dLLMs) has largely moved to trajectory-aware methods. The current state of the art, TraceRL, holds that random masking is mismatched with the model's inference trajectory,…

19
arXiv — Machine Learning research 4h ago

Sample Complexities of Estimating Gumbel--Max Watermark Proportions with and without Reduction to Pivotal Statistics

arXiv:2607.00224v1 Announce Type: cross Abstract: Watermarking promises a statistical trace of large language model (LLM) use, but real documents, after editing or paraphrasing, rarely arrive as purely human-written or purely machine-generated. This motivates a quantitative…

38
arXiv — Machine Learning research 4h ago

Leveraging Multimodality for Real-Time Classification of Transients and Variables found by the Zwicky Transient Facility

arXiv:2607.00228v1 Announce Type: cross Abstract: Modern time-domain surveys such as the Zwicky Transient Facility (ZTF) generate hundreds of thousands of alerts each night, making real-time decisions for follow-up observations a central challenge in time-domain astronomy.…

28

Accelerating Discrete Diffusion Models with Parallel-In-Time Sampling

Which Metric Reflects the Spelling Rate Accuracy in Event-Related Potential-Based Brain-Computer Interfaces?

Task-Relevant Representation Decoupling for Visual Reinforcement Learning Generalization

Local Motion Matters: A Deconstruct-Recompose Paradigm for Reinforcement Learning Pre-training from Videos

From Pixels to Temporal Correlations: Learning Informative Representations for Reinforcement Learning Pre-training

Spectroscopy Analysis with Machine Learning Regression for the Quantification of Carbon and Nitrogen Contents in Inceptisol and Oxisol Soil Types: Comparing Different Preprocessing and Validation methods as well as Feature Importance

Constrained Bayesian Optimisation with Multiple Information Sources

Beyond Activation Alignment:The Alignment-Diversity Tradeoff in Task-Aware LLM Quantization

Valdi: Value Diffusion World Models

Human-Machine Collaboration on Generative Meta-Learning: Model and Algorithm

Explainable AI for Cancer Drug Response Prediction: Beyond Univariate Feature Attributions

Diffeomorphic Optimization

Aionoscope: Debugging Latent-State Accessibility in Time-Series Representations

LeNEPA: No-Augmentation Next-Latent Prediction for Time-Series Representation Learning

Automatic Detection of Stress from Speech in the Trier Social Stress Test

Generative Model Proposal based Particle Filtering for Data Assimilation

Seahorse: A Unified Benchmarking Framework for Spatiotemporal Event Modeling

The Model Organism Lottery: Model Organism Interpretability Strongly Depends on Training Methodology

GSRQ: Gain-Shape Residual Quantization for Sub-1-bit KV Cache

Balancing Expressivity and Learnability in Quantum Kernel Bandit Optimization

When Context Compensates for Sparse Event History: AlphaEarth for Spatio-Temporal Point-Process Forecasting

Staleness-Learning Rate Scaling Laws for Asynchronous RLHF

CausalMix: Data Mixture as Causal Inference for Language Model Training

SynLaD: Latent Diffusion for Generating Synthesizable Molecules Conditioned on 3D Pharmacophore Profiles

Muon as a Residual Connection

ZO-Act: Efficient Zeroth-Order Fine-Tuning via One-Shot Activation-Informed Low-Rank Subspaces

GAIA: Geometry-Adaptive Operator Learning for Forward and Inverse Problems

Sequentially-Controlled Interactive Multi-Particle Flow-Maps for Online Feedback-Driven Search

A Lightweight Self-Supervised Learning Framework for Multivariate Time Series using Hierarchical-JEPA on ECG Data

Efficient Compression of Structured and Unstructured Volumes via Learned 3D Gaussian Representation

Decision-Aware Training for Sample-Based Generative Models

QuasiMoTTo: Quasi-Monte Carlo Test-Time Scaling

Right in the Right Way: LM Training with Verifiable Rewards and Human Demonstrations

Neural Certificate Pricing for Combinatorial Optimization Problems

Quantum vs. Classical Machine Learning: A Unified Empirical Comparison

TiRex-2: Generalizing TiRex to Multivariate Data and Streaming

Language-Critique Imitation Learning from Suboptimal Demonstrations

Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training

Why Advanced Encoders Lag on Sparse Retrieval? The Answer and an Approach to Bridging Vocabulary Gaps

Urban Deceleration Behavior Modes Under Scene Context: An Early-Kinematic Classifier from Argoverse 2 Multi-Agent Trajectories

Spatio-Temporal Gaussian Process for Building Terrain-Incorporating Wind Power Curves

Harnessing the Latent Space: From Steering Vectors to Model Calibrators for Control and Trust

CogTax: A Four-Level Cognitive Taxonomy for Command-Line Computing Education

A Mechanism-Driven Theory of Phase Transitions in Active Learning

Steal the Patch Size: Adversarially Manipulate Vision-Language Models

HydraCollab: Adaptive Collaborative-Perception for Distributed Autonomous Systems

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent

SLIM-RL: Risk-Budgeted Random-Masking RL for Diffusion LLMs Without Trajectory Slicing

Sample Complexities of Estimating Gumbel--Max Watermark Proportions with and without Reduction to Pivotal Statistics

Leveraging Multimodality for Real-Time Classification of Transients and Variables found by the Zwicky Transient Facility