Tag

Training

450 articles archived under #training · RSS

arXiv — NLP / Computation & Language research 21d ago

Detecting Sensitive Personal Information in Japanese Pre-Training Corpora for Large Language Models

arXiv:2606.12114v1 Announce Type: new Abstract: Sensitive personal information can appear in large-scale pre-training corpora for large language models (LLMs). Detecting and filtering such information is therefore essential to ensure compliance with privacy regulations and…

34
arXiv — NLP / Computation & Language research 21d ago

ALIGNBEAM : Inference-Time Alignment Transfer via Cross-Vocabulary Logit Mixing

arXiv:2606.12342v1 Announce Type: new Abstract: Domain fine-tuning degrades the safety of large language models: fine-tuned specialists readily comply with harmful prompts framed in domain language. Existing inference-time defenses that mix logits from a safe anchor model…

18
Hugging Face Daily Papers research 21d ago

Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay

Abstract Continual Instruction Tuning enables effective fine-tuning of large language models for low-resource language translation, achieving superior performance compared to standard instruction tuning and multilingual models. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Large…

4
r/MachineLearning community 21d ago

Pyrecall open source tool for detecting catastrophic forgetting during LLM fine-tuning[P]

Surprised there's no real tooling for this given how much research exists on continual learning. Built pyrecall to fill the gap. Snapshots skill scores before/after fine-tuning, flags regressions, rolls back LoRA adapters by name. Fully local, no external APIs. v0.1.0, MIT, pip…

17
r/LocalLLaMA community 21d ago

SenseNova U1 dropped an infographic-specific finetune

it's the same U1-8B-MoT base with an extended MT (multi-task) training phase focused on structured visual output. the benchmark jumps are significant: IGenBench I-ACC (infographic accuracy) : 4.2👉17.0 (4x) Chart Understanding: 51.3👉69.5Text Rendering: 39.8👉46.6Overall…

32
Hugging Face Daily Papers research 22d ago

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Abstract Chain-of-thought supervised fine-tuning degrades long-context recall in hybrid linear-attention models by biasing attention gradients toward short-range patterns, but a training-free method called QK-Restore can restore long-context capabilities by reverting query-key…

8
arXiv — Machine Learning research 22d ago

Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning

arXiv:2606.09866v1 Announce Type: new Abstract: Fine-tuning safety aligned large language models (LLMs) on downstream data improves adaptation but may erode learned safety behavior. Existing methods use fixed safety examples, global constraints, or one-sided task filtering. Our…

28
arXiv — Machine Learning research 22d ago

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

arXiv:2606.09932v1 Announce Type: new Abstract: Supervised Fine-Tuning (SFT) followed by Reinforcement Learning (RL) has become a standard pipeline for Large Language Model (LLM) post-training. SFT is expected to provide a useful behavioral prior for RL to further enhance model…

30
arXiv — Machine Learning research 22d ago

A Unified Adaptive Feature Composition Framework for Multi-Task Generalization in Wireless Foundation Models

arXiv:2606.10277v1 Announce Type: new Abstract: Though wireless foundation models (WFMs) have shown strong potential in learning universal channel representations, their adaptation to various downstream tasks remains constrained by existing paradigms. Fine-tuning strategies…

19
arXiv — Machine Learning research 22d ago

Revisiting Positive Samples in Graph Contrastive Learning: From the Perspective of Message Passing

arXiv:2606.10284v1 Announce Type: new Abstract: Graph Contrastive Learning (GCL), which trains graph encoders by maximizing similarity between positive samples and minimizing it between negative ones, has emerged as a mainstream graph pre-training paradigm. It is widely…

16
arXiv — NLP / Computation & Language research 22d ago

CodeAlchemy: Synthetic Code Rewriting at Scale

arXiv:2606.10087v1 Announce Type: new Abstract: Pre-training on raw code teaches syntax but provides sparse signal for diverse real-world task formats. While synthetic data has proven transformative for language models, code remains largely unexplored beyond limited quality…

29
arXiv — NLP / Computation & Language research 22d ago

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

arXiv:2606.10327v1 Announce Type: new Abstract: Automated Essay Scoring (AES) systems must judge interdependent discourse elements (e.g., lead, claim, evidence, conclusion), yet most approaches treat these in isolation, harming coherence and generalization. We investigate…

19
arXiv — NLP / Computation & Language research 22d ago

Hidden Consensus:Preference-Validity Compression in Human Feedback

arXiv:2606.10569v1 Announce Type: new Abstract: Standard RLHF pipelines often reduce heterogeneous human judgments into a single scalar reward target. We argue that this reduction can mis-measure alignment in structurally plural societies, where disagreement may reflect…

7
arXiv — NLP / Computation & Language research 22d ago

Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning

arXiv:2606.10610v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) has become essential for adapting foundation models to downstream NLP tasks. However, current PEFT methods often struggle with robustness to noise and performance degradation on limited…

25
arXiv — NLP / Computation & Language research 22d ago

Speaker Group Encoding in Self-supervised Speech Recognition Models

arXiv:2606.10654v1 Announce Type: new Abstract: We investigate what self-supervised speech recognition models (S3Ms) learn about speaker groups (SGs). We examine several states of S3Ms: pretrained, finetuned on speaker identification (SID), finetuned on automatic speech…

10
arXiv — NLP / Computation & Language research 22d ago

Attention Expansion: Enhancing Keyphrase Extraction from Long Documents with Attention-Augmented Contextualized Embeddings

arXiv:2606.10716v1 Announce Type: new Abstract: Pre-trained language models (PLMs) have achieved strong performance in keyphrase extraction (KPE), largely due to their ability to generate rich contextualized representations. However, long-document KPE remains challenging because…

30
arXiv — NLP / Computation & Language research 22d ago

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

arXiv:2606.11052v1 Announce Type: new Abstract: Chain-of-thought (CoT) supervised fine-tuning (SFT) is widely adopted to improve reasoning ability, yet we find that it systematically degrades long-context recall in hybrid linear-attention models. Across architectures including…

26
arXiv — NLP / Computation & Language research 22d ago

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

arXiv:2606.10279v1 Announce Type: cross Abstract: Supervised fine-tuning with synthetic rationale data is widely assumed to improve language model performance on clinical prediction tasks by teaching models not just what to predict but why. We test this assumption on five-year…

28
arXiv — NLP / Computation & Language research 22d ago

Advancing the State-of-the-Art in Empirical Privacy Auditing

arXiv:2606.10481v1 Announce Type: cross Abstract: Parameter-efficient fine-tuning of large language models (LLMs) can exhibit problematic memorization of individual training examples. Empirical privacy auditing (EPA) quantifies this risk by measuring realistic data leakage on…

23
arXiv — NLP / Computation & Language research 22d ago

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

arXiv:2606.10528v1 Announce Type: cross Abstract: Current reinforcement learning from human feedback (RLHF) methods primarily rely on scalar rewards from a trained reward model (RM). While effective, scalar rewards are often noisy and fail to capture fine-grained preference…

32
Hugging Face Daily Papers research 22d ago

Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

Abstract Sycophancy fine-tuning contributes to emergent misalignment in language models, which can be reversed using Alignment Gating—a method that inserts learnable gates to identify and control unsafe responses while maintaining general capabilities. Generated by…

24
Hugging Face Daily Papers research 22d ago

Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Abstract QGF is an RL algorithm that improves policies at test time by using a value gradient to guide a pre-trained flow policy, avoiding training-time instability while maintaining competitive performance. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Expressive continuous…

31
r/LocalLLaMA community 22d ago

Fine-tuned Qwen2.5-7B to 96% of Claude Haiku on a domain-specific task using ~$3 of API calls and zero human labelers

Built a decision-reasoning engine (Orlog) and wanted to fine-tune a local model for it instead of paying per-call forever. The method (DV-DPO): Run a 3-voice council on each question, produce a synthesis Cross-examine: losing voices challenge the synthesis If synthesis gets…

35
Hugging Face Daily Papers research 22d ago

Robotic Policy Adaptation via Weight-Space Meta-Learning

Abstract WIZARD is a weight-space meta-learning framework that generates task-specific LoRA parameters for frozen VLA policies using language instructions and demonstration videos, enabling efficient task adaptation without fine-tuning. Generated by…

31
arXiv — Machine Learning research 23d ago

Shortcuts in the Tail: Debiasing via Post-Hoc Spectral Compression of Fine-Tuning Updates

arXiv:2606.07596v1 Announce Type: new Abstract: Fine-tuning often introduces spurious correlations alongside task knowledge, causing systematic failures on underrepresented groups. Existing mitigations require retraining, group labels, or curated counterfactual data. We show a…

21
arXiv — Machine Learning research 23d ago

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them

arXiv:2606.07597v1 Announce Type: new Abstract: Pre-training data mixtures are commonly tuned by running small-scale experiments and extrapolating to the target training budget. When high-quality data is scarce and must be repeated, this extrapolation frequently fails, but the…

23
arXiv — Machine Learning research 23d ago

DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment

arXiv:2606.07678v1 Announce Type: new Abstract: Safety alignment for large language models relies on preference data, but current pipelines often train on large, redundant datasets. Existing data selection methods typically score each preference pair independently, collapsing…

12
Hugging Face Daily Papers research 23d ago

On the Geometry of On-Policy Distillation

Abstract On-policy distillation exhibits distinct parameter space dynamics characterized by relaxed off-principal updates and subspace locking, forming a unique geometric pattern separate from supervised fine-tuning and reinforcement learning with verifiable rewards. Generated…

20
NVIDIA Developer Blog official-blog 23d ago

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

Pre-training frontier LLMs comes down to throughput. When training spans trillions of tokens across thousands of accelerators, every percentage point of step...

34
r/LocalLLaMA community 23d ago

Nex N2 has a funny "few words do trick" reasoning

I've been playing with Nex N2 Pro (Qwen 3.5 397B finetune) locally today. I noticed straight away that it has a pattern of reasoning that is distinct and uses simple words like "need" and "maybe" a lot. Here's a sample of reasoning. We need answer user asks "what is the theory…

16
Hugging Face Daily Papers research 23d ago

LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models

Abstract LayerRoute is a lightweight adapter that selectively skips transformer blocks during inference based on input type, achieving compute savings while maintaining or improving model quality through gated routing and LoRA adaptation. Generated by…

19
arXiv — Machine Learning research 24d ago

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

arXiv:2606.06920v1 Announce Type: new Abstract: Deploying Small Language Models (SLMs) on edge devices requires efficient fine-tuning strategies that adapt models to new tasks without degrading their general capabilities. In this study, we benchmark five sub-1B models (135M-1B)…

17
arXiv — NLP / Computation & Language research 24d ago

RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning

arXiv:2606.07006v1 Announce Type: cross Abstract: Supervised fine-tuning (SFT) is a prevailing method for adapting large language models to reasoning tasks by imitating offline expert demonstrations, often treating a single expert trajectory as the target behavior. However,…

15
arXiv — NLP / Computation & Language research 24d ago

What Do People Actually Want From AI? Mapping Preference Plurality

arXiv:2606.06674v1 Announce Type: new Abstract: Large Language Models (LLMs) are often fine-tuned through Reinforcement Learning from Human Feedback (RLHF) to align with people's preferences and values. However, this method has known limitations: it aggregates conflicting…

29
arXiv — NLP / Computation & Language research 24d ago

Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning

arXiv:2606.06835v1 Announce Type: new Abstract: The performance gap across languages in LLMs is well documented, and closing it natively requires pretraining or fine-tuning on corpora that, for most languages, do not exist. Translation offers an alternative: converting an input…

16
arXiv — NLP / Computation & Language research 24d ago

LLM-Guided Evolution for Medical Decision Pipelines

arXiv:2606.07342v1 Announce Type: new Abstract: Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP-Elites evolution as an inference-time alternative for discovering…

9
r/LocalLLaMA community 26d ago

Github Copilot finally supporting custom endpoints

https://preview.redd.it/082gnmin1l5h1.png?width=1740&format=png&auto=webp&s=2c89f6310c8c654611188183de07857d77cb2417 https://preview.redd.it/169tjrzn1l5h1.png?width=710&format=png&auto=webp&s=9a1fa656ea95037622b0d7ea2e16a23d2122442c I just noticed   submitted by  …

19
Hugging Face Daily Papers research 26d ago

Trust Region Q Adjoint Matching

Abstract Trust Region Q-Adjoint Matching (TRQAM) addresses instability in off-policy reinforcement learning by adaptively controlling path-space KL divergence through projected dual descent, enabling stable fine-tuning of pretrained flow policies. Generated by…

19
Hugging Face Daily Papers research 26d ago

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Abstract Code2LoRA is a hypernetwork framework that generates repository-specific LoRA adapters for code language models, supporting both static and evolving codebases with efficient parameter-efficient fine-tuning. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Code language…

16
arXiv — Machine Learning research 27d ago

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs

arXiv:2606.05516v1 Announce Type: new Abstract: Zeroth-order (ZO) optimization enables memory-efficient fine-tuning of large language models (LLMs) using only forward passes, but it remains unclear how useful adaptation is distributed across layers. In this work, we reveal a…

10
arXiv — Machine Learning research 27d ago

Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data

arXiv:2606.05781v1 Announce Type: new Abstract: Deploying frontier large language models (LLMs) for domain-specific structured evaluation tasks often incurs substantial latency, cost, and data privacy overhead. We present a hybrid framework that combines a fine-tuned small…

34
arXiv — Machine Learning research 27d ago

High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model

arXiv:2606.05899v1 Announce Type: new Abstract: We develop a high-dimensional statistical theory of low-rank adaptation (LoRA) in attention models, capturing the interplay between pre-training and fine-tuning. We introduce a solvable framework in which a single-head attention…

32
arXiv — Machine Learning research 27d ago

Steering Vectors are an Adversarial Attack Surface

arXiv:2606.05958v1 Announce Type: new Abstract: Activation steering has become a popular way to control Large Language Model (LLM) behavior without fine-tuning. Since the technique is plug-and-play, users share datasets and precomputed vectors to steer model activations.…

25
arXiv — NLP / Computation & Language research 27d ago

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

arXiv:2606.05173v1 Announce Type: new Abstract: Masked language modelling (MLM) has been the dominant pre-training objective for text encoders since BERT, yet it encourages representations that are strongly anchored to surface-form token identity rather than deeper semantic…

22
arXiv — NLP / Computation & Language research 27d ago

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

arXiv:2606.05330v1 Announce Type: new Abstract: Large language models can shift human beliefs across high-stakes domains, but most persuasion studies rely on pre/post belief change. These endpoint measures identify whether persuasion occurred, yet miss where and how beliefs…

24
arXiv — NLP / Computation & Language research 27d ago

Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training

arXiv:2606.05610v1 Announce Type: new Abstract: The efficacy of continued pre-training for Large Language Models (LLMs) hinges upon hyperparameter configurations, such as learning rate and batch size. However, current practices often rely on heuristics or grid searches, leading…

8
Hugging Face Daily Papers research 27d ago

Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning

Abstract Stable-Layers uses reinforcement learning with vision-language model feedback to improve layer decomposition without paired data, employing Flow-GRPO and LoRA adaptation for optimized policy training. Generated by Qwen/Qwen2.5-Coder-32B-Instruct We present…

38
arXiv — Machine Learning research 28d ago

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

arXiv:2606.04145v1 Announce Type: new Abstract: Cloud LLM fine-tuning platforms increasingly serve RLHF workloads, where a learned reward model is optimized as a proxy for human quality. As Gao et al. (2023) showed, this proxy diverges from world feedback (downstream eval…

24
arXiv — Machine Learning research 28d ago

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

arXiv:2606.04164v1 Announce Type: new Abstract: Data samples used for training often differ from those encountered during fine-tuning and deployment, and while ML models show promise, their performance remains limited when only small annotated datasets are available. Performance…

10
arXiv — Machine Learning research 28d ago

When Autoregressive Consistency Hurts Safety Alignment

arXiv:2606.04168v1 Announce Type: new Abstract: Safety alignment in large language models (LLMs) is fragile in part because it is often shallow: fine-tuning mainly reshapes the model's behavior near the first few output tokens. We argue that this phenomenon can be understood…

21

Detecting Sensitive Personal Information in Japanese Pre-Training Corpora for Large Language Models

ALIGNBEAM : Inference-Time Alignment Transfer via Cross-Vocabulary Logit Mixing

Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay

Pyrecall open source tool for detecting catastrophic forgetting during LLM fine-tuning[P]

SenseNova U1 dropped an infographic-specific finetune

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

A Unified Adaptive Feature Composition Framework for Multi-Task Generalization in Wireless Foundation Models

Revisiting Positive Samples in Graph Contrastive Learning: From the Perspective of Message Passing

CodeAlchemy: Synthetic Code Rewriting at Scale

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

Hidden Consensus:Preference-Validity Compression in Human Feedback

Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning

Speaker Group Encoding in Self-supervised Speech Recognition Models

Attention Expansion: Enhancing Keyphrase Extraction from Long Documents with Attention-Augmented Contextualized Embeddings

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

Advancing the State-of-the-Art in Empirical Privacy Auditing

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Fine-tuned Qwen2.5-7B to 96% of Claude Haiku on a domain-specific task using ~$3 of API calls and zero human labelers

Robotic Policy Adaptation via Weight-Space Meta-Learning

Shortcuts in the Tail: Debiasing via Post-Hoc Spectral Compression of Fine-Tuning Updates

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them

DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment

On the Geometry of On-Policy Distillation

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

Nex N2 has a funny "few words do trick" reasoning

LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning

What Do People Actually Want From AI? Mapping Preference Plurality

Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning

LLM-Guided Evolution for Medical Decision Pipelines

Github Copilot finally supporting custom endpoints

Trust Region Q Adjoint Matching

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs

Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data

High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model

Steering Vectors are an Adversarial Attack Surface

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training

Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

When Autoregressive Consistency Hurts Safety Alignment