News / #rag Tag Rag 500 articles archived under #rag · RSS Sign in to follow arXiv — NLP / Computation & Language research 21d ago An Ontology-Guided Multi-Anchor Graph Retrieval Framework for Traffic Legal Liability Determination arXiv:2606.11910v1 Announce Type: new Abstract: Traffic law liability determination is critical for assigning legal penalties, requiring the simultaneous identification of interdependent statutory provisions across multiple legal dimensions. However, existing retrieval-augmented… 28 arXiv — NLP / Computation & Language research 21d ago uva-irlab-conv at SemEval-2026 Task 8: Multi-Turn RAG with Learned Sparse Retrieval and Listwise Reranking arXiv:2606.11945v1 Announce Type: new Abstract: This report describes our participation in SemEval-2026 Task 8 on multi-turn retrieval and question answering. The task evaluates conversational systems across four domains (finance, cloud documentation, government, Wikipedia), and… 28 arXiv — NLP / Computation & Language research 21d ago Augmenting Molecular Language Models with Local $n$-gram Memory arXiv:2606.12113v1 Announce Type: new Abstract: Transformer-based language models for SMILES strings suffer from a locality gap: standard character-level tokenization fragments chemically meaningful motifs, forcing models to repeatedly learn local syntax at the expense of… 34 arXiv — NLP / Computation & Language research 21d ago Measuring Epistemic Resilience of LLMs Under Misleading Medical Context arXiv:2606.12291v1 Announce Type: new Abstract: Large language models (LLMs) now reach expert-level scores on medical licensing exams, encouraging the assumption that high scores imply safe medical judgment while patients increasingly use them for health advice. We show this… 14 TechCrunch — AI news-outlet 22d ago How memory tools can make AI models worse New research suggests that AI memory systems can degrade model performance and encourage sycophantic tendencies. 28 NVIDIA Developer Blog official-blog 22d ago Designing Production-Ready Battery Energy Storage Systems for AI Factories AI factories are changing what data-center infrastructure must do. Unlike traditional data centers, AI factories are built to manufacture intelligence at scale.... 29 llama.cpp releases dev-tools 22d ago b9589 CUDA: Fix ssm_scan_f32 data-races ( #24360 ) Add missing syncthreads before resuing cub_temp_storage __syncthreads() is required before being allowed to resue TempStorage smem:… 32 arXiv — Machine Learning research 22d ago Conformal Risk Prediction for Non-Alcoholic Fatty Liver Disease Using Gradient Boosting with Distribution-Free Coverages arXiv:2606.09860v1 Announce Type: new Abstract: Non-alcoholic fatty liver disease (NAFLD) affects roughly 25% of global adults, posing substantial hepatic and cardiovascular risks. Yet, population-level screening tools remain inadequate. We present Method, a machine-learning… 4 arXiv — Machine Learning research 22d ago Hyperparameter Learning for Latent Factorization of Tensors for Representation Learning to Large-scale Dynamic Weighted Directed Network arXiv:2606.09880v1 Announce Type: new Abstract: Large-scale dynamic weighted directed networks (DWDNs) are widely used to model time-varying interactions among nodes. Latent factorization of tensors (LFT) extracts target knowledge from DWDNs via low-rank embedding. However,… 27 arXiv — Machine Learning research 22d ago Integrating Out, Twice:The Open-System Case That Neural-Network Ensemble Theory Is Missing arXiv:2606.09950v1 Announce Type: new Abstract: Averaging a neural network over its random parameters and marginalizing a Gaussian sector are the same operation, the Schur complement of the eliminated block, and when that block is closed it returns a covariance and its inverse.… 25 arXiv — Machine Learning research 22d ago Compositional Generative Modeling from Decentralized Data arXiv:2606.10153v1 Announce Type: new Abstract: Learning the compositional nature of the physical world requires joint observation of interacting factors. However, because practical data is often decentralized, these factors are fragmented across isolated silos. Existing… 33 arXiv — Machine Learning research 22d ago DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction arXiv:2606.10243v1 Announce Type: new Abstract: Offsite conversion rate (OCVR) prediction is an important ranking problem in computational recommendation systems. This task presents a modeling challenge: click signals are abundant and exhibit short temporal horizons, whereas… 25 arXiv — NLP / Computation & Language research 22d ago MIRAGE: A Polarity-Flipping Encoding Subspace in LLM Agents arXiv:2606.10304v1 Announce Type: new Abstract: When LLM agents are coerced into covertly encoding sensitive data (Base64, ROT13, acrostic, synonym chains, and beyond), the resulting outputs evade output-side detection but the underlying computation does not. Across nine… 37 arXiv — NLP / Computation & Language research 22d ago Attention Expansion: Enhancing Keyphrase Extraction from Long Documents with Attention-Augmented Contextualized Embeddings arXiv:2606.10716v1 Announce Type: new Abstract: Pre-trained language models (PLMs) have achieved strong performance in keyphrase extraction (KPE), largely due to their ability to generate rich contextualized representations. However, long-document KPE remains challenging because… 30 arXiv — NLP / Computation & Language research 22d ago Attention-Discounted Adaptive Sampler for Masked Diffusion Language Models arXiv:2606.10829v1 Announce Type: new Abstract: Masked diffusion language models can reduce inference steps by revealing multiple tokens per denoising iteration, but this parallelism is fragile: positions that are individually confident may be unsafe to commit together when… 18 arXiv — NLP / Computation & Language research 22d ago Agentic Hybrid RAG for Evidence-Grounded Muon Collider Analysis arXiv:2606.10381v1 Announce Type: cross Abstract: Muon collider research spans accelerator physics, detector instrumentation, and high-energy phenomenology, with relevant evidence scattered across a rapidly expanding and heterogeneous body of scientific literature. As… 37 arXiv — NLP / Computation & Language research 22d ago Leveraging Social Media Data for COVID-19 Studies arXiv:2606.10459v1 Announce Type: cross Abstract: Nowadays, social media networks have become widely preferred sources of information. Especially during the time of the Coronavirus disease 2019 COVID 19 pandemic, social media has been one of the most used platforms to get the… 20 arXiv — NLP / Computation & Language research 22d ago Infini Memory: Maintainable Topic Documents for Long-Term LLM Agent Memory arXiv:2606.10677v1 Announce Type: cross Abstract: Long-term LLM agents need persistent memory that can track changing facts and provide relevant evidence across sessions. Existing memory systems often store observations as isolated records, summaries, or indexed fragments, which… 20 Hugging Face Daily Papers research 22d ago One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA Abstract Latent Memory introduces a compressed representation approach for external memory in question answering, reducing token consumption and storage requirements while maintaining competitive performance across text-only and multimodal benchmarks. Generated by… 28 Hugging Face Daily Papers research 22d ago Precision Is Not Faithfulness: Coverage-Aware Evaluation of Grounded Generation with a Complete Oracle Abstract Reference-free faithfulness metrics suffer from a blind spot measuring only precision, leading to rewards for abstention; completeness in deterministic domains enables measurement of both precision and recall, revealing that high-precision models often have poor fact… 34 llama.cpp releases dev-tools 23d ago b9585 graph: Fix granite speech model inference by applying embedding scale when deepstack is not used ( #24357 ) llama-graph : apply embedding scale when deepstack is not used nits: remove non-existant hunyuan-vl from the tests apply suggestion from @gabe-l-hart Co-authored-by: Xuan… 25 Hugging Face Daily Papers research 23d ago SDR: Set-Distance Rewards for Radiology Report Generation Abstract Set-based rewards using embedding distances improve chest X-ray report generation by enabling effective post-training and test-time selection without requiring causal reasoning structures. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Reinforcement learning with… 14 r/LocalLLaMA community 23d ago Still a VERY lightweight open web-search tool for smaller local LLMs - now with SearXNG support Hey everyone, TinySearch v0.2.0 (first stable beta) is out. The first version used DuckDuckGo directly, which worked well enough to prove the idea, but yeah.. relying on one search source was way too fragile lol. DDG started throwing limits/CAPTCHAs more often in the last 2… 25 Hugging Face Daily Papers research 23d ago Text-to-Image Models Need Less from Text Encoders Than You Think Abstract Text-to-image models primarily utilize basic text representation aspects like word merging and order rather than complex contextual information encoded in full text embeddings. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Text-to-image models rely on text prompts as… 36 Hugging Face Daily Papers research 23d ago Answer Presence Drives RAG Rewriting Gains Abstract Controlled interventions reveal that gold answer presence in rewritten contexts significantly boosts QA performance, with removal causing substantial F1 drops and injection improving results, while conventional probing methods show fragility to sentinel changes.… 35 Hugging Face Daily Papers research 23d ago Trajectory-Refined Distillation Abstract On-policy distillation suffers from prefix failure where dense token-level supervision creates fragmented gradients; trajectory-refined distillation addresses this by correcting student rollouts at the trajectory level before distillation. Generated by… 37 arXiv — Machine Learning research 23d ago UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning arXiv:2606.07592v1 Announce Type: new Abstract: Offline reinforcement learning requires careful conservatism to mitigate distribution shift, yet most existing methods apply a fixed penalty uniformly across all states regardless of local data coverage. We present UNIQ… 5 arXiv — Machine Learning research 23d ago A Topological Characterization of Graph Neural Networks via Stochastic Block Model Embeddings on the n-Sphere arXiv:2606.07598v1 Announce Type: new Abstract: We propose a topological framework for comparing trained Graph Neural Networks (GNNs) by mapping the Stochastic Block Models (SBMs) induced on the graphon-signal space of a Message Passing Neural Network (MPNN) onto the unit… 15 arXiv — Machine Learning research 23d ago Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences arXiv:2606.07629v1 Announce Type: new Abstract: Current approaches to aligning large language models (LLMs) aggregate diverse human preferences into a single reward signal, effectively optimizing for a hypothetical ``average user'' who represents no real person particularly… 10 arXiv — Machine Learning research 23d ago Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling arXiv:2606.07898v1 Announce Type: new Abstract: High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key… 16 arXiv — Machine Learning research 23d ago Minibatch Selection via Partition Matroid Constrained Gradient Matching arXiv:2606.07954v1 Announce Type: new Abstract: Training large language models (LLMs) on heterogeneous data requires selecting minibatches that balance convergence speed with coverage across domains. Existing methods either select samples independently within each domain or rely… 5 arXiv — Machine Learning research 23d ago CausShield: Sample Reconstruction-Resilient Vertical FL via Causal Representation Learning arXiv:2606.08027v1 Announce Type: new Abstract: Vertical federated learning (VFL) is a distributed learning paradigm that leverages vertically partitioned features across isolated parties without sharing raw samples; however, it remains vulnerable to active sample reconstruction… 38 llama.cpp releases dev-tools 23d ago b9568 mtp: support for gemma-4 E2B and E4B assistants ( #24282 ) models: update converter to support smaller assistants models: add masked_embd tensors to gemma4-assist arch gemma-4: remove temp debug for conversion gemma-4-mtp: filter out masked_embedding tensors during conversion… 23 r/MachineLearning community 24d ago Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D] I've been building agents for about a year and recently shipped one for a client running ~140 MCP-exposed tools at peak. Along the way I made the canonical mistake. I used cosine similarity over tool description embeddings to pick which tools the model could see per turn. Worked… 9 Hugging Face Daily Papers research 24d ago GENEB: Why Genomic Models Are Hard to Compare Abstract GENEB presents a comprehensive benchmark for evaluating genomic foundation models across diverse tasks and architectures under a unified protocol. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Progress in genomic foundation models is difficult to assess due to fragmented… 25 arXiv — Machine Learning research 24d ago FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models arXiv:2606.06547v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) refine tokens iteratively but commit them irreversibly, leading to a "stability lag" where early decisions remain fragile even after being written. We reveal that Post-Training Quantization… 21 arXiv — Machine Learning research 24d ago PandaAI: A Practical Agent CQ2 for Neuro-symbolic Data Analysis And Integrated Decision-Making in Quantitative Finance arXiv:2606.06823v1 Announce Type: new Abstract: While deep learning has excelled in various domains, its application to sequential decision-making in finance remains challenging due to the low Signal-to-Noise Ratio (SNR) and non-stationarity of financial data. Leveraging the… 20 arXiv — Machine Learning research 24d ago Accelerating Multi-Objective Bayesian Optimisation via Predictive-Gradient Catalysts arXiv:2606.06984v1 Announce Type: new Abstract: This paper presents a general acceleration mechanism for multi-objective Bayesian optimisation (MOBO) that leverages Gaussian process predictive gradients as auxiliary signals. Rather than replacing existing Pareto-compliant… 7 arXiv — Machine Learning research 24d ago Closed-Form Spectral Regularization for Multi-Task Model Merging arXiv:2606.07289v1 Announce Type: new Abstract: Model merging combines several independently fine-tuned experts into a single multi-task model without any training data, reducing the storage, serving, and decentralized-development costs of large foundation models.… 38 arXiv — Machine Learning research 24d ago Bootstrap Theory of Representational Emergence: Explanatory Insufficiency as a Driver of Representation Learning and World Models arXiv:2606.07303v1 Announce Type: new Abstract: Representation learning is central to modern machine learning, enabling transitions from handcrafted features to learned embeddings, latent spaces, foundation models, world models, and digital twins. Yet most research examines how… 29 arXiv — Machine Learning research 24d ago Graph Neural Network leveraging Higher-order Class Label Connectivity for Heterophilous Graphs arXiv:2606.07475v1 Announce Type: new Abstract: Node classification in graph neural networks (GNNs) has been widely applied in various fields of graph analysis. GNNs achieve high-accuracy node classification in homophilous graphs, where nodes with the same class label tend to be… 32 arXiv — NLP / Computation & Language research 24d ago Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection arXiv:2606.06748v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) reduces but does not eliminate hallucination in large language models. Existing detection methods rely on flat similarity between generated answers and retrieved passages, ignoring structural… 36 arXiv — NLP / Computation & Language research 24d ago A Four-Condition Diagnostic Protocol for Evidence Utilization in Long-Context and Retrieval-Augmented Language Models arXiv:2606.06758v1 Announce Type: new Abstract: Final-answer accuracy, retrieval recall, and citation overlap do not by themselves identify whether a long-context or retrieval-augmented language model used the evidence it was given. A model can answer from parametric memory,… 21 arXiv — NLP / Computation & Language research 24d ago TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication arXiv:2606.06794v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) successfully grounds large language model (LLM) outputs in trusted documents, but factual grounding alone is insufficient for sensitive peer-support health communication. In domains such as HIV… 25 arXiv — NLP / Computation & Language research 24d ago Didact: A Cross-Domain Capability Discovery System for Defence arXiv:2606.06942v1 Announce Type: new Abstract: Policymakers in defence and defence-aligned sectors must monitor rapidly evolving research alongside sector priorities relevant to operational and strategic needs. In practice, these sources are fragmented across heterogeneous… 5 arXiv — NLP / Computation & Language research 24d ago OpenHalDet: A Unified Benchmark for Hallucination Detection across Diverse Generation Scenarios arXiv:2606.06959v1 Announce Type: new Abstract: Hallucination detection is essential for the reliable deployment of large language models (LLMs). However, existing evaluations face two core challenges: inconsistent inference configuration and evaluation, and limited coverage of… 5 arXiv — NLP / Computation & Language research 24d ago Modeling semantic association in self-paced reading with language model embeddings arXiv:2606.07066v1 Announce Type: new Abstract: Semantic association between a word and its context has been identified as an important component of reading comprehension, even when word predictability is accounted for. Recent research has highlighted the potential of language… 36 arXiv — NLP / Computation & Language research 24d ago Learning Perspectivist Social Meaning via Demographic-Conditioned Fusion Embeddings arXiv:2606.07123v1 Announce Type: new Abstract: Social meaning in language is inherently perspectival, varying across annotator backgrounds, demographics, and ideological positions. However, most NLP systems collapse this variation into a single ground-truth label, ignoring the… 15 arXiv — NLP / Computation & Language research 24d ago Geometry of Semantic Space: Comparative Study of Discrete and Continuous Models arXiv:2606.07183v1 Announce Type: new Abstract: This work examines the semantic geometry underlying NLP models. We compare supervised vector embeddings, such as CamemBERT, with lexical co-occurrence graphs that encode semantic relations more directly. While transformer-based… 12 arXiv — NLP / Computation & Language research 24d ago Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings arXiv:2606.07502v1 Announce Type: new Abstract: Large language models exhibit impressive zero-shot capabilities across a wide range of downstream tasks. However, they struggle to function as off-the-shelf embedding models, leading to suboptimal performance on massive text… 15 Page 7 of 10 · 500 articles ← Newer Older →