Tag Security 500 articles archived under #security r/LocalLLaMA community 3h ago SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing. I’m pretty jaded like most of y’all. I don’t really get excited by new models much anymore. Last few weeks have been kinda meh to be honest. Monday, I stumbled upon SenseNova’s Mixture of Transformers models and they seem kinda like a different animal than other typical image… 4 arXiv — NLP / Computation & Language research 4h ago TRACE: State-Aware Query Processing over Temporal Evidence Graphs for Conversational Data arXiv:2607.00339v1 Announce Type: new Abstract: Conversational data is increasingly used as a persistent source of user state for long-running assistants and AI agents. However, querying this data remains challenging because conversations naturally evolve: plans are revised,… 8 arXiv — NLP / Computation & Language research 4h ago A Mechanistic View of Authority Hierarchy in LLM Sycophancy arXiv:2607.00415v1 Announce Type: new Abstract: Authority bias poses a critical safety concern in language models: models systematically prioritize social cues from authority figures over factual consistency, swaying their answers based on source credibility rather than… 17 arXiv — NLP / Computation & Language research 4h ago The Course of News Events: A Comparison of Bottom-Up and Top-Down Approaches for Collecting Text-Based Data about Disasters arXiv:2607.00849v1 Announce Type: new Abstract: News articles are an important source of information on disaster impacts and adaptation. A key methodological challenge in socio-environmental studies is how to select a representative data sample. 
Two approaches are common:… 35 arXiv — NLP / Computation & Language research 4h ago Beyond Document Grounding: Span-Level Hallucination Detection over Code, Tool Output, and Documents arXiv:2607.00895v1 Announce Type: new Abstract: Hallucination detection for retrieval-augmented generation (RAG) is usually evaluated on natural-language document evidence. However, grounded generation systems increasingly rely on structured inputs: source code, developer-tool… 14 arXiv — NLP / Computation & Language research 4h ago LuxIT: A Luxembourgish Instruction Tuning Dataset from Monolingual Seed Data arXiv:2510.24434v3 Announce Type: replace Abstract: The effectiveness of instruction-tuned Large Language Models (LLMs) is often limited in low-resource linguistic settings due to a lack of high-quality training data. We introduce LuxIT, a novel, monolingual instruction tuning… 10 r/MachineLearning community 7h ago Making Optimization Work When Labels Are Scarce [R] https://www.gnosyslabs.com/case-studies/safety-classifier-sparse-labels Gnosys is an autonomous model engineer: it improves prompts and classifiers when ground truth is too sparse for conventional optimization. On ToxicChat, a public safety benchmark, under realistic label… 23 Hacker News — AI on Front Page community 7h ago Oomwoo, an open-source robot vacuum you build yourself Article URL: https://makerspet.com/blog/building-an-open-source-robot-vacuum-meet-oomwoo/ Comments URL: https://news.ycombinator.com/item?id=48755005 Points: 241 # Comments: 41 37 r/LocalLLaMA community 14h ago Plurality Released: fully Free and Open Source AI agents/chatbot platform for local AI Hello everyone! Some of you might recognize my user from the work I have done on Cosmos Cloud, but today I am here to talk to you about an entirely different project: Plurality. 
https://github.com/azukaar/plurality Plurality has been in development for a bit more than a year and… 22 Hacker News — AI on Front Page community 18h ago Monetization Gateway: Charge for any resource behind Cloudflare via x402 Article URL: https://blog.cloudflare.com/monetization-gateway/ Comments URL: https://news.ycombinator.com/item?id=48746914 Points: 237 # Comments: 144 28 Hacker News — AI on Front Page community 19h ago Box3D, an open source 3D physics engine Article URL: https://box2d.org/posts/2026/06/announcing-box3d/ Comments URL: https://news.ycombinator.com/item?id=48745445 Points: 246 # Comments: 47 12 r/LocalLLaMA community 21h ago Thinking about grabbing 4x Ascend GX10s Some in this sub have tested GLM5.2 on 4x DGX Sparks (or Ascend GX10) with 400-500 tok/s prompt processing and ~15 tok/s output at 128k context. Not blazing fast, but usable imo, especially with quantization. My thinking: If there's an open-source fable 5 sometime in december or… 20 r/MachineLearning community 22h ago A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P] Prompt injection has emerged as one of the most persistent failure modes in tool-using LLM systems, particularly in agentic workflows where models interact with external data sources. Most mitigation strategies focus on input filtering or model-side alignment, but these… 9 arXiv — Machine Learning research 1d ago Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization arXiv:2606.30699v1 Announce Type: new Abstract: Discovering governing equations directly from observational data is a key step towards interpretable scientific machine learning. 
Current data-driven approaches typically operate on a single dataset, inherently limiting their… 38 arXiv — Machine Learning research 1d ago Teaching LLMs to Recommend and Defer in Underrepresented Epilepsy Care arXiv:2606.31036v1 Announce Type: new Abstract: Specialist epilepsy expertise is scarce in resource-constrained settings, making LLM-based decision support attractive for frontline clinicians managing longitudinal treatment. Such systems must adapt to local prescribing practice… 12 arXiv — Machine Learning research 1d ago Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images arXiv:2606.31394v1 Announce Type: new Abstract: Artificial intelligence is transforming our capability to solve biological challenges. In dimensionality bottleneck regimes exacerbated by high-dimensional biological data, Neural networks force distinct concepts into the lower… 14 arXiv — NLP / Computation & Language research 1d ago Beyond Clean Text: Evaluating Encoder and Decoder Robustness for Bangla Event Detection in Noisy Text arXiv:2606.30914v1 Announce Type: new Abstract: Event detection (ED) systems are typically evaluated on clean, curated text, leaving their robustness to real-world noise largely unexplored, particularly for low-resource languages such as Bangla. We introduce a generalized Bangla… 17 arXiv — NLP / Computation & Language research 1d ago Building an ASR Solution for Training and Assessing Children's Reading arXiv:2606.31508v1 Announce Type: new Abstract: Automatic speech recognition for children's reading remains underdeveloped for most African languages, including Bambara, despite its potential value for reproducible literacy assessment. 
We present an open-source system for… 30 arXiv — NLP / Computation & Language research 1d ago Tone-Conditioned Curriculum Learning for Low-Resource Bantu Speech Recognition arXiv:2606.31642v1 Announce Type: new Abstract: Southern Bantu languages are spoken by over 80 million people, yet current foundation ASR models still produce zero-shot WER above 100%, which limits practical use in education and public services. We addressed this gap with a tone… 18 arXiv — NLP / Computation & Language research 1d ago Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian arXiv:2606.31718v1 Announce Type: new Abstract: Relation extraction (RE) for low-resource languages is typically constrained by the lack of annotated corpora. We investigate the feasibility of cross-lingual RE for Romanian by combining automatic dataset translation with large… 38 arXiv — NLP / Computation & Language research 1d ago LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish arXiv:2606.31947v1 Announce Type: new Abstract: State-of-the-art speech datasets predominantly focus on widely spoken languages, often overlooking low-resource languages such as Luxembourgish, which remain underrepresented in speech technology research. In this work, we… 25 Vercel — AI dev-tools 1d ago Enforce consistent code for agents and humans with konsistent konsistent is now open source. konsistent is a CLI linter for TypeScript codebases that enforces structural conventions, giving agents and humans the consistent context they need to implement features correctly. Deterministic, fast, and covers structural patterns that TypeScript… 6 Simon Willison community 1d ago Quoting Anthropic We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will share an update soon. 
— Anthropic, on Twitter Tags: anthropic, claude, generative-ai, claude-mythos, ai,… 34 Hacker News — AI on Front Page community 1d ago Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 Article URL: https://twitter.com/AnthropicAI/status/2072106151890809341 Comments URL: https://news.ycombinator.com/item?id=48740771 Points: 232 # Comments: 90 6 TechCrunch — AI news-outlet 1d ago OpenClaw is finally available on Android and iOS The free open source agentic program is finally invading your phone. 21 r/LocalLLaMA community 1d ago HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization (from the Qwen team) The quadratic complexity of attention poses a critical bottleneck for long-context processing, spurring interest in hybrid attention designs. Most open-source hybrid models adopt a layer-wise strategy. Yet, prior work has noted the inherent difficulty of integrating Linear… 9 r/LocalLLaMA community 1d ago Norm-preserving abliteration on Qwen3.6-35B-A3B: 0% refusal, benchmarks intact, open source dataset Been reading the mechanistic interpretability literature on refusal for a while now. The core insight from Arditi et al. (2024) is clean: refusal is mediated by a geometrically consistent direction in the residual stream. You can find it via the difference of means between… 4 arXiv — Machine Learning research 2d ago Modification-Considering Value Learning for Reward Hacking Mitigation in RL arXiv:2606.28955v1 Announce Type: new Abstract: Reinforcement learning agents can exploit misspecified reward signals to achieve high apparent returns while failing on the intended objective, a failure mode known as reward hacking. 
Existing practical defenses typically constrain… 10 arXiv — Machine Learning research 2d ago Depth Exploration for LLM Decoding arXiv:2606.29223v1 Announce Type: new Abstract: Autoregressive LLM decoding evaluates every generated token through the full layer stack, even though many tokens become predictable at intermediate depths. Existing lossless depth-adaptive methods exploit this redundancy by… 34 arXiv — Machine Learning research 2d ago KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory arXiv:2606.29243v1 Announce Type: new Abstract: We present KrishokChat, the first citation-grounded Bengali agricultural instruction-tuning dataset for crop advisory in low-resource settings. We establish a foundation of 290 hierarchical Knowledge Nodes, extracting disease… 30 arXiv — Machine Learning research 2d ago Nonlinear mixture model motivated subspace clustering arXiv:2606.29261v1 Announce Type: new Abstract: We derive the linear union-of-subspaces (UoS) model for subspace clustering (SC) from the nonlinear mixture model (NMM) used in blind source separation (BSS) to represent a D-dimensional observation vector as an unknown… 7 arXiv — Machine Learning research 2d ago Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise arXiv:2606.29554v1 Announce Type: new Abstract: Shuffle order can be a larger source of fine-tuning noise than a memoryless analysis predicts: fixed-clock optimizer memory makes local equal-multiset contrasts first order in the learning rate rather than second order, and the… 8 arXiv — NLP / Computation & Language research 2d ago SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages arXiv:2606.28715v1 Announce Type: new Abstract: While AI development and evaluation for Southeast Asia (SEA) has grown rapidly, agent capabilities in regional languages are still poorly understood despite its importance to sovereign AI. 
To fill this gap, we introduce… 28 arXiv — NLP / Computation & Language research 2d ago Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages arXiv:2606.28867v1 Announce Type: new Abstract: Creative Commons licenses dominate African NLP corpus releases, but their compatibility rules are rarely applied. CC-BY-SA and CC-BY-NC cannot be combined in a single published dataset; a NoDerivs clause silently prohibits… 28 arXiv — NLP / Computation & Language research 2d ago FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization arXiv:2606.28933v1 Announce Type: new Abstract: Venture capital (VC) investment decisions face distinct challenges, such as multi-source heterogeneous data, non-stationary time series, and the demand for explainable predictions in high-stakes, low-data settings. To overcome… 16 arXiv — NLP / Computation & Language research 2d ago A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions arXiv:2606.28943v1 Announce Type: new Abstract: Learning to bid in repeated multi-unit auctions with bandit feedback poses a fundamental challenge. 
Existing methods often rely on rigid explore-then-exploit schedules, assume stationary adversaries, and optimize solely for bidder… 9 arXiv — NLP / Computation & Language research 2d ago To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation arXiv:2606.29481v1 Announce Type: new Abstract: While reinforcement learning (RL) significantly enhances LLM reasoning, its efficacy is severely undermined by Pre-RL data overlap, where RL datasets overlap with pretraining or SFT corpora, causing models to exploit shortcuts by… 11 arXiv — NLP / Computation & Language research 2d ago SrDetection: A Self-Referential Framework for Data Leakage Detection in Code Large Language Models arXiv:2606.29815v1 Announce Type: new Abstract: Evaluating code large language models (Code LLMs) requires reliable detection of data leakage, where benchmark performance is artificially inflated by exposure to benchmark data during pre-training. Existing approaches either… 7 arXiv — NLP / Computation & Language research 2d ago IHDec: Divergence-Steered Contrastive Decoding for Securing Multi-Turn Instruction Hierarchies arXiv:2606.29960v1 Announce Type: new Abstract: Large Language Models (LLMs) often fail to maintain instruction hierarchies (IH) when processing multi-source inputs with varying role-level priorities, paradoxically adhering to lower-priority directives during conflicts. While… 29 Vercel — AI dev-tools 2d ago Vercel and Shopify are rebuilding Hydrogen Hydrogen made headless storefronts easy to ship, but not portable. At Vercel Ship 26 in New York, we announced that we are working with Shopify to rebuild it from the ground up, a shared bet on a more open web. 
The new version is open source and runtime agnostic, meaning it can… 31 Hugging Face Daily Papers research 2d ago ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models Abstract ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection through interactive hierarchies and automated auditing.… 21 r/LocalLLaMA community 2d ago I Hate Dario Amodei, and everything he stands for. I am so incredibly sick of this guy’s fear mongering about open source while fundamentally misunderstanding how it actually works. He recently dropped some arguments that are so completely detached from reality, it honestly feels like he’s never even touched a local model in his… 31 r/LocalLLaMA community 2d ago Amodei: "Open Source Models Will Eat Your Children" submitted by /u/johnnyApplePRNG 35 Hacker News — AI on Front Page community 2d ago Ornith-1.0: self-improving open-source models for agentic coding Article URL: https://github.com/deepreinforce-ai/Ornith-1 Comments URL: https://news.ycombinator.com/item?id=48722052 Points: 215 # Comments: 39 31 r/LocalLLaMA community 2d ago What's the full local AI "doomsday prepper" kit for cold storage? 16-bit safetensors of LLMs (obv), copies/source codes of Llama.cpp, ComfyUI, vLLM, Kobold, LMStudio, etc, macOS, Linux OSes, Windows 10&11, etc, Rufus (including older ones), various VMs, P-E-W's Heretic/Grimoire,… For those who want to be as paranoid and maximally doomsday prepped as possible, I am curious what the most thorough "doomsday kit" is of things to store offline copies of "just in case", to still be able to use local AI if things go truly crazy to a super extreme level. So far… 23 r/LocalLLaMA community 2d ago Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."
submitted by /u/johnnyApplePRNG 4 Vercel — AI dev-tools 2d ago Vercel Open Source Program: Spring 2026 cohort The world runs on open source software. The frameworks, libraries, and tools we rely on are made possible by communities that share ideas and build in the open. At Vercel, we want to help those communities thrive. That’s why we run the Vercel Open Source Program: a developer… 24 OpenAI official-blog 3d ago Mapping Europe’s AI Workforce Opportunity A new OpenAI report maps how AI could reshape jobs across the EU, highlighting which occupations may face automation, growth, or workflow changes. 32 arXiv — Machine Learning research 3d ago Unified Zero-Shot Time Series Forecasting: A Darts Foundation arXiv:2606.27438v1 Announce Type: new Abstract: Since its initial release in 2020, Darts has become a widely used open-source Python library for time series analysis. A series of foundation models have recently claimed accuracy improvements in zero-shot forecasting, promising a… 15 arXiv — Machine Learning research 3d ago Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte arXiv:2606.28220v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving nonlinear partial differential equations (PDEs), including battery electrochemical models. They typically enforce conservation laws within the… 15