Tag

Security

500 articles archived under #security · RSS

r/LocalLLaMA community 3h ago

SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.

I’m pretty jaded like most of y’all. I don’t really get excited by new models much anymore. Last few weeks have been kinda meh to be honest. Monday, I stumbled upon SenseNova’s Mixture of Transformers models and they seem kinda like a different animal than other typical image…

4
arXiv — NLP / Computation & Language research 4h ago

TRACE: State-Aware Query Processing over Temporal Evidence Graphs for Conversational Data

arXiv:2607.00339v1 Announce Type: new Abstract: Conversational data is increasingly used as a persistent source of user state for long-running assistants and AI agents. However, querying this data remains challenging because conversations naturally evolve: plans are revised,…

8
arXiv — NLP / Computation & Language research 4h ago

A Mechanistic View of Authority Hierarchy in LLM Sycophancy

arXiv:2607.00415v1 Announce Type: new Abstract: Authority bias poses a critical safety concern in language models: models systematically prioritize social cues from authority figures over factual consistency, swaying their answers based on source credibility rather than…

17
arXiv — NLP / Computation & Language research 4h ago

The Course of News Events: A Comparison of Bottom-Up and Top-Down Approaches for Collecting Text-Based Data about Disasters

arXiv:2607.00849v1 Announce Type: new Abstract: News articles are an important source of information on disaster impacts and adaptation. A key methodological challenge in socio-environmental studies is how to select a representative data sample. Two approaches are common:…

35
arXiv — NLP / Computation & Language research 4h ago

Beyond Document Grounding: Span-Level Hallucination Detection over Code, Tool Output, and Documents

arXiv:2607.00895v1 Announce Type: new Abstract: Hallucination detection for retrieval-augmented generation (RAG) is usually evaluated on natural-language document evidence. However, grounded generation systems increasingly rely on structured inputs: source code, developer-tool…

14
arXiv — NLP / Computation & Language research 4h ago

LuxIT: A Luxembourgish Instruction Tuning Dataset from Monolingual Seed Data

arXiv:2510.24434v3 Announce Type: replace Abstract: The effectiveness of instruction-tuned Large Language Models (LLMs) is often limited in low-resource linguistic settings due to a lack of high-quality training data. We introduce LuxIT, a novel, monolingual instruction tuning…

10
r/MachineLearning community 7h ago

Making Optimization Work When Labels Are Scarce [R]

https://www.gnosyslabs.com/case-studies/safety-classifier-sparse-labels Gnosys is an autonomous model engineer: it improves prompts and classifiers when ground truth is too sparse for conventional optimization. On ToxicChat, a public safety benchmark, under realistic label…

23
Hacker News — AI on Front Page community 7h ago

Oomwoo, an open-source robot vacuum you build yourself

Article URL: https://makerspet.com/blog/building-an-open-source-robot-vacuum-meet-oomwoo/ Comments URL: https://news.ycombinator.com/item?id=48755005 Points: 241 # Comments: 41

37
r/LocalLLaMA community 14h ago

Plurality Released: fully Free and Open Source AI agents/chatbot platform for local AI

Hello everyone! Some of you might recognize my user from the work I have done on Cosmos Cloud, but today I am here to talk to you about an entirely different project: Plurality. https://github.com/azukaar/plurality Plurality has been in development for a bit more than a year and…

22
Hacker News — AI on Front Page community 18h ago

Monetization Gateway: Charge for any resource behind Cloudflare via x402

Article URL: https://blog.cloudflare.com/monetization-gateway/ Comments URL: https://news.ycombinator.com/item?id=48746914 Points: 237 # Comments: 144

28
Hacker News — AI on Front Page community 19h ago

Box3D, an open source 3D physics engine

Article URL: https://box2d.org/posts/2026/06/announcing-box3d/ Comments URL: https://news.ycombinator.com/item?id=48745445 Points: 246 # Comments: 47

12
r/LocalLLaMA community 21h ago

Thinking about grabbing 4x Ascend GX10s

Some in this sub have tested GLM5.2 on 4x DGX Sparks (or Ascend GX10) with 400-500 tok/s prompt processing and ~15 tok/s output at 128k context. Not blazing fast, but usable imo, especially with quantization. My thinking: If there's an open-source fable 5 sometime in december or…

20
r/MachineLearning community 22h ago

A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]

Prompt injection has emerged as one of the most persistent failure modes in tool-using LLM systems, particularly in agentic workflows where models interact with external data sources. Most mitigation strategies focus on input filtering or model-side alignment, but these…

9
arXiv — Machine Learning research 1d ago

Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization

arXiv:2606.30699v1 Announce Type: new Abstract: Discovering governing equations directly from observational data is a key step towards interpretable scientific machine learning. Current data-driven approaches typically operate on a single dataset, inherently limiting their…

38
arXiv — Machine Learning research 1d ago

Teaching LLMs to Recommend and Defer in Underrepresented Epilepsy Care

arXiv:2606.31036v1 Announce Type: new Abstract: Specialist epilepsy expertise is scarce in resource-constrained settings, making LLM-based decision support attractive for frontline clinicians managing longitudinal treatment. Such systems must adapt to local prescribing practice…

12
arXiv — Machine Learning research 1d ago

Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images

arXiv:2606.31394v1 Announce Type: new Abstract: Artificial intelligence is transforming our capability to solve biological challenges. In dimensionality bottleneck regimes exacerbated by high-dimensional biological data, Neural networks force distinct concepts into the lower…

14
arXiv — NLP / Computation & Language research 1d ago

Beyond Clean Text: Evaluating Encoder and Decoder Robustness for Bangla Event Detection in Noisy Text

arXiv:2606.30914v1 Announce Type: new Abstract: Event detection (ED) systems are typically evaluated on clean, curated text, leaving their robustness to real-world noise largely unexplored, particularly for low-resource languages such as Bangla. We introduce a generalized Bangla…

17
arXiv — NLP / Computation & Language research 1d ago

Building an ASR Solution for Training and Assessing Children's Reading

arXiv:2606.31508v1 Announce Type: new Abstract: Automatic speech recognition for children's reading remains underdeveloped for most African languages, including Bambara, despite its potential value for reproducible literacy assessment. We present an open-source system for…

30
arXiv — NLP / Computation & Language research 1d ago

Tone-Conditioned Curriculum Learning for Low-Resource Bantu Speech Recognition

arXiv:2606.31642v1 Announce Type: new Abstract: Southern Bantu languages are spoken by over 80 million people, yet current foundation ASR models still produce zero-shot WER above 100%, which limits practical use in education and public services. We addressed this gap with a tone…

18
arXiv — NLP / Computation & Language research 1d ago

Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian

arXiv:2606.31718v1 Announce Type: new Abstract: Relation extraction (RE) for low-resource languages is typically constrained by the lack of annotated corpora. We investigate the feasibility of cross-lingual RE for Romanian by combining automatic dataset translation with large…

38
arXiv — NLP / Computation & Language research 1d ago

LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish

arXiv:2606.31947v1 Announce Type: new Abstract: State-of-the-art speech datasets predominantly focus on widely spoken languages, often overlooking low-resource languages such as Luxembourgish, which remain underrepresented in speech technology research. In this work, we…

25
Vercel — AI dev-tools 1d ago

Enforce consistent code for agents and humans with konsistent

konsistent is now open source. konsistent is a CLI linter for TypeScript codebases that enforces structural conventions, giving agents and humans the consistent context they need to implement features correctly. Deterministic, fast, and covers structural patterns that TypeScript…

6
Simon Willison community 1d ago

Quoting Anthropic

We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will share an update soon. — Anthropic , on Twitter Tags: anthropic , claude , generative-ai , claude-mythos , ai ,…

34
Hacker News — AI on Front Page community 1d ago

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5

Article URL: https://twitter.com/AnthropicAI/status/2072106151890809341 Comments URL: https://news.ycombinator.com/item?id=48740771 Points: 232 # Comments: 90

6
TechCrunch — AI news-outlet 1d ago

OpenClaw is finally available on Android and iOS

The free open source agentic program is finally invading your phone.

21
r/LocalLLaMA community 1d ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization (from the Qwen team)

The quadratic complexity of attention poses a critical bottleneck for long-context processing, spurring interest in hybrid attention designs. Most open-source hybrid models adopt a layer-wise strategy. Yet, prior work has noted the inherent difficulty of integrating Linear…

9
r/LocalLLaMA community 1d ago

Norm-preserving abliteration on Qwen3.6-35B-A3B: 0% refusal, benchmarks intact, open source dataset

Been reading the mechanistic interpretability literature on refusal for a while now. The core insight from Arditi et al. (2024) is clean: refusal is mediated by a geometrically consistent direction in the residual stream. You can find it via the difference of means between…

4
arXiv — Machine Learning research 2d ago

Modification-Considering Value Learning for Reward Hacking Mitigation in RL

arXiv:2606.28955v1 Announce Type: new Abstract: Reinforcement learning agents can exploit misspecified reward signals to achieve high apparent returns while failing on the intended objective, a failure mode known as reward hacking. Existing practical defenses typically constrain…

10
arXiv — Machine Learning research 2d ago

Depth Exploration for LLM Decoding

arXiv:2606.29223v1 Announce Type: new Abstract: Autoregressive LLM decoding evaluates every generated token through the full layer stack, even though many tokens become predictable at intermediate depths. Existing lossless depth-adaptive methods exploit this redundancy by…

34
arXiv — Machine Learning research 2d ago

KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory

arXiv:2606.29243v1 Announce Type: new Abstract: We present KrishokChat, the first citation-grounded Bengali agricultural instruction-tuning dataset for crop advisory in low-resource settings. We establish a foundation of 290 hierarchical Knowledge Nodes, extracting disease…

30
arXiv — Machine Learning research 2d ago

Nonlinear mixture model motivated subspace clustering

arXiv:2606.29261v1 Announce Type: new Abstract: We derive the linear union-of-subspaces (UoS) model for subspace clustering (SC) from the nonlinear mixture model (NMM) used in blind source separation (BSS) to represent a D-dimensional observation vector as an unknown…

7
arXiv — Machine Learning research 2d ago

Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise

arXiv:2606.29554v1 Announce Type: new Abstract: Shuffle order can be a larger source of fine-tuning noise than a memoryless analysis predicts: fixed-clock optimizer memory makes local equal-multiset contrasts first order in the learning rate rather than second order, and the…

8
arXiv — NLP / Computation & Language research 2d ago

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

arXiv:2606.28715v1 Announce Type: new Abstract: While AI development and evaluation for Southeast Asia (SEA) has grown rapidly, agent capabilities in regional languages are still poorly understood despite its importance to sovereign AI. To fill this gap, we introduce…

28
arXiv — NLP / Computation & Language research 2d ago

Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages

arXiv:2606.28867v1 Announce Type: new Abstract: Creative Commons licenses dominate African NLP corpus releases, but their compatibility rules are rarely applied. CC-BY-SA and CC-BY-NC cannot be combined in a single published dataset; a NoDerivs clause silently prohibits…

28
arXiv — NLP / Computation & Language research 2d ago

FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization

arXiv:2606.28933v1 Announce Type: new Abstract: Venture capital (VC) investment decisions face distinct challenges, such as multi-source heterogeneous data, non-stationary time series, and the demand for explainable predictions in high-stakes, low-data settings. To overcome…

16
arXiv — NLP / Computation & Language research 2d ago

A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions

arXiv:2606.28943v1 Announce Type: new Abstract: Learning to bid in repeated multi-unit auctions with bandit feedback poses a fundamental challenge. Existing methods often rely on rigid explore-then-exploit schedules, assume stationary adversaries, and optimize solely for bidder…

9
arXiv — NLP / Computation & Language research 2d ago

To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation

arXiv:2606.29481v1 Announce Type: new Abstract: While reinforcement learning (RL) significantly enhances LLM reasoning, its efficacy is severely undermined by Pre-RL data overlap, where RL datasets overlap with pretraining or SFT corpora, causing models to exploit shortcuts by…

11
arXiv — NLP / Computation & Language research 2d ago

SrDetection: A Self-Referential Framework for Data Leakage Detection in Code Large Language Models

arXiv:2606.29815v1 Announce Type: new Abstract: Evaluating code large language models (Code LLMs) requires reliable detection of data leakage, where benchmark performance is artificially inflated by exposure to benchmark data during pre-training. Existing approaches either…

7
arXiv — NLP / Computation & Language research 2d ago

IHDec: Divergence-Steered Contrastive Decoding for Securing Multi-Turn Instruction Hierarchies

arXiv:2606.29960v1 Announce Type: new Abstract: Large Language Models (LLMs) often fail to maintain instruction hierarchies (IH) when processing multi-source inputs with varying role-level priorities, paradoxically adhering to lower-priority directives during conflicts. While…

29
Vercel — AI dev-tools 2d ago

Vercel and Shopify are rebuilding Hydrogen

Hydrogen made headless storefronts easy to ship, but not portable. At Vercel Ship 26 in New York, we announced that we are working with Shopify to rebuild it from the ground up, a shared bet on a more open web. The new version is open source and runtime agnostic, meaning it can…

31
Hugging Face Daily Papers research 2d ago

ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models

Abstract ReasoningLens is an open-source framework that provides hierarchical visualization and diagnostic auditing for complex reasoning chains in large reasoning models, enabling structured analysis and error detection through interactive hierarchies and automated auditing.…

21
r/LocalLLaMA community 2d ago

I Hate Dario Amodei, and everything he stands for.

I am so incredibly sick of this guy‘s fear mongering about open source while fundamentally misunderstanding how it actually works. He recently dropped some arguments that are so completely detached from reality, it honestly feels like he’s never even touched a local model in his…

31
r/LocalLLaMA community 2d ago

Amodei: "Open Source Models Will Eat Your Children"

  submitted by   /u/johnnyApplePRNG [link]   [comments]

35
Hacker News — AI on Front Page community 2d ago

Ornith-1.0: self-improving open-source models for agentic coding

Article URL: https://github.com/deepreinforce-ai/Ornith-1 Comments URL: https://news.ycombinator.com/item?id=48722052 Points: 215 # Comments: 39

31
r/LocalLLaMA community 2d ago

What's the full local AI "doomsday prepper" kit for cold storage? 16-bit safetensors of LLMs (obv), copies/source codes of Llama.cpp, ComfyUI, vLLM, Kobold, LMStudio, etc, macOS, Linux OSes, Windows 10&11, etc, Rufus (including older ones), various VMs, P-E-W's Heretic/Grimoire,…

For those who want to be as paranoid and maximally doomsday prepped as possible, I am curious what the most thorough "doomsday kit" is of things to store offline copies of "just in case", to still be able to use local AI if things go truly crazy to a super extreme level. So far…

23
r/LocalLLaMA community 2d ago

Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."

  submitted by   /u/johnnyApplePRNG [link]   [comments]

4
Vercel — AI dev-tools 2d ago

Vercel Open Source Program: Spring 2026 cohort

The world runs on open source software. The frameworks, libraries, and tools we rely on are made possible by communities that share ideas and build in the open. At Vercel, we want to help those communities thrive. That’s why we run the Vercel Open Source Program : a developer…

24
OpenAI official-blog 3d ago

Mapping Europe’s AI Workforce Opportunity

A new OpenAI report maps how AI could reshape jobs across the EU, highlighting which occupations may face automation, growth, or workflow changes.

32
arXiv — Machine Learning research 3d ago

Unified Zero-Shot Time Series Forecasting: A Darts Foundation

arXiv:2606.27438v1 Announce Type: new Abstract: Since its initial release in 2020, Darts has become a widely used open-source Python library for time series analysis. A series of foundation models have recently claimed accuracy improvements in zero-shot forecasting, promising a…

15
arXiv — Machine Learning research 3d ago

Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte

arXiv:2606.28220v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving nonlinear partial differential equations (PDEs), including battery electrochemical models. They typically en-force conservation laws within the…

15

SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.

TRACE: State-Aware Query Processing over Temporal Evidence Graphs for Conversational Data

A Mechanistic View of Authority Hierarchy in LLM Sycophancy

The Course of News Events: A Comparison of Bottom-Up and Top-Down Approaches for Collecting Text-Based Data about Disasters

Beyond Document Grounding: Span-Level Hallucination Detection over Code, Tool Output, and Documents

LuxIT: A Luxembourgish Instruction Tuning Dataset from Monolingual Seed Data

Making Optimization Work When Labels Are Scarce [R]

Oomwoo, an open-source robot vacuum you build yourself

Plurality Released: fully Free and Open Source AI agents/chatbot platform for local AI

Monetization Gateway: Charge for any resource behind Cloudflare via x402

Box3D, an open source 3D physics engine

Thinking about grabbing 4x Ascend GX10s

A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]

Joint discovery of governing partial differential equations from multi-source datasets by competitive optimization

Teaching LLMs to Recommend and Defer in Underrepresented Epilepsy Care

Resolving superposition in AI for interpretability and cross-modal alignment in patient-neuronal images

Beyond Clean Text: Evaluating Encoder and Decoder Robustness for Bangla Event Detection in Noisy Text

Building an ASR Solution for Training and Assessing Children's Reading

Tone-Conditioned Curriculum Learning for Low-Resource Bantu Speech Recognition

Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian

LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish

Enforce consistent code for agents and humans with konsistent

Quoting Anthropic

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5

OpenClaw is finally available on Android and iOS

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization (from the Qwen team)

Norm-preserving abliteration on Qwen3.6-35B-A3B: 0% refusal, benchmarks intact, open source dataset

Modification-Considering Value Learning for Reward Hacking Mitigation in RL

Depth Exploration for LLM Decoding

KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory

Nonlinear mixture model motivated subspace clustering

Optimizer Memory Makes Shuffle Order a First-Order Source of Fine-Tuning Noise

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages

FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization

A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions

To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation

SrDetection: A Self-Referential Framework for Data Leakage Detection in Code Large Language Models

IHDec: Divergence-Steered Contrastive Decoding for Securing Multi-Turn Instruction Hierarchies

Vercel and Shopify are rebuilding Hydrogen

ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models

I Hate Dario Amodei, and everything he stands for.

Amodei: "Open Source Models Will Eat Your Children"

Ornith-1.0: self-improving open-source models for agentic coding

What's the full local AI "doomsday prepper" kit for cold storage? 16-bit safetensors of LLMs (obv), copies/source codes of Llama.cpp, ComfyUI, vLLM, Kobold, LMStudio, etc, macOS, Linux OSes, Windows 10&11, etc, Rufus (including older ones), various VMs, P-E-W's Heretic/Grimoire,…

Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."

Vercel Open Source Program: Spring 2026 cohort

Mapping Europe’s AI Workforce Opportunity

Unified Zero-Shot Time Series Forecasting: A Darts Foundation

Physics-Informed Neural Network with Transfer Learning for State Estimation in Lithium-Ion Batteries using the Single Particle Model with Electrolyte