Tag

Open source

331 articles archived under #open-source

Ars Technica — AI news-outlet 1mo ago

Millions of AI agents imperiled by critical vulnerability in open source package

"BadHost" was found in Starlette, a package with 325 million weekly downloads.

19
Hacker News — AI on Front Page community 1mo ago

Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer

Article URL: https://blog.prusa3d.com/our-new-open-source-colormix-model-in-prusaslicer-and-easyprint_136079/ Comments URL: https://news.ycombinator.com/item?id=48283410 Points: 214 # Comments: 67

5
Interconnects (Nathan Lambert) research 1mo ago

Some ideas for what comes next, May 2026

Gemini Flash 3.5, Mythos, open-closed balance, America's open-source surge, emerging power struggles and more.

10
r/LocalLLaMA community 1mo ago

Small set of local MCP server installers for home Linux users

Hi all, I have published a small open-source MCP server bundle called MCP Basic Servers: https://github.com/mchowy-troll/mcp-basic-servers It is a collection of simple Bash installer scripts for running local MCP HTTP servers on Linux. The idea is simple: run one script,…

38
r/LocalLLaMA community 1mo ago

New KV Quants coming 😍 Welcome OSCAR kv quant open sourced by togetherAI

Just when we started embracing turboquant this happens. Submitted by /u/yehyakar

5
Smol AI News news-outlet 1mo ago

not much happened today

**Inference optimization** is increasingly architectural, with **EAGLE 3.1** improving speculative decoding and long-context handling, collaborating with **vLLM** and **TorchSpec**. **Perplexity** open-sourced a rebuilt **Unigram tokenizer** cutting CPU use by **5–6×** and…

15
r/MachineLearning community 1mo ago

Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P]

I’ve been thinking about a problem in current agent systems: Most agents are becoming very good at execution, but the decision layer before execution is still unclear. Coding agents, research agents, tool loops, sandboxes, workflows, and harnesses are all improving quickly. Once…

38
r/MachineLearning community 1mo ago

I’m building an open-source decision layer above AI agents [P]

Hi everyone, I’m Jia, the creator of Spice. I’ve been working on an open-source project called Spice. The simplest way to describe it is: Spice is a decision layer above agents. Most agent systems today are very focused on execution. They are getting better at doing tasks after…

30
arXiv — Machine Learning research 1mo ago

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

arXiv:2605.23037v1 Announce Type: new Abstract: Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are…

8
arXiv — Machine Learning research 1mo ago

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

arXiv:2605.23410v1 Announce Type: new Abstract: The explosive growth of open-source model repositories has created a Model Jungle, where checkpoints are frequently shared without adequate documentation or metadata. While weight-space learning offers a pathway to identify and…

20
arXiv — Machine Learning research 1mo ago

An Open-Source Training Dataset for Foundation Models for Black-box Optimization

arXiv:2605.23417v1 Announce Type: new Abstract: Most black-box optimization methods require extensive hyperparameter tuning, often limiting their ability to generalize across different optimization domains. Foundation models for black-box optimization that learn optimization…

21
arXiv — NLP / Computation & Language research 1mo ago

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

arXiv:2605.22843v1 Announce Type: new Abstract: Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services. In real-world scenarios, performance is often…

18
arXiv — NLP / Computation & Language research 1mo ago

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

arXiv:2605.23618v1 Announce Type: new Abstract: We benchmark Google Embeddings (GE2), a Vertex-AI-hosted bi-encoder with 2,048-token context and explicit task-type conditioning, against five open-source alternatives: BGE-M3, E5-large, Multilingual-E5-large (mE5-L), LaBSE, and…

7
arXiv — NLP / Computation & Language research 1mo ago

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

arXiv:2605.23657v1 Announce Type: new Abstract: Skills, i.e., structured workflow instructions distilled for large language models (LLMs), are becoming an increasingly important mechanism for improving agent performance on real-world downstream tasks. However, as the open-source…

5
arXiv — NLP / Computation & Language research 1mo ago

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

arXiv:2505.13893v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have intensified efforts to fuse heterogeneous open-source models into a unified system that inherits their complementary strengths. Existing logit-based fusion methods maintain…

35
r/LocalLLaMA community 1mo ago

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

A few weeks ago, after finishing FastDMS, I started toying around writing some RDNA3 kernels again to see how fast I could get Qwen 3.6 MoE running. It turned out well enough, so over the past couple weeks, I turned those experiments into hipEngine, a new open source (AGPLv3)…

13
r/LocalLLaMA community 1mo ago

Could Open Models be trained to secretly go rogue?

I was discussing with some other folks how safe it is to use open-weights models from China, and the topic of a "trojan horse" came up. We know that, at least with the current architecture, models can't run code on their own. They are entirely dependent on tools and harnesses. We also…

20
Hacker News — AI on Front Page community 1mo ago

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

Article URL: https://audiomass.co/?multitrack=1 Comments URL: https://news.ycombinator.com/item?id=48258015 Points: 338 # Comments: 68

29
r/MachineLearning community 1mo ago

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

At our work we use CUDA in Rust since the company switched to it recently. Rust has pretty good Driver API bindings, but it made me wonder why the hell we can't have something decent in Go without cgo. I mostly build ML tools in the last month and Go is my main language for pretty…

30
r/MachineLearning community 1mo ago

PapersWithCode new features - week 1 [P]

Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co, a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…

23
r/LocalLLaMA community 1mo ago

Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)

Hi, (TL;DR): Qwen in its MTP version has tool-call bugs and outputs everything into tool/thinking blocks, mangling the output and canceling the speed gain with repeated wrong tool calls! DCSS works well with non-MTP Qwen even on smaller quants. I'm testing the new MTP models and…

19
Hacker News — AI on Front Page community 1mo ago

Microsoft open-sources "the earliest DOS source code discovered to date"

https://opensource.microsoft.com/blog/2026/04/28/continuing-... Comments URL: https://news.ycombinator.com/item?id=48253386 Points: 224 # Comments: 55

33
r/LocalLLaMA community 1mo ago

Command A+ (218B MoE) running on Apple Silicon — MLX port, PR open

Cohere dropped Command A+ on the 20th (218B total / 25B active, 128 experts top-8, Apache 2.0). Wrote a cohere2_moe implementation for mlx-lm to get it running on Apple Silicon. Architecture notes for anyone digging into this model: - Single shared expert with a larger…

12
r/MachineLearning community 1mo ago

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice, an open-source decision layer above agents. We have tons of great execution agents now: Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

6
r/LocalLLaMA community 1mo ago

meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face

🚀 Model Introduction We are excited to announce the release of LongCat-Video-Avatar 1.5, an upgraded open-source framework that prioritizes extreme empirical optimization and production-readiness for audio-driven human video generation. Built upon the LongCat-Video foundation…

21
r/LocalLLaMA community 1mo ago

I fine-tuned Cohere Transcribe to support diarization and timestamps

Hi I'll keep it short: Cohere-transcribe is currently the best open source speech to text model (and possibly even better than other proprietary models). BUT it doesn't support diarization (speaker identification) and timestamps, even though there are tokens for it in the…

36
r/LocalLLaMA community 1mo ago

DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals

https://www.bloomberg.com/news/articles/2026-05-22/deepseek-founder-declares-agi-goal-as-10-billion-round-advances submitted by /u/External_Mood4719

17
r/LocalLLaMA community 1mo ago

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

My paper got published today at Arxiv. It raises questions about how language models behave when the framing of a request shifts. Small open-source AI models can be moved from honest to dishonest behaviour by little more than a change in tone. Asked to solve coding problems…

4
r/LocalLLaMA community 1mo ago

'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.

This has turned out to be useful to many of my friends so I thought I'd share here as well. I created a tool and documentation page for most major open-source projects' adherence to 'OpenAI compatibility' after seeing inconsistencies between engines like vLLM and llama.cpp. Now…

18
arXiv — Machine Learning research 1mo ago

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

arXiv:2605.20749v1 Announce Type: new Abstract: Gated Linear Units (GLU) and their variants are widely adopted in modern open-source large language model architectures and consistently outperform their non-gated counterparts, yet the underlying reasons for this advantage remain…

34
arXiv — NLP / Computation & Language research 1mo ago

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

arXiv:2605.20591v1 Announce Type: new Abstract: Medical large language models (LLMs), including custom medical GPTs (MedGPTs) and open-source models, are increasingly deployed on web platforms to provide clinical guidance. However, they pose risks of hallucination, policy…

33
r/MachineLearning community 1mo ago

l9gpu - open-source GPU observability with workload-level attribution [P]

GPU monitoring tools like DCGM give you hardware-level metrics but no workload context. When a node is saturated, you can't tell which experiment, team, or job is responsible without digging through logs. We built l9gpu to close that gap. It's a node-level agent that exports GPU…

25
r/LocalLLaMA community 1mo ago

Re. what ever happened to Cohere’s Command-A series of models?

Hey everyone, Nick Frosst here from Cohere. A few months ago Aidan (my cofounder) left a comment in here about our Command series and how we were working on some more powerful, open-weights models behind the scenes. We just launched Command A+ and we wanted to share it with you…

37
r/MachineLearning community 1mo ago

NOML-NOML: hierarchical TD3 + anchor policy for flight control [P]

I built a custom RL algorithm for continuous flight control and open-sourced it. Sharing here in case the structural ideas are useful for anyone doing continuous control where one action axis dominates. I've been training continuous control on a 6-DoF flight sim…

31
arXiv — NLP / Computation & Language research 1mo ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

arXiv:2605.19577v1 Announce Type: new Abstract: We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR). Existing long-context RL methods often treat data construction as a matter…

17
Hugging Face Daily Papers research 1mo ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Abstract GoLongRL presents an open-source approach for long-context reinforcement learning with diverse reward optimization through capability-oriented data construction and TMN-Reweight methodology. AI-generated summary We present GoLongRL, a fully open-source,…

37
r/LocalLLaMA community 1mo ago

PrivateScribe.ai - Fully local, MIT licensed, free AI transcription built with HIPAA/legal safeguards in mind - One Year Update!

I first posted about PrivateScribe.ai ~1yr ago and have recently jumped back in, intent on bringing it to a level of functionality that makes it actually usable by non-technical users. One year ago it worked but only the bare minimum. Since then I've gotten ⭐️74 github stars!⭐️ and have had…

31
r/LocalLLaMA community 1mo ago

Open weights GLM and Mimo are better than Gemini 3.5 flash according to arena

While we are weathering the Gemini 3.5 Flash hype, keep in mind that according to arena, GLM and Mimo are better. https://arena.ai/leaderboard/text/coding-no-style-control #7 GLM, #9 Mimo, #12 Gemini 3.5 Flash. Submitted by /u/Terminator857

5
r/LocalLLaMA community 1mo ago

Floor for local meeting summarization on a 6GB GPU: qwen3.5:0.8b works at 57s, Granite 4 350M hallucinates

Disclosure: I made this. Open-source, MIT, Windows + Linux. Not affiliated with voiceflow.com (the chatbot SaaS, name collision, sorry). Why this exists: I wanted local-only dictation and meeting transcription, because audio shouldn't have to leave the machine just to become…

13
The Information — AI news-outlet 1mo ago

Is the Gap Widening Between Anthropic and Open-Source Models?

Some developers have told me that the rising costs of frontier AI models from Anthropic and other firms could prompt them to shift to cheaper open-source AI. After all, when companies as sophisticated as Uber are accidentally blowing through their entire year’s AI budget in a…

8
Hacker News — AI on Front Page community 1mo ago

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments. I built Forge, an open-source reliability layer for self-hosted LLM tool-calling. What it does: - Adds domain-and-tool-agnostic guardrails (retry nudges, step enforcement, error recovery, VRAM-aware context…

14
r/LocalLLaMA community 1mo ago

bytedance released an open source model that attempts to do just about anything with only 3b parameters

Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. Efficient at 3B scale. With only 3B active parameters, Lance delivers strong performance across image generation, image…

32
arXiv — Machine Learning research 1mo ago

Provably Shorter Scratchpads in Hybrid DeltaNet-Attention Decoders

arXiv:2605.16640v1 Announce Type: new Abstract: We investigate the expressive power of hybrid recurrent-attention decoders, a class of architectures used in recent open-source language models such as Qwen3-Next and its successors. These models combine Gated Attention heads with…

28
arXiv — NLP / Computation & Language research 1mo ago

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers

arXiv:2605.16941v1 Announce Type: new Abstract: Diffusion Large Language Models (DLLMs) promise fast parallel generation, yet open-source DLLMs still face a severe quality-speed trade-off: accelerating decoding by revealing multiple tokens often causes substantial quality…

7
r/MachineLearning community 1mo ago

Witchcraft, fast local semantic search on top of SQLite [P]

Witchcraft (https://github.com/dropbox/witchcraft), an open source project that I built at Dropbox, is a from-scratch re-implementation of Stanford's XTR-Warp semantic search engine (https://github.com/jlscheerer/xtr-warp) in safe Rust, using a single-file SQLite database…

32
r/MachineLearning community 1mo ago

Reviving PapersWithCode (by Hugging Face) [P]

Hi, Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta. Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale…

10
Hacker News — AI on Front Page community 1mo ago

Show HN: Files.md – Open-source alternative to Obsidian

Article URL: https://github.com/zakirullin/files.md Comments URL: https://news.ycombinator.com/item?id=48179677 Points: 208 # Comments: 121

14
r/LocalLLaMA community 1mo ago

New models when? Forecasting release date.

After the recent releases, there's almost a sense of emptiness. When do you think new models will be released? Looking at the chart, it's between the end of May and the beginning of June, but... I don't know why, it seems like something's changing about "open weights"  …

4
arXiv — NLP / Computation & Language research 1mo ago

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs

arXiv:2605.15763v1 Announce Type: new Abstract: Current state-of-the-art Quality Estimation (QE) in machine translation relies on massive, proprietary LLMs, raising data privacy concerns. We demonstrate that smaller, open-source LLMs (<30B parameters) are a viable,…

29
r/LocalLLaMA community 1mo ago

Cutoff dates of open source models

I was trying Qwen 3.6-27b and Gemma4 in a simple web chat. Asked them both a question like 'recommend the best llm for a 5060ti' and was surprised when they both replied 'user is asking about a card that doesn't exist'. I then saw their knowledge cutoff was early 2025, hence why. But…

12
