Tag

Open source

329 articles archived under #open-source · RSS

r/MachineLearning community 16d ago

Open weights are not enough: we need open training frameworks for research and better algorithms [P]

Open weights are important and critical, but they are not enough by themselves. If we want open ML and AI research to move forward, we also need open training frameworks: codebases that do more than run jobs. They should make the training process visible, understandable, and…

34
r/LocalLLaMA community 16d ago

archex: local-first, deterministic code-context for AI agents — no API key, no telemetry (Apache 2.0)

archex turns a repo into a ranked, token-budgeted context bundle for coding agents: the symbols, imports, dependency-graph neighbors, and provenance the model needs, assembled before it reasons. It returns context, not an answer — your local model still does the thinking. The…

24
r/LocalLLaMA community 17d ago

Made a macOS app that creates highly personal macOS apps. Works with models as small as Gemma 4 E2B

Apologies in advance as the video is demonstrating with GPT 5.4 mini (a local model would take too long for a video), however I’ve made the same app with Gemma 4 E4B. Been working on an open source project for a while called Ironsmith. The gist is you can create highly…

13
r/MachineLearning community 17d ago

I built an open-source Knowledge Graph pipeline with hybrid retrieval to improve LLM multi-hop reasoning [P]

Hey everyone, I built an open-source full-stack pipeline (Django + React) that constructs a Knowledge Graph from raw text, detects thematic communities, and uses hybrid search to solve the "lost in the middle" problem in standard vector retrieval. The Pipeline: Ingestion &…

8
r/LocalLLaMA community 17d ago

z.ai Poll on X: MIT-licensed open weights are losing

You can cast your vote here: https://x.com/ZixuanLi_/status/2065646648777416770#m Just to be clear: I am not urging or brigading anyone to vote specifically for MIT-licensed open weights. Please choose the option you genuinely prefer. I previously shared this in another post,…

27
Hacker News — AI on Front Page community 17d ago

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

TLDR: I had 2,207 GoPro videos, and I need to rewatch them to find interesting moments from my cycling journey. I built a project to index them locally on my M1 Max using open-source ML models, search for those moments, and send the best clips straight to my DaVinci Resolve…

28
r/LocalLLaMA community 17d ago

Xiaomi is now serving MiMo V2.5 at 1000-3000tps using DFlash & Persistent kernel. DFlash model is out, open-source release promised soon

https://mimo.xiaomi.com/blog/mimo-tilert-1000tps · submitted by /u/Dany0

20
r/LocalLLaMA community 18d ago

Local models in mid-2026

Open weights got close enough to run at home this year, not by needing more RAM but the reverse: sparse attention, MoE, latent KV compression, multi-token prediction and four-bit quant. · submitted by /u/mattjcoles

11
r/LocalLLaMA community 18d ago

Open source AI Must Win

submitted by /u/rm-rf-rm

12
r/MachineLearning community 18d ago

I’m building a free bilingual machine-learning notebook course — looking for feedback on structure and coverage [R]

Hi everyone, I’m building an open-source machine-learning tutorial repository in Jupyter Notebook format: https://github.com/mohammadijoo/Machine_Learning_Tutorials The course is bilingual: English and Persian/Farsi versions are organized in parallel. The goal is to make a…

18
r/LocalLLaMA community 18d ago

Add arch support for cohere2-MoE by michaelw9999 · Pull Request #24260 · ggml-org/llama.cpp

It's time to recompile your llama.cpp and download the new model: https://huggingface.co/CohereLabs/North-Mini-Code-1.0 and https://huggingface.co/unsloth/North-Mini-Code-1.0-GGUF. Model Summary: North Mini Code is an open weights research release of a 30B-A3B parameter model optimized…

10
r/LocalLLaMA community 18d ago

This is coming to Chinese open source models pretty soon. - prepare yourself.

Don’t be surprised. Prepare yourself. This could happen anytime. There’s a bigger strategy here than just Fable5. · submitted by /u/MLExpert000

35
r/LocalLLaMA community 19d ago

GLM 5.2 is out - open weights to be released next week. How did it do on my one-shot Pac-Man test?

Quick initial impressions: at 70 tok/s, slower than GLM 5.1; seems to spend more time reasoning; better results with my Pac-Man test. The one-shot result is almost functional; apart from the ghosts getting stuck immediately after leaving the ghost house, I did not notice any…

14
r/LocalLLaMA community 19d ago

GLM 5.2 is deployed in GLM Coding Plan. API and MIT weights in a week. Voting and benchmarks on X.

The model now supports a 1M context window and two thinking modes: max and high. z.ai recommends using max for coding. Vote on X: "What should we prioritize most?" Longer context window / MIT-licensed open weights / No price increase. Other links: GLM 5.2 announcement, LLM Benchmark…

32
r/LocalLLaMA community 19d ago

We should set up a torrent network for open source models.

Was just thinking about this due to recent events. Hugging Face is a US-based company, legally incorporated as Hugging Face, Inc. with its official headquarters located in Brooklyn, New York. It seems like a pretty big single point of failure for local models. Maybe a…

23
r/LocalLLaMA community 19d ago

New model on huggingface

https://huggingface.co/prefeitura-rio/Rio-3.5-Open-397B A Qwen finetune. Looks pretty even with Qwen 3.7 plus, except it's actually open source. Disclosure: I work as a researcher for the city government of Rio de Janeiro, which developed this model. · submitted by …

15
Hacker News — AI on Front Page community 19d ago

Open Source AI Must Win

Article URL: https://opensourceaimustwin.com/?share=v2 · Comments URL: https://news.ycombinator.com/item?id=48511908 · Points: 307 · Comments: 70

26
Hacker News — AI on Front Page community 20d ago

Kimi K2.7-Code: open-source coding model with better token efficiency

Article URL: https://huggingface.co/moonshotai/Kimi-K2.7-Code · Comments URL: https://news.ycombinator.com/item?id=48502347 · Points: 208 · Comments: 100

34
r/MachineLearning community 20d ago

Building an Open Source Edge Semantic Cache for LLMs in Rust/WASM – Sanity check on the architecture? [D]

Hey everyone, I am planning out a new open-source infrastructure project and want to get some brutal feedback on the architecture and use-case validity from people running high volume LLM workloads in production. The Problem: Python-based proxies/gateways introduce too much…

27
r/LocalLLaMA community 20d ago

Huawei Released openPangu 2.0 (Will open source on June 30)

At the Huawei Developer Conference (HDC 2026) held on June 12, Richard Yu, Executive Director of Huawei, officially launched the brand-new, open-source Pangu large model—openPangu 2.0. The model is fully adapted to the HarmonyOS ecosystem and has achieved deep optimization and…

12
arXiv — NLP / Computation & Language research 20d ago

Two Wrongs, No Right: Auditing Social-Desirability Bias in LLM Annotators for Computational Social Science

arXiv:2606.12426v1 Announce Type: cross Abstract: LLM annotators are increasingly used in computational social science (CSS), but it is unclear whether their alignment-shaped errors preserve the empirical conclusions a researcher would report. We audit three open-source 7B…

10
Hacker News — AI on Front Page community 20d ago

Codex for open source

Article URL: https://openai.com/form/codex-for-oss/ · Comments URL: https://news.ycombinator.com/item?id=48497195 · Points: 216 · Comments: 74

11
Hacker News — AI on Front Page community 20d ago

MiMo Code is now released and open-source

Article URL: https://mimo.xiaomi.com/mimocode · Comments URL: https://news.ycombinator.com/item?id=48490826 · Points: 259 · Comments: 134

37
r/LocalLLaMA community 20d ago

As we know Minimax M3 is just going to be open sourced in few days and because of that I was surfing on internet searching for its scores and I found out pretty interesting results. Is Minimax M3 really that good in agentic stuff and in coding? Is it better than older gpt models?

Has anyone personally compared the Minimax M3 model against other proprietary models to determine its relative performance tier? I am trying to understand where it currently ranks in the broader AI landscape. Can we say Minimax M3 is better than GPT 5.2 in coding and agentic…

26
r/LocalLLaMA community 21d ago

Cognitor: open-source semantic search engine. Automatically chunks, embeds and indexes the content of a target folder, making it searchable semantically.

https://github.com/tanaos/cognitor Cognitor is an open-source semantic search engine and vector database which automatically chunks, embeds and indexes the entire content of a target folder (and its subfolders), making it easily searchable by both AI agents and humans.…

15
r/LocalLLaMA community 21d ago

How I implemented ASR bias for voice transcription models [Open Source]

I've been spending the last couple of weeks building a Wispr Flow clone as an open source project. For context, it is a voice dictation app that lets you type faster, by speaking instead of actually typing. I spent the first week building the basic STT capabilities. One of the…

29
r/LocalLLaMA community 21d ago

Minimax M3 open weights release planned for Friday

submitted by /u/rmhubbert

28
arXiv — Machine Learning research 21d ago

Bergson: An Open Source Library for Data Attribution

arXiv:2606.11660v1 Announce Type: new Abstract: Data attribution is a promising field in interpretability that aims to explain model behavior through the influence of its training data, with applications including debugging undesirable model behavior and training dataset…

26
r/LocalLLaMA community 21d ago

nvidia/diffusiongemma-26B-A4B-it-NVFP4 · Hugging Face

Model Overview Description: DiffusionGemma 26B A4B IT is an open-weights multimodal generative model developed by Google DeepMind that processes text, image, and video inputs to produce text output via discrete diffusion. Built on the Gemma 4 26B A4B Mixture-of-Experts (MoE)…

12
r/MachineLearning community 21d ago

Pyrecall: open-source tool for detecting catastrophic forgetting during LLM fine-tuning [P]

Surprised there's no real tooling for this given how much research exists on continual learning. Built pyrecall to fill the gap. Snapshots skill scores before/after fine-tuning, flags regressions, rolls back LoRA adapters by name. Fully local, no external APIs. v0.1.0, MIT, pip…

17
r/LocalLLaMA community 21d ago

Best Open-Source AI coding model for my specs?

Hello everyone! I'm looking for the most powerful open-source coding AI that still fits my system. My specs: CPU: AMD Ryzen 7 7700; GPU: RTX 5070; RAM: 32 GB DDR5; OS: Windows 11. Use case: writing, coding, debugging. Any recommendations would be great. Thanks in advance …

4
r/LocalLLaMA community 21d ago

DeepMind Just Dropped "DiffusionGemma" — Text Generation via Image-Style Diffusion Model

Another open-weight model got dropped today. This one's from DeepMind; seems like a good day for the OSS geeks. Released under Apache 2.0. Instead of generating text sequentially token-by-token like almost every autoregressive model on the market, it uses a text diffusion head. -…

32
r/LocalLLaMA community 22d ago

Cohere released North Mini Code: Its first Open-Source Agentic Coding Model

Small: 30 billion parameters, 3B active. Efficient: Benchmarks to 33.4 on the Artificial Analysis Coding Index, competitive among similar sized models. Open Source: Apache 2.0 license. HF: https://huggingface.co/CohereLabs/North-Mini-Code-1.0 · submitted by …

8
r/MachineLearning community 22d ago

Introducing Papers Without Code [P]

Hi, Niels here from the open-source team at Hugging Face. I've recently relaunched paperswithcode.co as a source for finding the state of the art (SOTA) across various AI domains, from 3D generation to AI agents. This is done by automatically parsing research papers published on…

36
r/MachineLearning community 22d ago

RelayOps - Production-shaped telecom support agent (54% auto-resolve, 0 unsafe actions, full audit + decision console) [P]

I just open-sourced RelayOps - a small, honest, production-shaped AI support agent built specifically for telecom and subscription billing queues. Key results (v1.5.1): 54% of a 50-ticket sample queue auto-resolved; 0 unsafe auto-actions; 0 billing escapes (tested on 12…

25
Hugging Face Daily Papers research 22d ago

Kwai Keye-VL-2.0 Technical Report

Abstract Kwai Keye-VL-2.0-30B-A3B is an open-source Mixture-of-Experts multimodal foundation model that enables long-video understanding and agentic intelligence through DeepSeek Sparse Attention and specialized training infrastructure. Generated by…

36
r/LocalLLaMA community 22d ago

Without open source LLMs, US AI companies could have already monopolized the technology

For a technology with such clear importance and impact on all of us, I believe that making it open source is an ethical duty. Otherwise, especially with the one-sided politics of the US we experience today, they could have already monopolized the technology by now, maybe make it…

22
arXiv — NLP / Computation & Language research 22d ago

OpenRTLSet: A Fully Open-Source Dataset for Large Language Model-based Verilog Module Design

arXiv:2606.10285v1 Announce Type: new Abstract: OpenRTLSet introduces the largest fully open-source dataset for hardware design, offering over 131,000 diverse Verilog code samples to the research community and industry. Our dataset uniquely combines Verilog code from GitHub…

6
r/LocalLLaMA community 22d ago

Without open LLM competition, closed source LLM companies will become insatiable.

I can't imagine how arrogant one must be to make such a decision. People pay $200 a month for Anthropic to mess with their codebase. Imagine how they would humiliate their customers if the world didn't have an open-source model.…

6
r/LocalLLaMA community 22d ago

Releasing Apodex-1.0 Smol Models (0.8B, 2B, 4B Open-Weights) optimized for Agentic Verification + AgentHarness Evals

Hey r/LocalLLaMA, we just released Apodex 1.0, and alongside our flagship API, we are releasing the weights for our Smol models (0.8B, 2B, and 4B). Our core research focuses on independent verification in long-horizon tasks. Instead of just scaling up parameter sizes for raw…

23
r/LocalLLaMA community 22d ago

zai-org/SCAIL-2 · Hugging Face

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning. SCAIL-2 is an open-source model for end-to-end controlled character animation. It animates a reference character with a driving video, and also supports character replacement and…

15
r/LocalLLaMA community 23d ago

Have we reached the point where open-source LLMs are “just good enough”?

The question I’m asking myself is whether open-source LLMs are now “just good enough” to meet 95% of requirements. I know, of course, that they still need to and will get even better, but where does the added value of the remaining 5% come from? a) Better answer quality? Okay,…

19
Hacker News — AI on Front Page community 23d ago

Microsoft's open source tools were hacked to steal passwords of AI developers

Article URL: https://techcrunch.com/2026/06/08/microsofts-open-source-tools-were-hacked-to-steal-passwords-of-ai-developers/ Comments URL: https://news.ycombinator.com/item?id=48457830 Points: 233 # Comments: 97

25
r/LocalLLaMA community 23d ago

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

I fine-tuned NVIDIA's Parakeet TDT 0.6B v2 for clinical speech and am releasing the weights as Omi Med STT v1 (CC-BY-4.0). Disclosure: I'm the founder of Omi Health and built this. Happy to dig into the training mix, benchmark, failure cases, quantization, or anything else. The…

14
r/MachineLearning community 23d ago

How to start open source contribution [D]

Hi everyone, I wrote a blog post about how I started contributing to open source, documenting every small detail. Please give it a read and a review, as this is my first time blogging. It is free! https://substack.com/home/post/p-200202050 · submitted by…

25
r/LocalLLaMA community 23d ago

Was BitNet a dead end? What happened to ternary LLMs?

They seemed so promising at one point but the biggest ternary model is still 2B. What happened? Why aren't the frontier open weights AI labs attempting to use them? · submitted by /u/3ntrope

7
r/MachineLearning community 23d ago

I'd like to share an updated methodology for building agents. [P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice, an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

20
Hacker News — AI on Front Page community 23d ago

Show HN: Gitdot – A better GitHub. Open-source, written in Rust

What works now: user signups, org creations, private/public repos, and importing GitHub repositories (both as read-only mirrors and full migrations). So basically, you can create, push and pull to a repo, but we don't have many features quite yet (issues, PRs, CI). What is a bit…

34
r/LocalLLaMA community 23d ago

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

OpenEnv is a tool for creating an agentic execution environment like terminals, browsers, or anything an agent can interact with. And today, we’re excited to announce that OpenEnv is becoming even more open, to make the future of training agents open source. Starting today,…

38
arXiv — Machine Learning research 24d ago

TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection

arXiv:2606.06742v1 Announce Type: new Abstract: TorchKM is an open-source library for kernel machines, including support vector machines, kernel logistic regression, and kernel quantile regression, with GPU acceleration. The library features a scikit-learn-style API and is…

36
