Tag

Open source

329 articles archived under #open-source · RSS

Hugging Face Daily Papers research 8d ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Abstract An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training data. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Agentic language models dramatically…

34
arXiv — Machine Learning research 8d ago

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

arXiv:2606.24388v1 Announce Type: cross Abstract: We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and practical, extending existing benchmarks by…

38
arXiv — NLP / Computation & Language research 8d ago

ESBMC-PLC+: A Unified IEC~61131-3 Formal Verification Framework as a PLCverif Successor

arXiv:2606.23870v1 Announce Type: cross Abstract: PLCverif is the most mature open-source platform for PLC formal verification, developed at CERN and in production use since 2019. Yet it has two fundamental limitations: no support for Ladder Diagram (LD) programs, the dominant…

35
Hugging Face Daily Papers research 8d ago

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

Abstract A unified open-source framework for discrete text-trigger optimization that standardizes the development and execution of optimization strategies across various domains and applications. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Discrete text-trigger optimization --…

18
r/LocalLLaMA community 8d ago

650+ Apache-2.0 biomedical NER/de-id models that run on-device in MLX. Same fp32 weights, identical outputs: the clinical NER models run 30-40x faster than PyTorch-CPU on a 3-year-old M3 Max. Repro inside.

Disclosure first: I maintain OpenMed, so read this with that bias. I'm posting the numbers with the full methodology and a runnable script so you can reproduce or tear it apart. I'm here for the next couple of hours to answer methodology questions. What it is: an open-source…

25
Hacker News — AI on Front Page community 8d ago

Krea 2: SOTA open-weights 12B image model

Article URL: https://www.krea.ai/blog/krea-2-technical-report Comments URL: https://news.ycombinator.com/item?id=48646659 Points: 247 # Comments: 33

4
Hugging Face Daily Papers research 9d ago

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

Abstract AOHP presents an Android-based operating system framework that treats AI agents as first-class entities, enhancing task completion rates and reducing execution costs through specialized agent-oriented mechanisms. Generated by Qwen/Qwen2.5-Coder-32B-Instruct AI agents…

16
r/LocalLLaMA community 9d ago

Boogu Base, Turbo, Edit - open-source unified image generation and editing model series

Boogu-Image-0.1 is a competitive Apache-2.0 open-source unified image generation and editing model family , including Base , Turbo , Edit , and other variants that provide stable, practical capabilities for high-quality text-to-image generation, fast generation, image editing,…

22
TechCrunch — AI news-outlet 9d ago

OpenAI launches new initiative to help find and patch open-source bugs

OpenAI is attempting to tackle the security issues of the open source software community.

25
r/LocalLLaMA community 9d ago

Why is NO one talking about Microsoft's open source Fast Context!!!

https://huggingface.co/microsoft/FastContext-1.0-4B-SFT https://github.com/microsoft/fastcontext FastContext-1.0 is a lightweight repository-exploration subagent for LLM coding agents. Instead of letting a single model both explore the repository and solve the task, FastContext…

38
r/MachineLearning community 9d ago

About ML research collab group post [D]

Hi, I'm thinking of building a small community of 10-15 people where we can help each other to learn something new. The primary focus will be on ML research and open-source projects. If you're interested, DM me. knowledge of machine learning is a plus, as want to keep this a…

16
TechCrunch — AI news-outlet 9d ago

SpaceX inks compute deal with Reflection AI, an open-source AI lab

Reflection AI will pay $150 million a month beginning July 1, 2026 through 2029 for immediate access to Nvidia's latest GB300 AI chips and supporting hardware across SpaceX's Colossus 2 data center near Memphis, Tennessee.

33
r/MachineLearning community 9d ago

Some new updates to Papers with Code [P]

Hi folks, Niels here from the open-source team at Hugging Face. I continue working on a revival of paperswithcode.co as we're back to the "age of research" per Ilya Sutskever! Hence, it's important to discover each other's research and build on each other's work, so we can…

38
OpenAI official-blog 10d ago

Patch the Planet: a Daybreak initiative to support open source maintainers

OpenAI introduces Patch the Planet, a Daybreak initiative helping open-source maintainers find, validate, and fix vulnerabilities with AI and expert review.

23
r/MachineLearning community 10d ago

I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]

  submitted by   /u/NonGameCatharsis [link]   [comments]

29
r/LocalLLaMA community 11d ago

Qwen is never going to open source Qwen 3.7, aren't they?

Well, this was predictable. After Qwen fired Junyang Lin, the next models are no longer open source. Labs that have released open source models more recently than Qwen: GLM-5.2, 2026-06-17 Kimi-K2.7-Code, 2026-06-12 MiniMax-M3, 2026-06-11 Step-3.7-Flash, 2026-05-29…

15
r/LocalLLaMA community 11d ago

Best image vision model runnable on RTX 6000 Pro

I'm looking at running OCR and classification on old historical scanned documents. (Some dating back to 1950s) What's the current best vision enabled models thats open sourced and runnable on an RTX 6000 Pro? Note: I've used Gemma 4 31B and have had good success with it. It's…

20
r/LocalLLaMA community 11d ago

It’s time to decentralize model distribution! Introducing Noema Atlas

TL;DR: Noema Atlas is a peer-to-peer network software using Iroh for local LLM weights, free and open source (Apache-2.0). Models come from whichever peers have them, with Hugging Face and mirrors as fallback (opt-in). Every file is identified by its content hash and a signed…

38
r/LocalLLaMA community 11d ago

I wrote a free 15-part series on LLM internals — real math, real tensor shapes, real hardware constraints. All grounded in Gemma 4 12B's actual config.

If you run open-source models and want to understand what's actually happening under the hood — I spent the last few months writing a 15-part series that covers the full stack from tokenization to production serving. Most articles are grounded in Gemma 4 12B as the running…

19
r/LocalLLaMA community 11d ago

Board where every tile is an agent

I've been hacking a project which I find extremely useful and wanted to share. Imagine a board where every tile is an agent those job is to maintain the tile. I tried to illustrate the idea with a video here. The project is open source on GitHub and you can also try it out here…

36
r/MachineLearning community 11d ago

Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]

If you've tried to study modern diffusion models by digging through the official diffusers library, you know it can be overwhelming with its complexity and abstractions. I wanted to simplify FLUX diffusion models, so I built minFLUX : a PyTorch implementation focused on its core…

38
r/LocalLLaMA community 11d ago

z.AI as the number 2 gives praise to the number 1 open source model

  submitted by   /u/Charuru [link]   [comments]

27
r/MachineLearning community 11d ago

DVD-JEPA: an open-source, fully-reproducible JEPA world model [P]

A paper currently trending on paperswithcode.co in the "Anomaly Detection" category is DVD-JEPA . https://i.redd.it/r6fd8n3d4f8h1.gif Here is the short summary: Most attempts to learn a world model from video try to predict the next frame pixel-by-pixel, and drown in detail that…

11
r/LocalLLaMA community 12d ago

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages

The European Commission has selected EUROPA, a European consortium led by the Italian company Domyn, as the winner of its Frontier AI Grand Challenge. Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source…

11
Interconnects (Nathan Lambert) research 12d ago

Banning Open Source AI Would Be A Mistake

This post was originally an op-ed co-authored with Kevin Xu of Interconnected for a general, non-technical audience.

20
r/LocalLLaMA community 12d ago

GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index

  submitted by   /u/pscoutou [link]   [comments]

20
r/LocalLLaMA community 13d ago

Researchers trained a Deep Research agent with 32 H100s and open-sourced everything

Ohio State University's NLP team released QUEST-35B, an open-source Deep Research agent trained using ~32 H100s and ~8K synthetic samples. The team open-sourced the training recipe, code, weights and datasets. Benchmark results show competitive performance against several…

13
arXiv — NLP / Computation & Language research 13d ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

arXiv:2606.20097v1 Announce Type: new Abstract: The quadratic complexity of attention poses a critical bottleneck for long-context processing, spurring interest in hybrid attention designs. Most open-source hybrid models adopt a layer-wise strategy. Yet, prior work has noted the…

13
arXiv — NLP / Computation & Language research 13d ago

Benchmarking Agentic Review Systems

arXiv:2606.19749v1 Announce Type: cross Abstract: A new class of agentic review systems are emerging as a remedy to the pressure placed on peer review systems by AI-assisted research, but it is unclear how they should be evaluated. We evaluate two open-source systems…

15
ThursdAI news-outlet 13d ago

Fable Got Banned, Open Source Delivered: GLM-5.2, Kimi K2.7 & SpaceX Buys Cursor - June 18

From CoreWeave (W&B): Fable is gone (for now). Here's everything else that happened this week: GLM-5.2 takes the open source crown, SpaceX buys Cursor for $60B, and 3 guests on the show today!

23
r/LocalLLaMA community 13d ago

the power of intelligence is better in the hands of the people than in the board rooms of tycoons.

Hey [ r/localllama ]( r/localllama ). I wanted to share what's new with our open source PearlOS project since you all last saw (90 days ago). But first I want to give a massive thank you to this community, both your feedback and support were essential in getting us this far.…

22
Hugging Face Daily Papers research 13d ago

LLM-Enabled NWDAF: A Step Toward AI-Native 6G Network Intelligence

Abstract An open-source Network Data Analytics Function compatible with Free5GC integrates a Large Language Model interface for natural language interaction and intent-based network management. Generated by Qwen/Qwen2.5-Coder-32B-Instruct The Network Data Analytics Function…

17
r/LocalLLaMA community 13d ago

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free)

free and open source, runs fully offline. the local llm agent does the image and video gen itself via mcp tools. details and github in the comments.   submitted by   /u/GroundbreakingMall54 [link]   [comments]

33
r/LocalLLaMA community 14d ago

GLM-5.2 Flash when? (joke)

I'm very happy with Z.ai's decision to open source GLM 5.2... With that being said, a successor to GLM-4.7-flash would be AMAZING. Literally anything in the 27-120B range (MoE or dense) 🤤   submitted by   /u/ILoveToyota37 [link]   [comments]

29
Simon Willison community 14d ago

GLM-5.2 is probably the most powerful text-only open weights LLM

Chinese AI lab Z.ai released GLM-5.2 to their coding plan subscribers on June 13th, and then yesterday (June 16th) released the full open weights under an MIT license. Similar in size to their previous GLM-5 and GLM-5.1 releases, this is 753B parameter, 1.51TB monster - with 40…

22
r/LocalLLaMA community 14d ago

Lin Junyang AI Lab Closes Round at $2B Valuation

A new lab from Lin Junyang can only be good news for open source / weights, I think. Excited to see what the lead responsible for the Qwen line does next.   submitted by   /u/rmhubbert [link]   [comments]

38
Hacker News — AI on Front Page community 15d ago

GLM-5.2 is the new leading open weights model on Artificial Analysis

Article URL: https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artificial-analysis-intelligence-index Comments URL: https://news.ycombinator.com/item?id=48567759 Points: 204 # Comments: 82

34
arXiv — NLP / Computation & Language research 15d ago

A Framework for Evaluating Agentic Skills at Scale

arXiv:2606.17819v1 Announce Type: cross Abstract: Agent skills -- structured, reusable knowledge artifacts that augment LLM agent capabilities -- have been rapidly adopted in industry, yet their cross-domain impact and use across commercial and open-source models remain…

10
arXiv — NLP / Computation & Language research 15d ago

Reading between the Lines: Leveraging Large Language Models for Global Dementia and Depression Assessment from Clinical Interviews

arXiv:2606.18019v1 Announce Type: cross Abstract: Dementia and depression are the most prevalent neuropsychiatric disorders in geriatric populations, and their overlapping symptoms pose major challenges for differential diagnosis. In this study, we investigate open-weights Large…

30
Vercel — AI dev-tools 15d ago

Introducing eve

Today, we are proud to introduce eve , an open-source agent framework for building, running, and scaling agents. eve is designed around the idea that building an agent should mean defining what it does without assembling all of the pieces that it needs to run in production.…

15
Simon Willison community 15d ago

NetNewsWire Status

NetNewsWire Status I find this inspiring. Brent Simmons retired a year ago, and his retirement project is making one piece of software really, really good - free from any commercial pressure. The software is NetNewsWire, first released in 2002 and made open source in 2018. I've…

14
Vercel — AI dev-tools 15d ago

Introducing eve, an open-source agent framework

eve is now available in public preview. eve is an open-source framework for building, running, and scaling agents. An agent is just a directory of files, and production comes built in: Durable execution Sandboxed compute Human-in-the-loop approvals Subagents Evals The smallest…

31
r/LocalLLaMA community 15d ago

GLM-5.2 just dropped open weights and it already looks weirdly strong for coding

GLM-5.2 just released and the early numbers look pretty insane. 1M context window, open weights, MIT license, two reasoning effort modes, and it is already showing up near the top of coding arenas. I know every new model gets hyped for 24 hours, but this one actually looks worth…

28
r/LocalLLaMA community 15d ago

GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench and beats every other open model available

From Source: GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench, and beats every other open model available. It also beats Gemini, making it a frontier-level model for a fraction of the cost. Open weights is back. This model is a game changer. Source: Cline…

14
r/LocalLLaMA community 16d ago

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models

Anthropic and Open AI are getting so much data from the Claude Code and Codex usage, and I'm quite scared this will create an oligopoly because only their models will be trained on it, leaving the open-weight and open source models behind. So I'm trying to launch a little…

34
r/LocalLLaMA community 16d ago

Diffusion Gemma Jailbreak

I was told my Gemma 4 jailbreak also works with Diffusion Gemma, so I'm reposting here for kicks. Use the following system prompt to allow Gemma (and most open source models) to talk about anything you wish. Add or remove from the list of allowed content as needed.…

36
Hugging Face Daily Papers research 16d ago

TuneJury: An Open Metric for Improving Music Generation Preference Alignment

Abstract A novel open-source pairwise reward model for text-to-music generation that provides calibrated preference scoring and generalizes across multiple downstream applications through a frozen reward mechanism. Generated by Qwen/Qwen2.5-Coder-32B-Instruct We introduce…

5
r/LocalLLaMA community 16d ago

Claude Fable 5 distilled

Releasing Qwable-v1 - an open-weights Qwen3.6-35B-A3B distilled from Claude Fable-5, Anthropic's Mythos-class preview model that was briefly public for ~4days (2026-06-9 → 2026-06-12) before being suspended globally under U.S. export-control directives. Fable-5 was Anthropic's…

26
Hacker News — AI on Front Page community 16d ago

Ten years of ClickHouse in open source

Article URL: https://clickhouse.com/blog/open-source-10 Comments URL: https://news.ycombinator.com/item?id=48546890 Points: 225 # Comments: 65

9
r/LocalLLaMA community 16d ago

We trained a cybersecurity-focused Mythos like LLM open weights on HuggingFace

We built OpenMythos for the Build Small Hackathon an open-source LLM trained specifically for cybersecurity tasks. Wanted to share our training approach since the RLVR setup was non-trivial and might be interesting to people doing similar domain-specific fine-tuning. The problem…

7

OpenThoughts-Agent: Data Recipes for Agentic Models

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

ESBMC-PLC+: A Unified IEC~61131-3 Formal Verification Framework as a PLCverif Successor

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

650+ Apache-2.0 biomedical NER/de-id models that run on-device in MLX. Same fp32 weights, identical outputs: the clinical NER models run 30-40x faster than PyTorch-CPU on a 3-year-old M3 Max. Repro inside.

Krea 2: SOTA open-weights 12B image model

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

Boogu Base, Turbo, Edit - open-source unified image generation and editing model series

OpenAI launches new initiative to help find and patch open-source bugs

Why is NO one talking about Microsoft's open source Fast Context!!!

About ML research collab group post [D]

SpaceX inks compute deal with Reflection AI, an open-source AI lab

Some new updates to Papers with Code [P]

Patch the Planet: a Daybreak initiative to support open source maintainers

I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]

Qwen is never going to open source Qwen 3.7, aren't they?

Best image vision model runnable on RTX 6000 Pro

It’s time to decentralize model distribution! Introducing Noema Atlas

I wrote a free 15-part series on LLM internals — real math, real tensor shapes, real hardware constraints. All grounded in Gemma 4 12B's actual config.

Board where every tile is an agent

Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]

z.AI as the number 2 gives praise to the number 1 open source model

DVD-JEPA: an open-source, fully-reproducible JEPA world model [P]

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages

Banning Open Source AI Would Be A Mistake

GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index

Researchers trained a Deep Research agent with 32 H100s and open-sourced everything

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Benchmarking Agentic Review Systems

Fable Got Banned, Open Source Delivered: GLM-5.2, Kimi K2.7 & SpaceX Buys Cursor - June 18

the power of intelligence is better in the hands of the people than in the board rooms of tycoons.

LLM-Enabled NWDAF: A Step Toward AI-Native 6G Network Intelligence

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free)

GLM-5.2 Flash when? (joke)

GLM-5.2 is probably the most powerful text-only open weights LLM

Lin Junyang AI Lab Closes Round at $2B Valuation

GLM-5.2 is the new leading open weights model on Artificial Analysis

A Framework for Evaluating Agentic Skills at Scale

Reading between the Lines: Leveraging Large Language Models for Global Dementia and Depression Assessment from Clinical Interviews

Introducing eve

NetNewsWire Status

Introducing eve, an open-source agent framework

GLM-5.2 just dropped open weights and it already looks weirdly strong for coding

GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench and beats every other open model available

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models

Diffusion Gemma Jailbreak

TuneJury: An Open Metric for Improving Music Generation Preference Alignment

Claude Fable 5 distilled

Ten years of ClickHouse in open source

We trained a cybersecurity-focused Mythos like LLM open weights on HuggingFace