News / #open-source Tag Open source 329 articles archived under #open-source · RSS Sign in to follow Hugging Face Daily Papers research 8d ago OpenThoughts-Agent: Data Recipes for Agentic Models Abstract An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training data. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Agentic language models dramatically… 34 arXiv — Machine Learning research 8d ago PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models arXiv:2606.24388v1 Announce Type: cross Abstract: We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and practical, extending existing benchmarks by… 38 arXiv — NLP / Computation & Language research 8d ago ESBMC-PLC+: A Unified IEC~61131-3 Formal Verification Framework as a PLCverif Successor arXiv:2606.23870v1 Announce Type: cross Abstract: PLCverif is the most mature open-source platform for PLC formal verification, developed at CERN and in production use since 2019. Yet it has two fundamental limitations: no support for Ladder Diagram (LD) programs, the dominant… 35 Hugging Face Daily Papers research 8d ago TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization Abstract A unified open-source framework for discrete text-trigger optimization that standardizes the development and execution of optimization strategies across various domains and applications. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Discrete text-trigger optimization --… 18 r/LocalLLaMA community 8d ago 650+ Apache-2.0 biomedical NER/de-id models that run on-device in MLX. Same fp32 weights, identical outputs: the clinical NER models run 30-40x faster than PyTorch-CPU on a 3-year-old M3 Max. Repro inside. Disclosure first: I maintain OpenMed, so read this with that bias. I'm posting the numbers with the full methodology and a runnable script so you can reproduce or tear it apart. I'm here for the next couple of hours to answer methodology questions. What it is: an open-source… 25 Hacker News — AI on Front Page community 8d ago Krea 2: SOTA open-weights 12B image model Article URL: https://www.krea.ai/blog/krea-2-technical-report Comments URL: https://news.ycombinator.com/item?id=48646659 Points: 247 # Comments: 33 4 Hugging Face Daily Papers research 9d ago AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction Abstract AOHP presents an Android-based operating system framework that treats AI agents as first-class entities, enhancing task completion rates and reducing execution costs through specialized agent-oriented mechanisms. Generated by Qwen/Qwen2.5-Coder-32B-Instruct AI agents… 16 r/LocalLLaMA community 9d ago Boogu Base, Turbo, Edit - open-source unified image generation and editing model series Boogu-Image-0.1 is a competitive Apache-2.0 open-source unified image generation and editing model family , including Base , Turbo , Edit , and other variants that provide stable, practical capabilities for high-quality text-to-image generation, fast generation, image editing,… 22 TechCrunch — AI news-outlet 9d ago OpenAI launches new initiative to help find and patch open-source bugs OpenAI is attempting to tackle the security issues of the open source software community. 25 r/LocalLLaMA community 9d ago Why is NO one talking about Microsoft's open source Fast Context!!! https://huggingface.co/microsoft/FastContext-1.0-4B-SFT https://github.com/microsoft/fastcontext FastContext-1.0 is a lightweight repository-exploration subagent for LLM coding agents. Instead of letting a single model both explore the repository and solve the task, FastContext… 38 r/MachineLearning community 9d ago About ML research collab group post [D] Hi, I'm thinking of building a small community of 10-15 people where we can help each other to learn something new. The primary focus will be on ML research and open-source projects. If you're interested, DM me. knowledge of machine learning is a plus, as want to keep this a… 16 TechCrunch — AI news-outlet 9d ago SpaceX inks compute deal with Reflection AI, an open-source AI lab Reflection AI will pay $150 million a month beginning July 1, 2026 through 2029 for immediate access to Nvidia's latest GB300 AI chips and supporting hardware across SpaceX's Colossus 2 data center near Memphis, Tennessee. 33 r/MachineLearning community 9d ago Some new updates to Papers with Code [P] Hi folks, Niels here from the open-source team at Hugging Face. I continue working on a revival of paperswithcode.co as we're back to the "age of research" per Ilya Sutskever! Hence, it's important to discover each other's research and build on each other's work, so we can… 38 OpenAI official-blog 10d ago Patch the Planet: a Daybreak initiative to support open source maintainers OpenAI introduces Patch the Planet, a Daybreak initiative helping open-source maintainers find, validate, and fix vulnerabilities with AI and expert review. 23 r/MachineLearning community 10d ago I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]   submitted by   /u/NonGameCatharsis [link]   [comments] 29 r/LocalLLaMA community 11d ago Qwen is never going to open source Qwen 3.7, aren't they? Well, this was predictable. After Qwen fired Junyang Lin, the next models are no longer open source. Labs that have released open source models more recently than Qwen: GLM-5.2, 2026-06-17 Kimi-K2.7-Code, 2026-06-12 MiniMax-M3, 2026-06-11 Step-3.7-Flash, 2026-05-29… 15 r/LocalLLaMA community 11d ago Best image vision model runnable on RTX 6000 Pro I'm looking at running OCR and classification on old historical scanned documents. (Some dating back to 1950s) What's the current best vision enabled models thats open sourced and runnable on an RTX 6000 Pro? Note: I've used Gemma 4 31B and have had good success with it. It's… 20 r/LocalLLaMA community 11d ago It’s time to decentralize model distribution! Introducing Noema Atlas TL;DR: Noema Atlas is a peer-to-peer network software using Iroh for local LLM weights, free and open source (Apache-2.0). Models come from whichever peers have them, with Hugging Face and mirrors as fallback (opt-in). Every file is identified by its content hash and a signed… 38 r/LocalLLaMA community 11d ago I wrote a free 15-part series on LLM internals — real math, real tensor shapes, real hardware constraints. All grounded in Gemma 4 12B's actual config. If you run open-source models and want to understand what's actually happening under the hood — I spent the last few months writing a 15-part series that covers the full stack from tokenization to production serving. Most articles are grounded in Gemma 4 12B as the running… 19 r/LocalLLaMA community 11d ago Board where every tile is an agent I've been hacking a project which I find extremely useful and wanted to share. Imagine a board where every tile is an agent those job is to maintain the tile. I tried to illustrate the idea with a video here. The project is open source on GitHub and you can also try it out here… 36 r/MachineLearning community 11d ago Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P] If you've tried to study modern diffusion models by digging through the official diffusers library, you know it can be overwhelming with its complexity and abstractions. I wanted to simplify FLUX diffusion models, so I built minFLUX : a PyTorch implementation focused on its core… 38 r/LocalLLaMA community 11d ago z.AI as the number 2 gives praise to the number 1 open source model   submitted by   /u/Charuru [link]   [comments] 27 r/MachineLearning community 11d ago DVD-JEPA: an open-source, fully-reproducible JEPA world model [P] A paper currently trending on paperswithcode.co in the "Anomaly Detection" category is DVD-JEPA . https://i.redd.it/r6fd8n3d4f8h1.gif Here is the short summary: Most attempts to learn a world model from video try to predict the next frame pixel-by-pixel, and drown in detail that… 11 r/LocalLLaMA community 12d ago Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages The European Commission has selected EUROPA, a European consortium led by the Italian company Domyn, as the winner of its Frontier AI Grand Challenge. Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source… 11 Interconnects (Nathan Lambert) research 12d ago Banning Open Source AI Would Be A Mistake This post was originally an op-ed co-authored with Kevin Xu of Interconnected for a general, non-technical audience. 20 r/LocalLLaMA community 12d ago GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index   submitted by   /u/pscoutou [link]   [comments] 20 r/LocalLLaMA community 13d ago Researchers trained a Deep Research agent with 32 H100s and open-sourced everything Ohio State University's NLP team released QUEST-35B, an open-source Deep Research agent trained using ~32 H100s and ~8K synthetic samples. The team open-sourced the training recipe, code, weights and datasets. Benchmark results show competitive performance against several… 13 arXiv — NLP / Computation & Language research 13d ago HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization arXiv:2606.20097v1 Announce Type: new Abstract: The quadratic complexity of attention poses a critical bottleneck for long-context processing, spurring interest in hybrid attention designs. Most open-source hybrid models adopt a layer-wise strategy. Yet, prior work has noted the… 13 arXiv — NLP / Computation & Language research 13d ago Benchmarking Agentic Review Systems arXiv:2606.19749v1 Announce Type: cross Abstract: A new class of agentic review systems are emerging as a remedy to the pressure placed on peer review systems by AI-assisted research, but it is unclear how they should be evaluated. We evaluate two open-source systems… 15 ThursdAI news-outlet 13d ago Fable Got Banned, Open Source Delivered: GLM-5.2, Kimi K2.7 & SpaceX Buys Cursor - June 18 From CoreWeave (W&B): Fable is gone (for now). Here's everything else that happened this week: GLM-5.2 takes the open source crown, SpaceX buys Cursor for $60B, and 3 guests on the show today! 23 r/LocalLLaMA community 13d ago the power of intelligence is better in the hands of the people than in the board rooms of tycoons. Hey [ r/localllama ]( r/localllama ). I wanted to share what's new with our open source PearlOS project since you all last saw (90 days ago). But first I want to give a massive thank you to this community, both your feedback and support were essential in getting us this far.… 22 Hugging Face Daily Papers research 13d ago LLM-Enabled NWDAF: A Step Toward AI-Native 6G Network Intelligence Abstract An open-source Network Data Analytics Function compatible with Free5GC integrates a Large Language Model interface for natural language interaction and intent-based network management. Generated by Qwen/Qwen2.5-Coder-32B-Instruct The Network Data Analytics Function… 17 r/LocalLLaMA community 13d ago gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) free and open source, runs fully offline. the local llm agent does the image and video gen itself via mcp tools. details and github in the comments.   submitted by   /u/GroundbreakingMall54 [link]   [comments] 33 r/LocalLLaMA community 14d ago GLM-5.2 Flash when? (joke) I'm very happy with Z.ai's decision to open source GLM 5.2... With that being said, a successor to GLM-4.7-flash would be AMAZING. Literally anything in the 27-120B range (MoE or dense) 🤤   submitted by   /u/ILoveToyota37 [link]   [comments] 29 Simon Willison community 14d ago GLM-5.2 is probably the most powerful text-only open weights LLM Chinese AI lab Z.ai released GLM-5.2 to their coding plan subscribers on June 13th, and then yesterday (June 16th) released the full open weights under an MIT license. Similar in size to their previous GLM-5 and GLM-5.1 releases, this is 753B parameter, 1.51TB monster - with 40… 22 r/LocalLLaMA community 14d ago Lin Junyang AI Lab Closes Round at $2B Valuation A new lab from Lin Junyang can only be good news for open source / weights, I think. Excited to see what the lead responsible for the Qwen line does next.   submitted by   /u/rmhubbert [link]   [comments] 38 Hacker News — AI on Front Page community 15d ago GLM-5.2 is the new leading open weights model on Artificial Analysis Article URL: https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artificial-analysis-intelligence-index Comments URL: https://news.ycombinator.com/item?id=48567759 Points: 204 # Comments: 82 34 arXiv — NLP / Computation & Language research 15d ago A Framework for Evaluating Agentic Skills at Scale arXiv:2606.17819v1 Announce Type: cross Abstract: Agent skills -- structured, reusable knowledge artifacts that augment LLM agent capabilities -- have been rapidly adopted in industry, yet their cross-domain impact and use across commercial and open-source models remain… 10 arXiv — NLP / Computation & Language research 15d ago Reading between the Lines: Leveraging Large Language Models for Global Dementia and Depression Assessment from Clinical Interviews arXiv:2606.18019v1 Announce Type: cross Abstract: Dementia and depression are the most prevalent neuropsychiatric disorders in geriatric populations, and their overlapping symptoms pose major challenges for differential diagnosis. In this study, we investigate open-weights Large… 30 Vercel — AI dev-tools 15d ago Introducing eve Today, we are proud to introduce eve , an open-source agent framework for building, running, and scaling agents. eve is designed around the idea that building an agent should mean defining what it does without assembling all of the pieces that it needs to run in production.… 15 Simon Willison community 15d ago NetNewsWire Status NetNewsWire Status I find this inspiring. Brent Simmons retired a year ago, and his retirement project is making one piece of software really, really good - free from any commercial pressure. The software is NetNewsWire, first released in 2002 and made open source in 2018. I've… 14 Vercel — AI dev-tools 15d ago Introducing eve, an open-source agent framework eve is now available in public preview. eve is an open-source framework for building, running, and scaling agents. An agent is just a directory of files, and production comes built in: Durable execution Sandboxed compute Human-in-the-loop approvals Subagents Evals The smallest… 31 r/LocalLLaMA community 15d ago GLM-5.2 just dropped open weights and it already looks weirdly strong for coding GLM-5.2 just released and the early numbers look pretty insane. 1M context window, open weights, MIT license, two reasoning effort modes, and it is already showing up near the top of coding arenas. I know every new model gets hyped for 24 hours, but this one actually looks worth… 28 r/LocalLLaMA community 15d ago GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench and beats every other open model available From Source: GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench, and beats every other open model available. It also beats Gemini, making it a frontier-level model for a fraction of the cost. Open weights is back. This model is a game changer. Source: Cline… 14 r/LocalLLaMA community 16d ago Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models Anthropic and Open AI are getting so much data from the Claude Code and Codex usage, and I'm quite scared this will create an oligopoly because only their models will be trained on it, leaving the open-weight and open source models behind. So I'm trying to launch a little… 34 r/LocalLLaMA community 16d ago Diffusion Gemma Jailbreak I was told my Gemma 4 jailbreak also works with Diffusion Gemma, so I'm reposting here for kicks. Use the following system prompt to allow Gemma (and most open source models) to talk about anything you wish. Add or remove from the list of allowed content as needed.… 36 Hugging Face Daily Papers research 16d ago TuneJury: An Open Metric for Improving Music Generation Preference Alignment Abstract A novel open-source pairwise reward model for text-to-music generation that provides calibrated preference scoring and generalizes across multiple downstream applications through a frozen reward mechanism. Generated by Qwen/Qwen2.5-Coder-32B-Instruct We introduce… 5 r/LocalLLaMA community 16d ago Claude Fable 5 distilled Releasing Qwable-v1 - an open-weights Qwen3.6-35B-A3B distilled from Claude Fable-5, Anthropic's Mythos-class preview model that was briefly public for ~4days (2026-06-9 → 2026-06-12) before being suspended globally under U.S. export-control directives. Fable-5 was Anthropic's… 26 Hacker News — AI on Front Page community 16d ago Ten years of ClickHouse in open source Article URL: https://clickhouse.com/blog/open-source-10 Comments URL: https://news.ycombinator.com/item?id=48546890 Points: 225 # Comments: 65 9 r/LocalLLaMA community 16d ago We trained a cybersecurity-focused Mythos like LLM open weights on HuggingFace We built OpenMythos for the Build Small Hackathon an open-source LLM trained specifically for cybersecurity tasks. Wanted to share our training approach since the RLVR setup was non-trivial and might be interesting to people doing similar domain-specific fine-tuning. The problem… 7 Page 2 of 7 · 329 articles ← Newer Older →