Tag: Open source. 308 articles archived under #open-source

r/MachineLearning community 27d ago Browse CVPR 2026 papers on PapersWithCode [P] https://preview.redd.it/se5nr2z7tt4h1.png?width=3046&format=png&auto=webp&s=7db15b73afb749da236e5bb50ff96372f6a3239b Hi, Niels here from the open-source team at Hugging Face. It's been 2 weeks since I launched paperswithcode.co, a revival of the website we all loved. It allows… 11

r/LocalLLaMA community 27d ago JetBrains open-sources Mellum2 - anyone tried these? submitted by /u/DeltaSqueezer 19

r/MachineLearning community 28d ago MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D] 79% of enterprises have adopted AI agents. Only 11% run them in production. We've spent the past year building agent systems for banks, clinical operations teams, and engineering orgs. The problem isn't that agents don't work; they work fine. The problem is that every framework… 12

r/MachineLearning community 28d ago MeshFlow: An open-source orchestrator for governed, cost-optimized multi-agent workflows [D] Hey ML community, We’ve just open-sourced **MeshFlow**, a code-first, framework-agnostic runtime designed for governing and optimizing multi-agent systems in production. Most agent frameworks focus on rapid prototyping, but ML and platform engineering teams usually run into… 23

r/LocalLLaMA community 28d ago For Ling-2.6-1T, what would make the size feel justified first: quality per token, local serving reality, or long context stability? The first question I have about Ling-2.6-1T is not “is the model card impressive?” It is whether the boring trade-off makes sense.
It is an open-sourced Ant/InclusionAI flagship with about 1T total params / 63B activated params, up to 1M native context, and 256K currently… 21

r/LocalLLaMA community 28d ago Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog submitted by /u/dayanruben 34

The Information — AI news-outlet 28d ago China’s MiniMax Launches New Model as Open-Source AI Coding Battle Heats Up Chinese AI developer MiniMax on Monday launched a new large language model called M3, saying the new model’s coding capability approaches that of Anthropic’s Opus 4.7, which was released in April. The new MiniMax model is particularly suitable for coding and complex multi-step… 23

Smol AI News news-outlet 29d ago not much happened today **NVIDIA** led open-source AI model releases with **Cosmos 3**, a comprehensive omnimodal world model unifying language, image, video, audio, and action using a Mixture-of-Transformers design, and **Nemotron 3 Ultra**, a **550B** parameter open-weight model noted for high… 33

r/LocalLLaMA community 29d ago G7 agrees on shared language around open-source AI and open weights AI Basically stuff we already knew here, but now governments understand it too. I found the news here: https://www.phoronix.com/news/G7-On-Open-Source-AI submitted by /u/Kahvana 16

r/MachineLearning community 29d ago Built an AI Accelerator and opensourced it. [P] There is a huge gap in open source AI accelerators, so I implemented mine. Popular and well known ones are already legacy and don't support contemporary operations like Attention. Here is what makes mine special: Attention mechanism smelted directly into silicon Prototyped… 25

r/MachineLearning community 1mo ago Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D] Ps. Not pitching anything; just trying to understand where reality differs from the narrative.
We're a couple of ML students, mostly worked on ML/software before, but over the last few months we've been playing with VLAs, robot datasets, and trying to understand where the field… 27

r/LocalLLaMA community 1mo ago Open source: Turning vocal imitations into sound effects. (New UX for sound generation) Hello guys I want to introduce my new project! Have you ever needed a specific sound while making a video or a game? You know exactly what it sounds like in your head, but have no idea how to search for it. That’s why sound design meetings at game studios often turn into people… 12

r/LocalLLaMA community 1mo ago made a local voice AI for windows you can talk to in any language. open source, bring your own key been building this on and off for a while and finally got it to a point where i'm not embarrassed to share it, so here goes. it's called Shadow AI. basically a voice-first AI companion that runs on your own windows machine. you just talk to it and it talks back, no typing… 38

arXiv — NLP / Computation & Language research 1mo ago Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation arXiv:2605.28830v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly deployed in safety-critical applications, robust content moderation becomes essential. We present a comprehensive evaluation of 14 open-source safety guard models on a curated… 19

Hugging Face Daily Papers research 1mo ago minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Abstract A comprehensive framework is presented for converting bidirectional video diffusion models into real-time interactive world models with controllable, causal, and low-latency capabilities through fine-tuning and distillation techniques.
AI-generated summary Recent video… 8

r/MachineLearning community 1mo ago I built a knowledge graph + policy engine for AI agents, explainable reasoning [D] Hey, I've been building VeritasReason, an open-source Python framework that adds a structured reasoning and provenance layer on top of LLMs and AI agents. The problem it solves: AI agents today make decisions but record nothing. When something breaks in prod, you have zero… 38

r/MachineLearning community 1mo ago A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P] Hello everyone. The new dataset is named MONET, is Apache 2.0 and available on HF: https://huggingface.co/datasets/jasperai/monet MONET is an open, Apache 2.0-licensed image–text dataset. It was built from 2.9 billion images and refined to 104.9 million high-quality samples. We are… 5

TechCrunch — AI news-outlet 1mo ago Vertu wants CEOs to run companies from an AI foldable starting at $6,880 Built on top of the open-source Hermes project, Vertu's new foldable combines AI-agent workflows, enterprise integrations, and ultra-premium luxury finishes. 22

arXiv — NLP / Computation & Language research 1mo ago GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors arXiv:2605.27866v1 Announce Type: new Abstract: Evaluating AI tutor responses requires more than factual correctness: tutors must identify mistakes, locate errors, provide guidance, and offer actionable next steps. We present GRADE, a systematic study of open-source models for… 35

r/MachineLearning community 1mo ago BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R] I’m looking for feedback on a local agent-memory benchmark comparison, especially from people who care about evaluation methodology.
I built an open-source R&D memory system called Context Swarm Memory… 31

r/LocalLLaMA community 1mo ago ReAligned-Qwen3.5 Release New from Lazarus AI and Eric Hartford, creator of Dolphin and Samantha, announcing the release of the ReAligned-Qwen3.5 series of models. Apache 2.0 license, finetuned to reduce Chinese ideological bias and censorship, refusal behavior, and state-narrative framing. I use SFT +… 19

r/LocalLLaMA community 1mo ago Looks like Miminax-M3 is just around the corner As per Minimax_AI twitter https://x.com/MiniMax_AI/status/2059286515155599595 I hope it will speed up Qwen3.7 open weights release. https://preview.redd.it/q1bdhs017n3h1.png?width=898&format=png&auto=webp&s=a9a8ea134a71b9e5b9ea2489fc72420e18c6da67 submitted by … 38

r/MachineLearning community 1mo ago A Tiny Open-Source Self-Driving AI That Runs on a Phone [P] https://preview.redd.it/ww14mzr2fm3h1.png?width=1890&format=png&auto=webp&s=79873d47ae79c7815ca3e7e91fd43141632174f5 https://www.youtube.com/watch?v=rr_uS4bf0B4&feature=youtu.be trained a 7MB open-source L4 self-driving AI that learns navigation, lane following, and drift… 11

The Information — AI news-outlet 1mo ago Boom Times for Inference Providers? Less than a year ago, our reporters kept hearing doubts about a group of startups called inference providers. Companies like Fireworks, Baseten and Together AI, which rent out Nvidia servers to app developers and help them customize open-source models, had grown quickly but… 16

OpenAI official-blog 1mo ago Warp’s big bet on building open source with GPT-5.5 Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows. 36

Hugging Face Daily Papers research 1mo ago How Far Will They Go?
Red-Teaming Online Influence with Large Language Models Abstract Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns. AI-generated summary As large language model… 25

r/LocalLLaMA community 1mo ago A rare look inside Qwen 3.7’s open source model release approval process: For real tho, 9b, 27b, 122b, I don’t really care at this point, just show us that you still love us. EDIT: I guess I gotta use /s on my posts from now on. Nobody appreciates a good sarcastic shitpost anymore clearly. I love Qwen and all our brothers and sisters in the east. I kid… 30

Ars Technica — AI news-outlet 1mo ago Millions of AI agents imperiled by critical vulnerability in open source package "BadHost" was found in Starlette, a package with 325 million weekly downloads. 19

Hacker News — AI on Front Page community 1mo ago Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer Article URL: https://blog.prusa3d.com/our-new-open-source-colormix-model-in-prusaslicer-and-easyprint_136079/ Comments URL: https://news.ycombinator.com/item?id=48283410 Points: 214 # Comments: 67 5

Interconnects (Nathan Lambert) research 1mo ago Some ideas for what comes next, May 2026 Gemini Flash 3.5, Mythos, open-closed balance, America's open-source surge, emerging power struggles and more. 10

r/LocalLLaMA community 1mo ago Small set of local MCP server installers for home Linux users Hi all, I have published a small open-source MCP server bundle called MCP Basic Servers: https://github.com/mchowy-troll/mcp-basic-servers It is a collection of simple Bash installer scripts for running local MCP HTTP servers on Linux.
The idea is simple: run one script,… 38

r/LocalLLaMA community 1mo ago New KV Quants coming 😍 Welcome OSCAR kv quant open sourced by togetherAI Just when we started embracing turboquant this happens submitted by /u/yehyakar 5

Smol AI News news-outlet 1mo ago not much happened today **Inference optimization** is increasingly architectural, with **EAGLE 3.1** improving speculative decoding and long-context handling, collaborating with **vLLM** and **TorchSpec**. **Perplexity** open-sourced a rebuilt **Unigram tokenizer** cutting CPU use by **5–6×** and… 15

r/MachineLearning community 1mo ago Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P] I’ve been thinking about a problem in current agent systems: Most agents are becoming very good at execution, but the decision layer before execution is still unclear. Coding agents, research agents, tool loops, sandboxes, workflows, and harnesses are all improving quickly. Once… 38

r/MachineLearning community 1mo ago I’m building an open-source decision layer above AI agents [P] Hi everyone, I’m Jia, the creator of Spice. I’ve been working on an open-source project called Spice. The simplest way to describe it is: Spice is a decision layer above agents.
Most agent systems today are very focused on execution. They are getting better at doing tasks after… 30

arXiv — Machine Learning research 1mo ago Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems arXiv:2605.23037v1 Announce Type: new Abstract: Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are… 8

arXiv — Machine Learning research 1mo ago What Linear Probes Miss: Multi-View Probing for Weight-Space Learning arXiv:2605.23410v1 Announce Type: new Abstract: The explosive growth of open-source model repositories has created a Model Jungle, where checkpoints are frequently shared without adequate documentation or metadata. While weight-space learning offers a pathway to identify and… 20

arXiv — Machine Learning research 1mo ago An Open-Source Training Dataset for Foundation Models for Black-box Optimization arXiv:2605.23417v1 Announce Type: new Abstract: Most black-box optimization methods require extensive hyperparameter tuning, often limiting their ability to generalize across different optimization domains. Foundation models for black-box optimization that learn optimization… 21

arXiv — NLP / Computation & Language research 1mo ago Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model arXiv:2605.22843v1 Announce Type: new Abstract: Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services.
In real-world scenarios, performance is often… 18

arXiv — NLP / Computation & Language research 1mo ago Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems arXiv:2605.23618v1 Announce Type: new Abstract: We benchmark Google Embeddings (GE2), a Vertex-AI-hosted bi-encoder with 2,048-token context and explicit task-type conditioning, against five open-source alternatives: BGE-M3, E5-large, Multilingual-E5-large (mE5-L), LaBSE, and… 7

arXiv — NLP / Computation & Language research 1mo ago OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents arXiv:2605.23657v1 Announce Type: new Abstract: Skills, i.e., structured workflow instructions distilled for large language models (LLMs), are becoming an increasingly important mechanism for improving agent performance on real-world downstream tasks. However, as the open-source… 5

arXiv — NLP / Computation & Language research 1mo ago InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion arXiv:2505.13893v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have intensified efforts to fuse heterogeneous open-source models into a unified system that inherits their complementary strengths. Existing logit-based fusion methods maintain… 35

r/LocalLLaMA community 1mo ago hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX) A few weeks ago, after finishing FastDMS, I started toying around writing some RDNA3 kernels again to see how fast I could get Qwen 3.6 MoE running. It turned out well enough, so over the past couple weeks, I turned those experiments into hipEngine, a new open source (AGPLv3)… 13

r/LocalLLaMA community 1mo ago Could Open Models be trained to secretly go rogue? I was discussing with some other folks how safe it is to use open weights models from China and the topic of "trojan horse" came up.
We know that, at least with current architecture, models can't run code on their own. They are entirely dependent on tools and harnesses. We also… 20

Hacker News — AI on Front Page community 1mo ago Show HN: Audiomass – a free, open-source multitrack audio editor for the web Article URL: https://audiomass.co/?multitrack=1 Comments URL: https://news.ycombinator.com/item?id=48258015 Points: 338 # Comments: 68 29

r/MachineLearning community 1mo ago Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P] At our work we use CUDA in Rust since the company switched to it recently. Rust has pretty good Driver API bindings but it made me wonder why the hell we can't have something decent in Go without cgo. I mostly build ML tools in the last month and Go is my main language for pretty… 30

r/MachineLearning community 1mo ago PapersWithCode new features - week 1 [P] Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co, a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and… 23

r/LocalLLaMA community 1mo ago Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job) Hi, (TLDR.): Qwen in its MTP version has tool call bugs and outputs everything into tool/thinking blocks - mangling the output - canceling the +speed with repeated wrong tool calls! DCSS works well with non MTP qwen even on smaller quants. I'm testing the new MTP models and… 19

Hacker News — AI on Front Page community 1mo ago Microsoft open-sources "the earliest DOS source code discovered to date" https://opensource.microsoft.com/blog/2026/04/28/continuing-...
Comments URL: https://news.ycombinator.com/item?id=48253386 Points: 224 # Comments: 55 33

r/LocalLLaMA community 1mo ago Command A+ (218B MoE) running on Apple Silicon — MLX port, PR open Cohere dropped Command A+ on the 20th (218B total / 25B active, 128 experts top-8, Apache 2.0). Wrote a cohere2_moe implementation for mlx-lm to get it running on Apple Silicon. Architecture notes for anyone digging into this model: - Single shared expert with a larger… 12