News / #open-source Tag Open source 308 articles archived under #open-source · RSS Sign in to follow r/MachineLearning community 1mo ago Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P] Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice , an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're… 6 r/LocalLLaMA community 1mo ago meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face 🚀 Model Introduction We are excited to announce the release of LongCat-Video-Avatar 1.5, an upgraded open-source framework that prioritizes extreme empirical optimization and production-readiness for audio-driven human video generation. Built upon the LongCat-Video foundation… 21 r/LocalLLaMA community 1mo ago I fine-tuned Cohere Transcribe to support diarization and timestamps Hi I'll keep it short: Cohere-transcribe is currently the best open source speech to text model (and possibly even better than other proprietary models). BUT it doesn't support diarization (speaker identification) and timestamps, even though there are tokens for it in the… 36 r/LocalLLaMA community 1mo ago DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals https://www.bloomberg.com/news/articles/2026-05-22/deepseek-founder-declares-agi-goal-as-10-billion-round-advances   submitted by   /u/External_Mood4719 [link]   [comments] 17 r/LocalLLaMA community 1mo ago Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings. My paper got published today at Arxiv. It raises questions about how language models behave when the framing of a request shifts. Small open-source AI models can be moved from honest to dishonest behaviour by little more than a change in tone. Asked to solve coding problems… 4 r/LocalLLaMA community 1mo ago 'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI. This has turned out to be useful to many of my friends so I thought I'd share here as well. I created a tool and documentation page for most major open-souce project's adherence to 'OpenAI compatibility' after seeing inconsistencies between engines like vLLM and llama.cpp. Now… 18 arXiv — Machine Learning research 1mo ago The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure? arXiv:2605.20749v1 Announce Type: new Abstract: Gated Linear Units (GLU) and their variants are widely adopted in modern open-source large language model architectures and consistently outperform their non-gated counterparts, yet the underlying reasons for this advantage remain… 34 arXiv — NLP / Computation & Language research 1mo ago Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models arXiv:2605.20591v1 Announce Type: new Abstract: Medical large language models (LLMs), including custom medical GPTs (MedGPTs) and open-source models, are increasingly deployed on web platforms to provide clinical guidance. However, they pose risks of hallucination, policy… 33 r/MachineLearning community 1mo ago l9gpu - open-source GPU observability with workload-level attribution [P] GPU monitoring tools like DCGM give you hardware-level metrics but no workload context. When a node is saturated, you can't tell which experiment, team, or job is responsible without digging through logs. We built l9gpu to close that gap. It's a node-level agent that exports GPU… 25 r/LocalLLaMA community 1mo ago Re. what ever happened to Cohere’s Command-A series of models? Hey everyone, Nick Frosst here from Cohere. A few months ago Aidan (my cofounder) left a comment in here about our Command series and how we were working on some more powerful, open-weights models behind the scenes. We just launched Command A+ and we wanted to share it with you… 37 r/MachineLearning community 1mo ago NOML-NOML: hierarchical TD3 + anchor policy for flight control [P] I built a custom RL algorithm for continuous flight control and open-sourced it. Sharing here in case the structural ideas are useful for anyone doing continuous control where one action axis dominates. I've been training continuous control on a 6-DoF flight sim… 31 arXiv — NLP / Computation & Language research 1mo ago GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment arXiv:2605.19577v1 Announce Type: new Abstract: We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR). Existing long-context RL methods often treat data construction as a matter… 17 Hugging Face Daily Papers research 1mo ago GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment Abstract GoLongRL presents an open-source approach for long-context reinforcement learning with diverse reward optimization through capability-oriented data construction and TMN-Reweight methodology. AI-generated summary We present GoLongRL, a fully open-source,… 37 r/LocalLLaMA community 1mo ago PrivateScribe.ai - Fully local, MIT licensed, free AI transcription built with HIPAA/legal safeguards in mind - One Year Update! I first posted about PrivateScribe.ai ~1yr ago and have recently jumped back intent on bringing it to a functionality that makes it actually usable by non-technical users. One year ago it worked but only the bare minimum. Since then I've gotten ⭐️74 github stars!⭐️ and have had… 31 r/LocalLLaMA community 1mo ago Open weights GLM and Mimo are better than Gemini 3.5 flash according to arena While we are weathering the gemini 3.5 flash hype, keep in mind that according to arena, GLM and Mimo are better. https://arena.ai/leaderboard/text/coding-no-style-control #7 GLM #9 Mimo #12 Gemini 3.5 Flash   submitted by   /u/Terminator857 [link]   [comments] 5 r/LocalLLaMA community 1mo ago Floor for local meeting summarization on a 6GB GPU: qwen3.5:0.8b works at 57s, Granite 4 350M hallucinates Disclosure: I made this. Open-source, MIT, Windows + Linux. Not affiliated with voiceflow.com (the chatbot SaaS, name collision, sorry). Why this exists: I wanted local-only dictation and meeting transcription, because audio shouldn't have to leave the machine just to become… 13 The Information — AI news-outlet 1mo ago Is the Gap Widening Between Anthropic and Open-Source Models? Some developers have told me that the rising costs of frontier AI models from Anthropic and other firms could prompt them to shift to cheaper open-source AI. After all, when companies as sophisticated as Uber are accidentally blowing through their entire year’s AI budget in a… 8 Hacker News — AI on Front Page community 1mo ago Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments. I built Forge, an open-source reliability layer for self-hosted LLM tool-calling. What it does: - Adds domain-and-tool-agnostic guardrails (retry nudges, step enforcement, error recovery, VRAM-aware context… 14 r/LocalLLaMA community 1mo ago bytedance released an open source model that attempts to do just about anything with only 3b parameters Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. Efficient at 3B scale. With only 3B active parameters , Lance delivers strong performance across image generation, image… 32 arXiv — Machine Learning research 1mo ago Provably Shorter Scratchpads in Hybrid DeltaNet-Attention Decoders arXiv:2605.16640v1 Announce Type: new Abstract: We investigate the expressive power of hybrid recurrent-attention decoders, a class of architectures used in recent open-source language models such as Qwen3-Next and its successors. These models combine Gated Attention heads with… 28 arXiv — NLP / Computation & Language research 1mo ago Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers arXiv:2605.16941v1 Announce Type: new Abstract: Diffusion Large Language Models (DLLMs) promise fast parallel generation, yet open-source DLLMs still face a severe quality-speed trade-off: accelerating decoding by revealing multiple tokens often causes substantial quality… 7 r/MachineLearning community 1mo ago Witchcraft, fast local semantic search on top of SQLite [P] Witchcraft ( https://github.com/dropbox/witchcraft ) , an open source project that I built at Dropbox, is a from-scratch re-implementation of Stanford's XTR-Warp semantic search engine ( https://github.com/jlscheerer/xtr-warp ) in safe rust, using a single-file SQLite database… 32 r/MachineLearning community 1mo ago Reviving PapersWithCode (by Hugging Face) [P] Hi, Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta. Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale… 10 Hacker News — AI on Front Page community 1mo ago Show HN: Files.md – Open-source alternative to Obsidian Article URL: https://github.com/zakirullin/files.md Comments URL: https://news.ycombinator.com/item?id=48179677 Points: 208 # Comments: 121 14 r/LocalLLaMA community 1mo ago New models when? Forecasting release date. After the recent releases, there's almost a sense of emptiness. When do you think new models will be released? Looking at the chart, it's between the end of May and the beginning of June, but... I don't know why, it seems like something's changing about "open weights"  … 4 arXiv — NLP / Computation & Language research 1mo ago CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs arXiv:2605.15763v1 Announce Type: new Abstract: Current state-of-the-art Quality Estimation (QE) in machine translation relies on massive, proprietary LLMs, raising data privacy concerns. We demonstrate that smaller, open-source LLMs (<30B parameters) are a viable,… 29 r/LocalLLaMA community 1mo ago Cutoff dates of open source models I was trying Qwen 3.6-27b and Gemma4 in a siomple web chat. Asked them both a qn like 'recommend the best llm for a 5060ti' and was suprised when they both replied 'user is asking about a card that doesn't exist'. I then saw their knowledge cutoff was early 2025, hence why. But… 12 Simon Willison community 1mo ago GDS weighs in on the NHS's decision to retreat from Open Source GDS weighs in on the NHS's decision to retreat from Open Source Terence Eden continues his coverage of the NHS' poorly considered decision to close down access to their open source repositories in response to vulnerabilities reported to them as part of Project Glasswing .… 24 r/LocalLLaMA community 1mo ago ROCm 7.13 nightly adds strix halo optimizations https://www.phoronix.com/news/ROCm-7.13-Released Quote: ...new optimizations for Ryzen AI Max 300 "Strix Halo" and the ROCprof Trace Decoder is now open-source...<snip>... Those rolling from source can grab the ROCm 7.13 Tech Preview via TheRock on GitHub .… 5 Hacker News — AI on Front Page community 1mo ago Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep Hey HN! We (Stephan and Thomas) recently open-sourced Semble. We kept running into the same problem while using Claude Code on large codebases: when the agent can't find something directly, it falls back to grep, reading full files or launching subagents. This uses a lot of… 24 r/LocalLLaMA community 1mo ago 85 GPU-hours comparing 5 abliteration methods on Qwen3.6-27B: benchmarks, safety, weight forensics - Abliterlitics I've been building Abliterlitics , an open-source abliteration forensics toolkit. The idea is straightforward: take the same base model, compare the different abliteration techniques others have applied, then measure what actually changed using benchmarks, safety evaluation,… 13 r/LocalLLaMA community 1mo ago Open Source vs frontier models on a single-file HTML canvas driving animation - results Hey yall, I was inspired by this post : https://www.reddit.com/r/LocalLLaMA/comments/1tf3p6c/local_qwen_36_vs_frontier_models_on_a_coding/ And I know this isn't exactly local, but I wanted to share what I tested out and what results each model delivered so I decided to share… 17 r/LocalLLaMA community 1mo ago GitHub - richardr1126/openreader: An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB, PDF, DOCX, TXT, and MD. Sharing my latest release of OpenReader v3.0.0, an open-source text-to-speech document reader and audiobook exporter. It has been live for over a year now, and slowly has gained 300+ GitHub stars. What is OpenReader? A Next.js web app for reading and listening to EPUB, PDF, TXT,… 9 r/LocalLLaMA community 1mo ago Built a 6x cheaper CodeRabbit alternative using open source models Coderabbit apparently uses GPT + Claude models to review PRs and it costed $60/month. So I grabbed a friend and made a alternative which does the same things but uses open source models as backend instead( because inference costs are wayyyy cheaper) We tested it on a PR… 15 Hacker News — AI on Front Page community 1mo ago SANA-WM, a 2.6B open-source world model for 1-minute 720p video Article URL: https://nvlabs.github.io/Sana/WM/ Comments URL: https://news.ycombinator.com/item?id=48159445 Points: 224 # Comments: 93 26 r/LocalLLaMA community 1mo ago I built a self-hosted open-source MCP server that gives any local LLM real financial data — SEC filings, 13F, insider & congressional trades, short data, FRED One thing missing when running local models as agents: real, current data. So I built Equibles — a self-hosted MCP server that scrapes and serves public U.S. financial data and exposes it as MCP tools, so any MCP-capable client (Claude Code/Desktop, Cursor, or your own… 30 r/LocalLLaMA community 1mo ago [FOUNDING] SupraLabs - real open-source AI models for you! https://preview.redd.it/k6lub2ypva1h1.png?width=1500&format=png&auto=webp&s=cd44452c86b5216fec17113a72f43bbf169edafb Hey r/LocalLLaMA ! We founded SupraLabs , and it's huge! What we do? We train, finetune and explore small models with good results to revolutionize small AI… 30 r/LocalLLaMA community 1mo ago I kept a running list of every LLM term that actually matters for production, cleaned it up and open sourced it Been building with LLMs for a while and kept hitting terms where the standard definition was useless for making engineering decisions. So I kept a personal doc, eventually it hit 30+ terms across inference, retrieval, agents, training, and prompting. Each entry has the… 37 Hugging Face Daily Papers research 1mo ago Orchard: An Open-Source Agentic Modeling Framework Abstract Orchard is an open-source framework for scalable agentic modeling that enables training diverse autonomous agents through specialized recipes for coding, GUI navigation, and personal assistance tasks. AI-generated summary Agentic modeling aims to transform LLMs into… 17 r/LocalLLaMA community 1mo ago Developing open source LLM from ground up from pretrain - rlhf(PPO/GRPO) Hello I have been working on creating a LLM from ground up. It is based on deepseek architecture with heavily VRAM footprint reduced optimized(GUM+muon) Currently this is the json schema I am using which should suffice as to what currently is being pretrained. Training on a… 7 TechCrunch — AI news-outlet 1mo ago Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard A new open source gadget called Clawdmeter turns Claude Code usage stats into a tiny desktop dashboard for AI coding power users. 11 Hugging Face official-blog 1mo ago Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Back to Articles Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote - Radu Florian hansolosan ibm-granite Parul Awasthy pawasthy ibm-granite Aashka Trivedi… 19 r/LocalLLaMA community 1mo ago Automated AI researcher running locally with llama.cpp Hi everyone, I'm happy to share ml-intern, which is a harness for agents to have tighter integration with Hugging Face's open-source libraries (transformers, datasets, trl, etc) and Hub infrastructure: https://github.com/huggingface/ml-intern The harness is quite simple… 23 r/LocalLLaMA community 1mo ago Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 [klein] for character keyframes, Wan2.2-I2V for animation, vision critic with auto-retry, music + 9-language narration in the same pipeline Shipped this for the AMD x lablab hackathon. Attached video is one of the actual reels the pipeline produced - one English sentence in, finished mp4 with characters, story, music, and voice-over out (fast demo video, not the best quality). ~45 minutes end-to-end on a single AMD… 13 arXiv — Machine Learning research 1mo ago A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data arXiv:2605.12706v1 Announce Type: new Abstract: RSNet is an open-source R package that provides a resampling-based framework for robust and interpretable network inference, designed to address the limited-sample-size challenges common in high-dimensional data. It supports both… 11 r/LocalLLaMA community 1mo ago Fully Realtime Interaction Models I know this model isn't open weights, and when it does drop it'll be over api, but I'm just posting to say the very MICROsecond that this drops you already know me and probably a bunch of other people are going to create an insane amount of distill data from the api. because at… 26 Hacker News — AI on Front Page community 1mo ago Open Source Resistance: keep OSS alive on company time Article URL: https://ossresistance.com/ Comments URL: https://news.ycombinator.com/item?id=48123015 Points: 215 # Comments: 70 14 r/LocalLLaMA community 1mo ago TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui). Hi all, I have been making a lot of updates to my project, and I wanted to share them here. TextGen (previously text-generation-webui, also known as my username oobabooga or ooba) has been in development since December 2022, before LLaMa and llama.cpp existed. In the last two… 32 Microsoft AI official-blog 1mo ago Hugging Face releases open-weights model family Three new open-weights models under Apache 2.0 — sizes from 1B to 70B — released alongside training recipes and evaluation harnesses. 21 r/LocalLLaMA community 1mo ago The Trillion-Parameter Dilemma: MiMo-V2.5-Pro went open-source (1.02T params). Is self-hosting worth it when the API costs $70 for 387M tokens? Xiaomi open-sourced MiMo-V2.5-Pro. 1.02 trillion parameters, 42B active (MoE), 1M context, MIT license. On paper, this is exciting. In practice, I'm stuck on the math. What I've been doing with it I've been running V2.5-Pro via the API through Claude Code for autonomous coding… 13 Page 5 of 7 · 308 articles ← Newer Older →