Tag

Open source

308 articles archived under #open-source · RSS

r/MachineLearning community 1mo ago

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice , an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

6
r/LocalLLaMA community 1mo ago

meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face

🚀 Model Introduction We are excited to announce the release of LongCat-Video-Avatar 1.5, an upgraded open-source framework that prioritizes extreme empirical optimization and production-readiness for audio-driven human video generation. Built upon the LongCat-Video foundation…

21
r/LocalLLaMA community 1mo ago

I fine-tuned Cohere Transcribe to support diarization and timestamps

Hi I'll keep it short: Cohere-transcribe is currently the best open source speech to text model (and possibly even better than other proprietary models). BUT it doesn't support diarization (speaker identification) and timestamps, even though there are tokens for it in the…

36
r/LocalLLaMA community 1mo ago

DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals

https://www.bloomberg.com/news/articles/2026-05-22/deepseek-founder-declares-agi-goal-as-10-billion-round-advances   submitted by   /u/External_Mood4719 [link]   [comments]

17
r/LocalLLaMA community 1mo ago

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

My paper got published today at Arxiv. It raises questions about how language models behave when the framing of a request shifts. Small open-source AI models can be moved from honest to dishonest behaviour by little more than a change in tone. Asked to solve coding problems…

4
r/LocalLLaMA community 1mo ago

'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.

This has turned out to be useful to many of my friends so I thought I'd share here as well. I created a tool and documentation page for most major open-souce project's adherence to 'OpenAI compatibility' after seeing inconsistencies between engines like vLLM and llama.cpp. Now…

18
arXiv — Machine Learning research 1mo ago

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

arXiv:2605.20749v1 Announce Type: new Abstract: Gated Linear Units (GLU) and their variants are widely adopted in modern open-source large language model architectures and consistently outperform their non-gated counterparts, yet the underlying reasons for this advantage remain…

34
arXiv — NLP / Computation & Language research 1mo ago

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

arXiv:2605.20591v1 Announce Type: new Abstract: Medical large language models (LLMs), including custom medical GPTs (MedGPTs) and open-source models, are increasingly deployed on web platforms to provide clinical guidance. However, they pose risks of hallucination, policy…

33
r/MachineLearning community 1mo ago

l9gpu - open-source GPU observability with workload-level attribution [P]

GPU monitoring tools like DCGM give you hardware-level metrics but no workload context. When a node is saturated, you can't tell which experiment, team, or job is responsible without digging through logs. We built l9gpu to close that gap. It's a node-level agent that exports GPU…

25
r/LocalLLaMA community 1mo ago

Re. what ever happened to Cohere’s Command-A series of models?

Hey everyone, Nick Frosst here from Cohere. A few months ago Aidan (my cofounder) left a comment in here about our Command series and how we were working on some more powerful, open-weights models behind the scenes. We just launched Command A+ and we wanted to share it with you…

37
r/MachineLearning community 1mo ago

NOML-NOML: hierarchical TD3 + anchor policy for flight control [P]

I built a custom RL algorithm for continuous flight control and open-sourced it. Sharing here in case the structural ideas are useful for anyone doing continuous control where one action axis dominates. I've been training continuous control on a 6-DoF flight sim…

31
arXiv — NLP / Computation & Language research 1mo ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

arXiv:2605.19577v1 Announce Type: new Abstract: We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR). Existing long-context RL methods often treat data construction as a matter…

17
Hugging Face Daily Papers research 1mo ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Abstract GoLongRL presents an open-source approach for long-context reinforcement learning with diverse reward optimization through capability-oriented data construction and TMN-Reweight methodology. AI-generated summary We present GoLongRL, a fully open-source,…

37
r/LocalLLaMA community 1mo ago

PrivateScribe.ai - Fully local, MIT licensed, free AI transcription built with HIPAA/legal safeguards in mind - One Year Update!

I first posted about PrivateScribe.ai ~1yr ago and have recently jumped back intent on bringing it to a functionality that makes it actually usable by non-technical users. One year ago it worked but only the bare minimum. Since then I've gotten ⭐️74 github stars!⭐️ and have had…

31
r/LocalLLaMA community 1mo ago

Open weights GLM and Mimo are better than Gemini 3.5 flash according to arena

While we are weathering the gemini 3.5 flash hype, keep in mind that according to arena, GLM and Mimo are better. https://arena.ai/leaderboard/text/coding-no-style-control #7 GLM #9 Mimo #12 Gemini 3.5 Flash   submitted by   /u/Terminator857 [link]   [comments]

5
r/LocalLLaMA community 1mo ago

Floor for local meeting summarization on a 6GB GPU: qwen3.5:0.8b works at 57s, Granite 4 350M hallucinates

Disclosure: I made this. Open-source, MIT, Windows + Linux. Not affiliated with voiceflow.com (the chatbot SaaS, name collision, sorry). Why this exists: I wanted local-only dictation and meeting transcription, because audio shouldn't have to leave the machine just to become…

13
The Information — AI news-outlet 1mo ago

Is the Gap Widening Between Anthropic and Open-Source Models?

Some developers have told me that the rising costs of frontier AI models from Anthropic and other firms could prompt them to shift to cheaper open-source AI. After all, when companies as sophisticated as Uber are accidentally blowing through their entire year’s AI budget in a…

8
Hacker News — AI on Front Page community 1mo ago

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments. I built Forge, an open-source reliability layer for self-hosted LLM tool-calling. What it does: - Adds domain-and-tool-agnostic guardrails (retry nudges, step enforcement, error recovery, VRAM-aware context…

14
r/LocalLLaMA community 1mo ago

bytedance released an open source model that attempts to do just about anything with only 3b parameters

Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. Efficient at 3B scale. With only 3B active parameters , Lance delivers strong performance across image generation, image…

32
arXiv — Machine Learning research 1mo ago

Provably Shorter Scratchpads in Hybrid DeltaNet-Attention Decoders

arXiv:2605.16640v1 Announce Type: new Abstract: We investigate the expressive power of hybrid recurrent-attention decoders, a class of architectures used in recent open-source language models such as Qwen3-Next and its successors. These models combine Gated Attention heads with…

28
arXiv — NLP / Computation & Language research 1mo ago

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers

arXiv:2605.16941v1 Announce Type: new Abstract: Diffusion Large Language Models (DLLMs) promise fast parallel generation, yet open-source DLLMs still face a severe quality-speed trade-off: accelerating decoding by revealing multiple tokens often causes substantial quality…

7
r/MachineLearning community 1mo ago

Witchcraft, fast local semantic search on top of SQLite [P]

Witchcraft ( https://github.com/dropbox/witchcraft ) , an open source project that I built at Dropbox, is a from-scratch re-implementation of Stanford's XTR-Warp semantic search engine ( https://github.com/jlscheerer/xtr-warp ) in safe rust, using a single-file SQLite database…

32
r/MachineLearning community 1mo ago

Reviving PapersWithCode (by Hugging Face) [P]

Hi, Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta. Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale…

10
Hacker News — AI on Front Page community 1mo ago

Show HN: Files.md – Open-source alternative to Obsidian

Article URL: https://github.com/zakirullin/files.md Comments URL: https://news.ycombinator.com/item?id=48179677 Points: 208 # Comments: 121

14
r/LocalLLaMA community 1mo ago

New models when? Forecasting release date.

After the recent releases, there's almost a sense of emptiness. When do you think new models will be released? Looking at the chart, it's between the end of May and the beginning of June, but... I don't know why, it seems like something's changing about "open weights"  …

4
arXiv — NLP / Computation & Language research 1mo ago

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs

arXiv:2605.15763v1 Announce Type: new Abstract: Current state-of-the-art Quality Estimation (QE) in machine translation relies on massive, proprietary LLMs, raising data privacy concerns. We demonstrate that smaller, open-source LLMs (<30B parameters) are a viable,…

29
r/LocalLLaMA community 1mo ago

Cutoff dates of open source models

I was trying Qwen 3.6-27b and Gemma4 in a siomple web chat. Asked them both a qn like 'recommend the best llm for a 5060ti' and was suprised when they both replied 'user is asking about a card that doesn't exist'. I then saw their knowledge cutoff was early 2025, hence why. But…

12
Simon Willison community 1mo ago

GDS weighs in on the NHS's decision to retreat from Open Source

GDS weighs in on the NHS's decision to retreat from Open Source Terence Eden continues his coverage of the NHS' poorly considered decision to close down access to their open source repositories in response to vulnerabilities reported to them as part of Project Glasswing .…

24
r/LocalLLaMA community 1mo ago

ROCm 7.13 nightly adds strix halo optimizations

https://www.phoronix.com/news/ROCm-7.13-Released Quote: ...new optimizations for Ryzen AI Max 300 "Strix Halo" and the ROCprof Trace Decoder is now open-source...<snip>... Those rolling from source can grab the ROCm 7.13 Tech Preview via TheRock on GitHub .…

5
Hacker News — AI on Front Page community 1mo ago

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Hey HN! We (Stephan and Thomas) recently open-sourced Semble. We kept running into the same problem while using Claude Code on large codebases: when the agent can't find something directly, it falls back to grep, reading full files or launching subagents. This uses a lot of…

24
r/LocalLLaMA community 1mo ago

85 GPU-hours comparing 5 abliteration methods on Qwen3.6-27B: benchmarks, safety, weight forensics - Abliterlitics

I've been building Abliterlitics , an open-source abliteration forensics toolkit. The idea is straightforward: take the same base model, compare the different abliteration techniques others have applied, then measure what actually changed using benchmarks, safety evaluation,…

13
r/LocalLLaMA community 1mo ago

Open Source vs frontier models on a single-file HTML canvas driving animation - results

Hey yall, I was inspired by this post : https://www.reddit.com/r/LocalLLaMA/comments/1tf3p6c/local_qwen_36_vs_frontier_models_on_a_coding/ And I know this isn't exactly local, but I wanted to share what I tested out and what results each model delivered so I decided to share…

17
r/LocalLLaMA community 1mo ago

GitHub - richardr1126/openreader: An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB, PDF, DOCX, TXT, and MD.

Sharing my latest release of OpenReader v3.0.0, an open-source text-to-speech document reader and audiobook exporter. It has been live for over a year now, and slowly has gained 300+ GitHub stars. What is OpenReader? A Next.js web app for reading and listening to EPUB, PDF, TXT,…

9
r/LocalLLaMA community 1mo ago

Built a 6x cheaper CodeRabbit alternative using open source models

Coderabbit apparently uses GPT + Claude models to review PRs and it costed $60/month. So I grabbed a friend and made a alternative which does the same things but uses open source models as backend instead( because inference costs are wayyyy cheaper) We tested it on a PR…

15
Hacker News — AI on Front Page community 1mo ago

SANA-WM, a 2.6B open-source world model for 1-minute 720p video

Article URL: https://nvlabs.github.io/Sana/WM/ Comments URL: https://news.ycombinator.com/item?id=48159445 Points: 224 # Comments: 93

26
r/LocalLLaMA community 1mo ago

I built a self-hosted open-source MCP server that gives any local LLM real financial data — SEC filings, 13F, insider & congressional trades, short data, FRED

One thing missing when running local models as agents: real, current data. So I built Equibles — a self-hosted MCP server that scrapes and serves public U.S. financial data and exposes it as MCP tools, so any MCP-capable client (Claude Code/Desktop, Cursor, or your own…

30
r/LocalLLaMA community 1mo ago

[FOUNDING] SupraLabs - real open-source AI models for you!

https://preview.redd.it/k6lub2ypva1h1.png?width=1500&format=png&auto=webp&s=cd44452c86b5216fec17113a72f43bbf169edafb Hey r/LocalLLaMA ! We founded SupraLabs , and it's huge! What we do? We train, finetune and explore small models with good results to revolutionize small AI…

30
r/LocalLLaMA community 1mo ago

I kept a running list of every LLM term that actually matters for production, cleaned it up and open sourced it

Been building with LLMs for a while and kept hitting terms where the standard definition was useless for making engineering decisions. So I kept a personal doc, eventually it hit 30+ terms across inference, retrieval, agents, training, and prompting. Each entry has the…

37
Hugging Face Daily Papers research 1mo ago

Orchard: An Open-Source Agentic Modeling Framework

Abstract Orchard is an open-source framework for scalable agentic modeling that enables training diverse autonomous agents through specialized recipes for coding, GUI navigation, and personal assistance tasks. AI-generated summary Agentic modeling aims to transform LLMs into…

17
r/LocalLLaMA community 1mo ago

Developing open source LLM from ground up from pretrain - rlhf(PPO/GRPO)

Hello I have been working on creating a LLM from ground up. It is based on deepseek architecture with heavily VRAM footprint reduced optimized(GUM+muon) Currently this is the json schema I am using which should suffice as to what currently is being pretrained. Training on a…

7
TechCrunch — AI news-outlet 1mo ago

Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard

A new open source gadget called Clawdmeter turns Claude Code usage stats into a tiny desktop dashboard for AI coding power users.

11
Hugging Face official-blog 1mo ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Back to Articles Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote - Radu Florian hansolosan ibm-granite Parul Awasthy pawasthy ibm-granite Aashka Trivedi…

19
r/LocalLLaMA community 1mo ago

Automated AI researcher running locally with llama.cpp

Hi everyone, I'm happy to share ml-intern, which is a harness for agents to have tighter integration with Hugging Face's open-source libraries (transformers, datasets, trl, etc) and Hub infrastructure: https://github.com/huggingface/ml-intern The harness is quite simple…

23
r/LocalLLaMA community 1mo ago

Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 [klein] for character keyframes, Wan2.2-I2V for animation, vision critic with auto-retry, music + 9-language narration in the same pipeline

Shipped this for the AMD x lablab hackathon. Attached video is one of the actual reels the pipeline produced - one English sentence in, finished mp4 with characters, story, music, and voice-over out (fast demo video, not the best quality). ~45 minutes end-to-end on a single AMD…

13
arXiv — Machine Learning research 1mo ago

A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data

arXiv:2605.12706v1 Announce Type: new Abstract: RSNet is an open-source R package that provides a resampling-based framework for robust and interpretable network inference, designed to address the limited-sample-size challenges common in high-dimensional data. It supports both…

11
r/LocalLLaMA community 1mo ago

Fully Realtime Interaction Models

I know this model isn't open weights, and when it does drop it'll be over api, but I'm just posting to say the very MICROsecond that this drops you already know me and probably a bunch of other people are going to create an insane amount of distill data from the api. because at…

26
Hacker News — AI on Front Page community 1mo ago

Open Source Resistance: keep OSS alive on company time

Article URL: https://ossresistance.com/ Comments URL: https://news.ycombinator.com/item?id=48123015 Points: 215 # Comments: 70

14
r/LocalLLaMA community 1mo ago

TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui).

Hi all, I have been making a lot of updates to my project, and I wanted to share them here. TextGen (previously text-generation-webui, also known as my username oobabooga or ooba) has been in development since December 2022, before LLaMa and llama.cpp existed. In the last two…

32
Microsoft AI official-blog 1mo ago

Hugging Face releases open-weights model family

Three new open-weights models under Apache 2.0 — sizes from 1B to 70B — released alongside training recipes and evaluation harnesses.

21
r/LocalLLaMA community 1mo ago

The Trillion-Parameter Dilemma: MiMo-V2.5-Pro went open-source (1.02T params). Is self-hosting worth it when the API costs $70 for 387M tokens?

Xiaomi open-sourced MiMo-V2.5-Pro. 1.02 trillion parameters, 42B active (MoE), 1M context, MIT license. On paper, this is exciting. In practice, I'm stuck on the math. What I've been doing with it I've been running V2.5-Pro via the API through Claude Code for autonomous coding…

13

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face

I fine-tuned Cohere Transcribe to support diarization and timestamps

DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

l9gpu - open-source GPU observability with workload-level attribution [P]

Re. what ever happened to Cohere’s Command-A series of models?

NOML-NOML: hierarchical TD3 + anchor policy for flight control [P]

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

PrivateScribe.ai - Fully local, MIT licensed, free AI transcription built with HIPAA/legal safeguards in mind - One Year Update!

Open weights GLM and Mimo are better than Gemini 3.5 flash according to arena

Floor for local meeting summarization on a 6GB GPU: qwen3.5:0.8b works at 57s, Granite 4 350M hallucinates

Is the Gap Widening Between Anthropic and Open-Source Models?

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

bytedance released an open source model that attempts to do just about anything with only 3b parameters

Provably Shorter Scratchpads in Hybrid DeltaNet-Attention Decoders

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers

Witchcraft, fast local semantic search on top of SQLite [P]

Reviving PapersWithCode (by Hugging Face) [P]

Show HN: Files.md – Open-source alternative to Obsidian

New models when? Forecasting release date.

CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs

Cutoff dates of open source models

GDS weighs in on the NHS's decision to retreat from Open Source

ROCm 7.13 nightly adds strix halo optimizations

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

85 GPU-hours comparing 5 abliteration methods on Qwen3.6-27B: benchmarks, safety, weight forensics - Abliterlitics

Open Source vs frontier models on a single-file HTML canvas driving animation - results

GitHub - richardr1126/openreader: An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB, PDF, DOCX, TXT, and MD.

Built a 6x cheaper CodeRabbit alternative using open source models

SANA-WM, a 2.6B open-source world model for 1-minute 720p video

I built a self-hosted open-source MCP server that gives any local LLM real financial data — SEC filings, 13F, insider & congressional trades, short data, FRED

[FOUNDING] SupraLabs - real open-source AI models for you!

I kept a running list of every LLM term that actually matters for production, cleaned it up and open sourced it

Orchard: An Open-Source Agentic Modeling Framework

Developing open source LLM from ground up from pretrain - rlhf(PPO/GRPO)

Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Automated AI researcher running locally with llama.cpp

Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 [klein] for character keyframes, Wan2.2-I2V for animation, vision critic with auto-retry, music + 9-language narration in the same pipeline

A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data

Fully Realtime Interaction Models

Open Source Resistance: keep OSS alive on company time

TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui).

Hugging Face releases open-weights model family

The Trillion-Parameter Dilemma: MiMo-V2.5-Pro went open-source (1.02T params). Is self-hosting worth it when the API costs $70 for 387M tokens?