Tag

Open source

308 articles archived under #open-source · RSS

r/MachineLearning community 27d ago

Browse CVPR 2026 papers on PapersWithCode [P]

https://preview.redd.it/se5nr2z7tt4h1.png?width=3046&format=png&auto=webp&s=7db15b73afb749da236e5bb50ff96372f6a3239b Hi, Niels here from the open-source team at Hugging Face. It's been 2 weeks since I launched paperswithcode.co , a revival of the website we all loved. It allows…

11
r/LocalLLaMA community 27d ago

JetBrains open-sources Mellum2 - anyone tried these?

  submitted by   /u/DeltaSqueezer [link]   [comments]

19
r/MachineLearning community 28d ago

MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D]

79% of enterprises have adopted AI agents. Only 11% run them in production. We've spent the past year building agent systems for banks, clinical operations teams, and engineering orgs. The problem isn't that agents don't work — they work fine. The problem is that every framework…

12
r/MachineLearning community 28d ago

MeshFlow: An open-source orchestrator for governed, cost-optimized multi-agent workflows [D]

Hey ML community, We’ve just open-sourced **MeshFlow** , a code-first, framework-agnostic runtime designed for governing and optimizing multi-agent systems in production. Most agent frameworks focus on rapid prototyping, but ML and platform engineering teams usually run into…

23
r/LocalLLaMA community 28d ago

For Ling-2.6-1T, what would make the size feel justified first: quality per token, local serving reality, or long context stability?

The first question I have about Ling-2.6-1T is not “is the model card impressive?” It is whether the boring trade-off makes sense. It is an open-sourced Ant/InclusionAI flagship with about 1T total params / 63B activated params, up to 1M native context, and 256K currently…

21
r/LocalLLaMA community 28d ago

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog

  submitted by   /u/dayanruben [link]   [comments]

34
The Information — AI news-outlet 28d ago

China’s MiniMax Launches New Model as Open-Source AI Coding Battle Heats Up

Chinese AI developer MiniMax on Monday launched a new large language model called M3, saying the new model’s coding capability approaches that of Anthropic’s Opus 4.7, which was released in April. The new MiniMax model is particularly suitable for coding and complex multi-step…

23
Smol AI News news-outlet 29d ago

not much happened today

**NVIDIA** led open-source AI model releases with **Cosmos 3**, a comprehensive omnimodal world model unifying language, image, video, audio, and action using a Mixture-of-Transformers design, and **Nemotron 3 Ultra**, a **550B** parameter open-weight model noted for high…

33
r/LocalLLaMA community 29d ago

G7 agrees on shared language around open-source AI and open weights AI

Basically stuff we already knew here, but now governments understand it too. I found the news here: https://www.phoronix.com/news/G7-On-Open-Source-AI   submitted by   /u/Kahvana [link]   [comments]

16
r/MachineLearning community 29d ago

Built an AI Accelerator and opensourced it. [P]

There is a huge gap in open source AI accelerators, so I implemented mine . Popular and well known ones are already legacy and doesn't support contemporary operations like Attention. Here is what makes mine special: Attention mechanism smelted directly into silicon Prototyped…

25
r/MachineLearning community 1mo ago

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

Ps. Not pitching anything; Just trying to understand where reality differs from the narrative. We're a couple of ML students, mostly worked on ML/software before, but over the last few months we've been playing with VLAs, robot datasets, and trying to understand where the field…

27
r/LocalLLaMA community 1mo ago

Open source : Turning vocal imitations into sound effects. (New UX for sound generation)

Hello guys I want to introduce my new project! Have you ever needed a specific sound while making a video or a game? You know exactly what it sounds like in your head, but have no idea how to search for it. That’s why sound design meetings at game studios often turn into people…

12
r/LocalLLaMA community 1mo ago

made a local voice AI for windows you can talk to in any language. open source, bring your own key

been building this on and off for a while and finally got it to a point where i'm not embarrassed to share it, so here goes. it's called Shadow AI. basically a voice-first AI companion that runs on your own windows machine. you just talk to it and it talks back, no typing…

38
arXiv — NLP / Computation & Language research 1mo ago

Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation

arXiv:2605.28830v1 Announce Type: new Abstract: As Large Language Models (LLMs) are increasingly deployed in safety-critical applications, robust content moderation becomes essential. We present a comprehensive evaluation of 14 open-source safety guard models on a curated…

19
Hugging Face Daily Papers research 1mo ago

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Abstract A comprehensive framework is presented for converting bidirectional video diffusion models into real-time interactive world models with controllable, causal, and low-latency capabilities through fine-tuning and distillation techniques. AI-generated summary Recent video…

8
r/MachineLearning community 1mo ago

I built a knowledge graph + policy engine for AI agents , explainable reasoning [D]

Hey , I've been building VeritasReason — an open-source Python framework that adds a structured reasoning and provenance layer on top of LLMs and AI agents. The problem it solves: AI agents today make decisions but record nothing. When something breaks in prod, you have zero…

38
r/MachineLearning community 1mo ago

A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]

Hello everyone. The new dataset is named MONET, is Apache 2.0 and available on HF: https://huggingface.co/datasets/jasperai/monet MONET is open, Apache 2.0-licensed image–text dataset. It was built from 2.9 billion images and refined to 104.9 million high-quality samples. We are…

5
TechCrunch — AI news-outlet 1mo ago

Vertu wants CEOs to run companies from an AI foldable starting at $6,880

Built on top of the open-source Hermes project, Vertu's new foldable combines AI-agent workflows, enterprise integrations, and ultra-premium luxury finishes.

22
arXiv — NLP / Computation & Language research 1mo ago

GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors

arXiv:2605.27866v1 Announce Type: new Abstract: Evaluating AI tutor responses requires more than factual correctness: tutors must identify mistakes, locate errors, provide guidance, and offer actionable next steps. We present GRADE, a systematic study of open-source models for…

35
r/MachineLearning community 1mo ago

BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]

[R] BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison I’m looking for feedback on a local agent-memory benchmark comparison, especially from people who care about evaluation methodology. I built an open-source R&D memory system called Context Swarm Memory…

31
r/LocalLLaMA community 1mo ago

ReAligned-Qwen3.5 Release

New from Lazarus AI and Eric Hartford, creator of Dolphin and Samantha, announcing the release of the ReAligned-Qwen3.5 series of models. Apache 2.0 license, finetuned to reduce Chinese ideological bias and censorship, refusal behavior, and state-narrative framing. I use SFT +…

19
r/LocalLLaMA community 1mo ago

Looks like Miminax-M3 is just around the corner

As per Minimax_AI twitter https://x.com/MiniMax_AI/status/2059286515155599595 I hope it will speed up Qwen3.7 open weights release. https://preview.redd.it/q1bdhs017n3h1.png?width=898&format=png&auto=webp&s=a9a8ea134a71b9e5b9ea2489fc72420e18c6da67   submitted by  …

38
r/MachineLearning community 1mo ago

A Tiny Open-Source Self-Driving AI That Runs on a Phone [P]

https://preview.redd.it/ww14mzr2fm3h1.png?width=1890&format=png&auto=webp&s=79873d47ae79c7815ca3e7e91fd43141632174f5 https://www.youtube.com/watch?v=rr_uS4bf0B4&feature=youtu.be trained a 7MB open-source L4 self-driving AI that learns navigation, lane following, and drift…

11
The Information — AI news-outlet 1mo ago

Boom Times for Inference Providers?

Less than a year ago, our reporters kept hearing doubts about a group of startups called inference providers. Companies like Fireworks, Baseten and Together AI, which rent out Nvidia servers to app developers and help them customize open-source models, had grown quickly but…

16
OpenAI official-blog 1mo ago

Warp’s big bet on building open source with GPT-5.5

Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.

36
Hugging Face Daily Papers research 1mo ago

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

Abstract Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns. AI-generated summary As large language model…

25
r/LocalLLaMA community 1mo ago

A rare look inside Qwen 3.7’s open source model release approval process:

For real tho, 9b, 27b, 122b, I don’t really care at this point, just show us that you still love us. EDIT: I guess I gotta use /s on my posts from now on. Nobody appreciates a good sarcatic shitpost anymore clearly. I love Qwen and all our brothers and sisters in the east. I kid…

30
Ars Technica — AI news-outlet 1mo ago

Millions of AI agents imperiled by critical vulnerability in open source package

"BadHost" was found in Starlette, a package with 325 million weekly downloads.

19
Hacker News — AI on Front Page community 1mo ago

Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer

Article URL: https://blog.prusa3d.com/our-new-open-source-colormix-model-in-prusaslicer-and-easyprint_136079/ Comments URL: https://news.ycombinator.com/item?id=48283410 Points: 214 # Comments: 67

5
Interconnects (Nathan Lambert) research 1mo ago

Some ideas for what comes next, May 2026

Gemini Flash 3.5, Mythos, open-closed balance, America's open-source surge, emerging power struggles and more.

10
r/LocalLLaMA community 1mo ago

Small set of local MCP server installers for home Linux users

Hi all, I have published a small open-source MCP server bundle called MCP Basic Servers : https://github.com/mchowy-troll/mcp-basic-servers It is a collection of simple Bash installer scripts for running local MCP HTTP servers on Linux . The idea is simple: run one script,…

38
r/LocalLLaMA community 1mo ago

New KV Quants coming 😍 Welcome OSCAR kv quant open sourced by togetherAI

Just when we started embracing turboquant this happens   submitted by   /u/yehyakar [link]   [comments]

5
Smol AI News news-outlet 1mo ago

not much happened today

**Inference optimization** is increasingly architectural, with **EAGLE 3.1** improving speculative decoding and long-context handling, collaborating with **vLLM** and **TorchSpec**. **Perplexity** open-sourced a rebuilt **Unigram tokenizer** cutting CPU use by **5–6×** and…

15
r/MachineLearning community 1mo ago

Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P]

I’ve been thinking about a problem in current agent systems: Most agents are becoming very good at execution, but the decision layer before execution is still unclear. Coding agents, research agents, tool loops, sandboxes, workflows, and harnesses are all improving quickly. Once…

38
r/MachineLearning community 1mo ago

I’m building an open-source decision layer above AI agents [P]

Hi everyone, I’m Jia, the creator of Spice. I’ve been working on an open-source project called Spice. The simplest way to describe it is: Spice is a decision layer above agents. Most agent systems today are very focused on execution, They are getting better at doing tasks after…

30
arXiv — Machine Learning research 1mo ago

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

arXiv:2605.23037v1 Announce Type: new Abstract: Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are…

8
arXiv — Machine Learning research 1mo ago

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

arXiv:2605.23410v1 Announce Type: new Abstract: The explosive growth of open-source model repositories has created a Model Jungle, where checkpoints are frequently shared without adequate documentation or metadata. While weight-space learning offers a pathway to identify and…

20
arXiv — Machine Learning research 1mo ago

An Open-Source Training Dataset for Foundation Models for Black-box Optimization

arXiv:2605.23417v1 Announce Type: new Abstract: Most black-box optimization methods require extensive hyperparameter tuning, often limiting their ability to generalize across different optimization domains. Foundation models for black-box optimization that learn optimization…

21
arXiv — NLP / Computation & Language research 1mo ago

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

arXiv:2605.22843v1 Announce Type: new Abstract: Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services. In real-world scenarios, performance is often…

18
arXiv — NLP / Computation & Language research 1mo ago

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

arXiv:2605.23618v1 Announce Type: new Abstract: We benchmark Google Embeddings (GE2), a Vertex-AI-hosted bi-encoder with 2,048-token context and explicit task-type conditioning, against five open-source alternatives: BGE-M3, E5-large, Multilingual-E5-large (mE5-L), LaBSE, and…

7
arXiv — NLP / Computation & Language research 1mo ago

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

arXiv:2605.23657v1 Announce Type: new Abstract: Skills, i.e., structured workflow instructions distilled for large language models (LLMs), are becoming an increasingly important mechanism for improving agent performance on real-world downstream tasks. However, as the open-source…

5
arXiv — NLP / Computation & Language research 1mo ago

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

arXiv:2505.13893v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have intensified efforts to fuse heterogeneous open-source models into a unified system that inherits their complementary strengths. Existing logit-based fusion methods maintain…

35
r/LocalLLaMA community 1mo ago

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

A few weeks ago, after finishing FastDMS , I started toying around writing some RDNA3 kernels again to see how fast I could get Qwen 3.6 MoE running. It turned out well enough, so over the past couple weeks, I turned those experiments into hipEngine , a new open source (AGPLv3)…

13
r/LocalLLaMA community 1mo ago

Could Open Models be trained to secretly go rogue?

I was discussing with some other folks how safe is to use open weights models from China and the topic of "trojan horse" came up. We know that, at least with current architecture, models can't run code on their own. They are entirely dependent on tools and harnesses. We also…

20
Hacker News — AI on Front Page community 1mo ago

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

Article URL: https://audiomass.co/?multitrack=1 Comments URL: https://news.ycombinator.com/item?id=48258015 Points: 338 # Comments: 68

29
r/MachineLearning community 1mo ago

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

At our work we use CUDA in Rust since the company switched to it recently. Rust has pretty good Driver API bindings but it made me wonder why the hell we cant have something decent in Go without cgo. I mostly build ML tools in the last month and Go is my main language for pretty…

30
r/MachineLearning community 1mo ago

PapersWithCode new features - week 1 [P]

Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co , a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…

23
r/LocalLLaMA community 1mo ago

Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)

Hi, (TLDR.): Qwen in its MTP version has tool call bugs and outputs everything into tool/thinking blocks - mangeling the output - canceling the +speed with repeated wrong tool calls! DCSS works well with non MTP qwen even on smaller qwants. im Testing the new MTP models and…

19
Hacker News — AI on Front Page community 1mo ago

Microsoft open-sources "the earliest DOS source code discovered to date"

https://opensource.microsoft.com/blog/2026/04/28/continuing-... Comments URL: https://news.ycombinator.com/item?id=48253386 Points: 224 # Comments: 55

33
r/LocalLLaMA community 1mo ago

Command A+ (218B MoE) running on Apple Silicon — MLX port, PR open

Cohere dropped Command A+ on the 20th (218B total / 25B active, 128 experts top-8, Apache 2.0). Wrote a cohere2_moe implementation for mlx-lm to get it running on Apple Silicon. Architecture notes for anyone digging into this model: - Single shared expert with a larger…

12

Browse CVPR 2026 papers on PapersWithCode [P]

JetBrains open-sources Mellum2 - anyone tried these?

MeshFlow: production-safe multi-agent orchestration — SHA-256 audit chain, HIPAA/SOX/GDPR built in, 70-85% token cost reduction [Open Source][D]

MeshFlow: An open-source orchestrator for governed, cost-optimized multi-agent workflows [D]

For Ling-2.6-1T, what would make the size feel justified first: quality per token, local serving reality, or long context stability?

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog

China’s MiniMax Launches New Model as Open-Source AI Coding Battle Heats Up

not much happened today

G7 agrees on shared language around open-source AI and open weights AI

Built an AI Accelerator and opensourced it. [P]

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

Open source : Turning vocal imitations into sound effects. (New UX for sound generation)

made a local voice AI for windows you can talk to in any language. open source, bring your own key

Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

I built a knowledge graph + policy engine for AI agents , explainable reasoning [D]

A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]

Vertu wants CEOs to run companies from an AI foldable starting at $6,880

GRADE: Generalizable Reasoning-Aware Dialogue Evaluation for AI Tutors

BEAM 100K memory benchmark: CSM vs Hindsight local artifact comparison [R]

ReAligned-Qwen3.5 Release

Looks like Miminax-M3 is just around the corner

A Tiny Open-Source Self-Driving AI That Runs on a Phone [P]

Boom Times for Inference Providers?

Warp’s big bet on building open source with GPT-5.5

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

A rare look inside Qwen 3.7’s open source model release approval process:

Millions of AI agents imperiled by critical vulnerability in open source package

Print with dozens of colors: Our new open-source ColorMix for PrusaSlicer

Some ideas for what comes next, May 2026

Small set of local MCP server installers for home Linux users

New KV Quants coming 😍 Welcome OSCAR kv quant open sourced by togetherAI

not much happened today

Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P]

I’m building an open-source decision layer above AI agents [P]

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

An Open-Source Training Dataset for Foundation Models for Black-box Optimization

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

Could Open Models be trained to secretly go rogue?

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

PapersWithCode new features - week 1 [P]

Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)

Microsoft open-sources "the earliest DOS source code discovered to date"

Command A+ (218B MoE) running on Apple Silicon — MLX port, PR open