Tag

Open source

308 articles archived under #open-source · RSS

Hacker News — AI on Front Page community 18d ago

Codex for open source

Article URL: https://openai.com/form/codex-for-oss/ Comments URL: https://news.ycombinator.com/item?id=48497195 Points: 216 # Comments: 74

11
Hacker News — AI on Front Page community 18d ago

MiMo Code is now released and open-source

Article URL: https://mimo.xiaomi.com/mimocode Comments URL: https://news.ycombinator.com/item?id=48490826 Points: 259 # Comments: 134

37
r/LocalLLaMA community 18d ago

As we know Minimax M3 is just going to be open sourced in few days and because of that I was surfing on internet searching for its scores and I found out pretty interesting results. Is Minimax M3 really that good in agentic stuff and in coding? Is it better than older gpt models?

Has anyone personally compared the Minimax M3 model against other proprietary models to determine its relative performance tier? I am trying to understand where it currently ranks in the broader Al landscape. Can we say Minimax M3 is better than GPT 5.2 in coding and agentic…

26
r/LocalLLaMA community 18d ago

Cognitor: open-source semantic search engine. Automatically chunks, embeds and indexes the content of a target folder, making it searchable semantically.

https://github.com/tanaos/cognitor Cognitor is an open-source semantic search engine and vector database which automatically chunks, embeds and indexes the entire content of a target folder (and its subfolders), making it easily searchable by both AI agents and humans.…

15
r/LocalLLaMA community 18d ago

How I implemented ASR bias for voice transcription models [Open Source]

I've been spending the last couple of weeks building a Wispr Flow clone as an open source project. For context, it is a voice dictation app that lets you type faster, by speaking instead of actually typing. I spent the first week building the basic STT capabilities. One of the…

29
r/LocalLLaMA community 18d ago

Minimax M3 open weights release planned for Friday

  submitted by   /u/rmhubbert [link]   [comments]

28
arXiv — Machine Learning research 19d ago

Bergson: An Open Source Library for Data Attribution

arXiv:2606.11660v1 Announce Type: new Abstract: Data attribution is a promising field in interpretability that aims to explain model behavior through the influence of its training data, with applications including debugging undesirable model behavior and training dataset…

26
r/LocalLLaMA community 19d ago

nvidia/diffusiongemma-26B-A4B-it-NVFP4 · Hugging Face

Model Overview Description: DiffusionGemma 26B A4B IT is an open-weights multimodal generative model developed by Google DeepMind that processes text, image, and video inputs to produce text output via discrete diffusion. Built on the Gemma 4 26B A4B Mixture-of-Experts (MoE)…

12
r/MachineLearning community 19d ago

Pyrecall open source tool for detecting catastrophic forgetting during LLM fine-tuning[P]

Surprised there's no real tooling for this given how much research exists on continual learning. Built pyrecall to fill the gap. Snapshots skill scores before/after fine-tuning, flags regressions, rolls back LoRA adapters by name. Fully local, no external APIs. v0.1.0, MIT, pip…

17
r/LocalLLaMA community 19d ago

Best Open-Source AI coding model for my specs?

hello everyone! im looking for the most powerful open-source coding ai while still fitting my system my specs: CPU: AMD ryzen 7 7700 GPU: RTX 5070 RAM: 32 gb DDR5 OS: windows 11 use case: Writing, Coding, debugging. any recommendations would be great. thanks in advance  …

4
r/LocalLLaMA community 19d ago

DeepMind Just Dropped "DiffusionGemma" — Text Generation via Image-Style Diffusion Model

Another open weight model got dropped today, this one's from DeepMind, seems like a good day for the OSS geeks. Released under Apache 2.0 Instead of generating text sequentially token-by-token like almost every autoregressive model on the market, it uses a text diffusion head. -…

32
r/LocalLLaMA community 19d ago

Cohere released North Mini Code: It's first Open-Source Agentic Coding Model

Small: 30 billion parameters, 3B active. Efficient: Benchmarks to 33.4 on the Artificial Analysis Coding Index, competitive among similar sized models. Open Source: Apache 2.0 license HF: https://huggingface.co/CohereLabs/North-Mini-Code-1.0   submitted by  …

8
r/MachineLearning community 19d ago

Introducing Papers Without Code [P]

Hi, Niels here from the open-source team at Hugging Face. I've recently relaunched paperswithcode.co as a source for finding the state of the art (SOTA) across various AI domains, from 3D generation to AI agents. This is done by automatically parsing research papers published on…

36
r/MachineLearning community 19d ago

RelayOps - Production-shaped telecom support agent (54% auto-resolve, 0 unsafe actions, full audit + decision console) [P]

I just open-sourced RelayOps - a small, honest, production-shaped AI support agent built specifically for telecom and subscription billing queues. Key results (v1.5.1): 54% of a 50-ticket sample queue auto-resolved 0 unsafe auto-actions 0 billing escapes (tested on 12…

25
Hugging Face Daily Papers research 19d ago

Kwai Keye-VL-2.0 Technical Report

Abstract Kwai Keye-VL-2.0-30B-A3B is an open-source Mixture-of-Experts multimodal foundation model that enables long-video understanding and agentic intelligence through DeepSeek Sparse Attention and specialized training infrastructure. Generated by…

36
r/LocalLLaMA community 20d ago

Without open source LLMs, US AI companies could have already monopoled the technology

For such technology with clear importance and impact on all of us, I believe that making it open source is an ethical duty, otherwise, especially with the 1-sided politics of the US we experience today, they could have already monopoled the technology by now, maybe make it…

22
arXiv — NLP / Computation & Language research 20d ago

OpenRTLSet: A Fully Open-Source Dataset for Large Language Model-based Verilog Module Design

arXiv:2606.10285v1 Announce Type: new Abstract: OpenRTLSet introduces the largest fully open-source dataset for hardware design, offering over 131,000 diverse Verilog code samples to the research community and industry. Our dataset uniquely combines Verilog code from GitHub…

6
r/LocalLLaMA community 20d ago

Without open llm competition, closed source LLM companies will become insatiable.

I can't imagine how arrogant one must be to make such a decision. People pay $200 a month for Anthropic to mess with their codebase. Imagine how they would humiliate their customers if the world didn't have an open-source model.…

6
r/LocalLLaMA community 20d ago

Releasing Apodex-1.0 Smol Models (0.8B, 2B, 4B Open-Weights) optimized for Agentic Verification + AgentHarness Evals

Hey r/LocalLLaMA , We just released Apodex 1.0 , and alongside our flagship API, we are releasing the weights for our Smol models (0.8B, 2B, and 4B) . Our core research focuses on independent verification in long-horizon tasks. Instead of just scaling up parameter sizes for raw…

23
r/LocalLLaMA community 20d ago

zai-org/SCAIL-2 · Hugging Face

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning SCAIL-2 is an open-source model for end-to-end controlled character animation . It animates a reference character with a driving video, and also supports character replacement and…

15
r/LocalLLaMA community 20d ago

Have we reached the point where open-source LLMs are “just good enough”?

The question I’m asking myself is whether open-source LLMs are now “ just good enough ” to meet 95% of requirements. I know, of course, that they still need to and will get even better, but where does the added value of the remaining 5% come from? a) Better answer quality? Okay,…

19
Hacker News — AI on Front Page community 20d ago

Microsoft's open source tools were hacked to steal passwords of AI developers

Article URL: https://techcrunch.com/2026/06/08/microsofts-open-source-tools-were-hacked-to-steal-passwords-of-ai-developers/ Comments URL: https://news.ycombinator.com/item?id=48457830 Points: 233 # Comments: 97

25
r/LocalLLaMA community 21d ago

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

I fine-tuned NVIDIA's Parakeet TDT 0.6B v2 for clinical speech and am releasing the weights as Omi Med STT v1 (CC-BY-4.0). Disclosure: I'm the founder of Omi Health and built this. Happy to dig into the training mix, benchmark, failure cases, quantization, or anything else. The…

14
r/MachineLearning community 21d ago

How to start open source contribution [D]

hi everyone, I created a blog around how I started open source contribution, documented all minute details. Please give it a read and give review as this is my journey to do blogging for the first time. It is free! https://substack.com/home/post/p-200202050   submitted by…

25
r/LocalLLaMA community 21d ago

Was BitNet a dead end? What happened to ternary LLMs?

They seemed so promising at one point but the biggest ternary model is still 2B. What happened? Why aren't the frontier open weights AI labs attempting to use them?   submitted by   /u/3ntrope [link]   [comments]

7
r/MachineLearning community 21d ago

I'd like to share an updated methodology for building agents.[P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice, an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

20
Hacker News — AI on Front Page community 21d ago

Show HN: Gitdot – A better GitHub. Open-source, written in Rust

What works now: user signups, org creations, private/public repos, and importing GitHub repositories (both as read-only mirrors and full migrations). So basically, you can create, push and pull to a repo, but we don't have many features quite yet (issues, PRs, CI). What is a bit…

34
r/LocalLLaMA community 21d ago

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

OpenEnv is a tool for creating an agentic execution environment like terminals, browsers, or anything an agent can interact with. And today, we’re excited to announce that OpenEnv is becoming even more open, to make the future of training agents open source. Starting today,…

38
arXiv — Machine Learning research 22d ago

TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection

arXiv:2606.06742v1 Announce Type: new Abstract: TorchKM is an open-source library for kernel machines, including support vector machines, kernel logistic regression, and kernel quantile regression, with GPU acceleration. The library features a scikit-learn-style API and is…

36
arXiv — Machine Learning research 22d ago

A robust PPG foundation model using multimodal physiological supervision

arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining…

8
Hugging Face official-blog 22d ago

The Open Source Community is backing OpenEnv for Agentic RL

Back to Articles The Open Source Community is backing OpenEnv for Agentic RL Published June 8, 2026 Update on GitHub Upvote 1 ben burtenshaw burtenshaw Joseph Spisak spisakjo Lysandre lysandre Davide Testuggine darktex will brown willcb Charles Frye charlesfrye Chris Wing…

37
r/MachineLearning community 23d ago

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

# Results Journal: Qwen3.5-35B-A3B E114 as a Generated-Register Routing Signal Date: 2026-06-06 This is an experiment-history document, not a publication claim. It states the current best evidence for the strongest positive result in the Qwen3.5-35B-A3B set, the narrow…

20
Hacker News — AI on Front Page community 23d ago

Ntsc-rs – open-source video emulation of analog TV and VHS artifacts

Article URL: https://ntsc.rs/ Comments URL: https://news.ycombinator.com/item?id=48428025 Points: 227 # Comments: 49

15
r/LocalLLaMA community 24d ago

dots.tts 2B🎙️ SOTA TTS from RedNote

🔗 Blog: https://rednote-hilab.github.io/dots.tts-demo/ 🔗 GitHub: https://github.com/rednote-hilab/dots.tts 🔗 Technical Report: https://arxiv.org/abs/2608.16894 dots.tts 🎙️ New open-source TTS from RedNote (Xiaohongshu) ✨ 2B parameters (Apache 2.0) ✨ Fully continuous…

16
Hacker News — AI on Front Page community 24d ago

pg_durable: Microsoft open sources in-database durable execution

Article URL: https://github.com/microsoft/pg_durable Comments URL: https://news.ycombinator.com/item?id=48414367 Points: 219 # Comments: 52

31
r/LocalLLaMA community 24d ago

I implemented KVarN in my llama.cpp fork and ran KLD benchmarks. It's promising!

Saw this post here yesterday: KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag) Cheap KV cache with good precision? Sign me up! Oh, vLLM…

12
arXiv — NLP / Computation & Language research 25d ago

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

arXiv:2606.06242v1 Announce Type: new Abstract: Institutional documents contain substantial amounts of operational and analytical information embedded within figures and tables. Current approaches for extracting visual content from documents are largely built around generic…

9
Vercel — AI dev-tools 25d ago

The skills.sh API is now available

The skills.sh API is now available. Authenticate with your project's Vercel OIDC token and start querying more than 600,000 skills from across the open-source ecosystem. Search for skills, pull detailed info on any one, check its security audit, and more. Vercel issues a…

17
Hacker News — AI on Front Page community 25d ago

Anthropic's open-source framework for AI-powered vulnerability discovery

Article URL: https://github.com/anthropics/defending-code-reference-harness Comments URL: https://news.ycombinator.com/item?id=48403980 Points: 215 # Comments: 73

6
r/LocalLLaMA community 25d ago

KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag)

The KV-cache quant race just got more interesting. Huawei just open-sourced KVarN , a KV-cache quantization method under Apache 2.0, drops into vLLM with one flag. Posting because the tradeoff it's claiming is genuinely different from what's already in the stack, and I'd like to…

20
r/MachineLearning community 25d ago

On-policy distillation: one of the hottest terms on PapersWithCode [R]

Hi, Niels here from the open-source team at Hugging Face. At paperswithcode.co I am trying to make it easier for people to learn about the newest techniques used across AI papers. One of the hottest terms in AI research that I've recently added is On-policy distillation , also…

27
arXiv — Machine Learning research 26d ago

Spectral Scaling Laws of Muon

arXiv:2606.04058v1 Announce Type: new Abstract: Orthonormalized update rules have rapidly become a leading choice of optimizer for training large language models, with recent open-source state-of-the-art models adopting Muon. To keep these updates tractable, Muon performs the…

13
r/LocalLLaMA community 26d ago

Ideogram 4 is open source! (top ranked on DesignArena)

  submitted by   /u/paf1138 [link]   [comments]

29
r/LocalLLaMA community 26d ago

google/gemma-4-12B · Hugging Face

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on E2B, E4B, and 12B) and generating text output. This release includes open-weights models in both pre-trained and instruction-tuned…

29
r/LocalLLaMA community 26d ago

This day in LLM history….105 years ago today, Qwen 3.6 27b was released open source. /s

Unfortunately, the steam-powered GPUs of the era were incapable of anything higher than a 4K context limit.   submitted by   /u/Porespellar [link]   [comments]

33
r/LocalLLaMA community 26d ago

Calling it now Microsoft is buying Unsloth.

I am going to be honest, I am leery of this new partnership with Unsloth. Microsoft historically hated open source, and this will not benefit the community in the end. It will look great at first. They will drop updates, play nice, and everyone will celebrate. But if you have…

31
arXiv — NLP / Computation & Language research 27d ago

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

arXiv:2606.02628v1 Announce Type: cross Abstract: We investigate whether open-source LLMs encode a linearly separable truthfulness signal in their hidden states, and at which network depth this signal is strongest. Across three $7$B--$8$B instruction-tuned models (Llama-3.1-8B,…

26
arXiv — Machine Learning research 27d ago

Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation

arXiv:2606.03130v1 Announce Type: new Abstract: Small open-source code models that power IDE autocomplete still emit hallucinated Fill-in-the-Middle (FIM) completions: syntactically natural calls to methods, parameters, variables, and imports that do not exist in the surrounding…

8
arXiv — NLP / Computation & Language research 27d ago

The Unsampled Truth: Psychometrics in SLMs Measure Prompt Artifacts, Not Psychological Constructs

arXiv:2606.03357v1 Announce Type: new Abstract: When prompting SLMs for psychometric assessments, researchers assume the outputs reflect semantic reasoning. We evaluate this premise across 13 open-weights models (0.6B to 14B parameters) using a prompt variation framework that…

18
TechCrunch — AI news-outlet 27d ago

New Microsoft tool lets devs spin up AI behavior tests using text descriptions

Microsoft on Tuesday took the wraps off Adaptive Spec-driven Scoring for Evaluation and Regression Testing, an open source framework for spinning up AI evaluations.

38

Codex for open source

MiMo Code is now released and open-source

As we know Minimax M3 is just going to be open sourced in few days and because of that I was surfing on internet searching for its scores and I found out pretty interesting results. Is Minimax M3 really that good in agentic stuff and in coding? Is it better than older gpt models?

Cognitor: open-source semantic search engine. Automatically chunks, embeds and indexes the content of a target folder, making it searchable semantically.

How I implemented ASR bias for voice transcription models [Open Source]

Minimax M3 open weights release planned for Friday

Bergson: An Open Source Library for Data Attribution

nvidia/diffusiongemma-26B-A4B-it-NVFP4 · Hugging Face

Pyrecall open source tool for detecting catastrophic forgetting during LLM fine-tuning[P]

Best Open-Source AI coding model for my specs?

DeepMind Just Dropped "DiffusionGemma" — Text Generation via Image-Style Diffusion Model

Cohere released North Mini Code: It's first Open-Source Agentic Coding Model

Introducing Papers Without Code [P]

RelayOps - Production-shaped telecom support agent (54% auto-resolve, 0 unsafe actions, full audit + decision console) [P]

Kwai Keye-VL-2.0 Technical Report

Without open source LLMs, US AI companies could have already monopoled the technology

OpenRTLSet: A Fully Open-Source Dataset for Large Language Model-based Verilog Module Design

Without open llm competition, closed source LLM companies will become insatiable.

Releasing Apodex-1.0 Smol Models (0.8B, 2B, 4B Open-Weights) optimized for Agentic Verification + AgentHarness Evals

zai-org/SCAIL-2 · Hugging Face

Have we reached the point where open-source LLMs are “just good enough”?

Microsoft's open source tools were hacked to steal passwords of AI developers

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

How to start open source contribution [D]

Was BitNet a dead end? What happened to ternary LLMs?

I'd like to share an updated methodology for building agents.[P]

Show HN: Gitdot – A better GitHub. Open-source, written in Rust

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

TorchKM: A GPU-Oriented Library for Kernel Learning and Model Selection

A robust PPG foundation model using multimodal physiological supervision

The Open Source Community is backing OpenEnv for Agentic RL

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

Ntsc-rs – open-source video emulation of analog TV and VHS artifacts

dots.tts 2B🎙️ SOTA TTS from RedNote

pg_durable: Microsoft open sources in-database durable execution

I implemented KVarN in my llama.cpp fork and ran KLD benchmarks. It's promising!

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

The skills.sh API is now available

Anthropic's open-source framework for AI-powered vulnerability discovery

KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag)

On-policy distillation: one of the hottest terms on PapersWithCode [R]

Spectral Scaling Laws of Muon

Ideogram 4 is open source! (top ranked on DesignArena)

google/gemma-4-12B · Hugging Face

This day in LLM history….105 years ago today, Qwen 3.6 27b was released open source. /s

Calling it now Microsoft is buying Unsloth.

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation

The Unsampled Truth: Psychometrics in SLMs Measure Prompt Artifacts, Not Psychological Constructs

New Microsoft tool lets devs spin up AI behavior tests using text descriptions