Tag

Open source

308 articles archived under #open-source

arXiv — NLP / Computation & Language research 1h ago

How Far Do On-Prem Open LLMs Get on Text-to-SQL? A Cross-Family Size x Technique Frontier on BIRD

arXiv:2606.29733v1 Announce Type: new Abstract: Organizations that cannot send data to a cloud API increasingly ask: how good is Text-to-SQL if the model must run on-premises on open weights, and which popular accuracy "recipes" are worth their compute? We answer with an honest,…

16
r/LocalLLaMA community 6h ago

I Hate Dario Amodei, and everything he stands for.

I am so incredibly sick of this guy’s fear mongering about open source while fundamentally misunderstanding how it actually works. He recently dropped some arguments that are so completely detached from reality, it honestly feels like he’s never even touched a local model in his…

31
r/LocalLLaMA community 12h ago

Amodei: "Open Source Models Will Eat Your Children"

Submitted by /u/johnnyApplePRNG

35
r/LocalLLaMA community 12h ago

Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place."

Submitted by /u/johnnyApplePRNG

4
Simon Willison community 13h ago

Ornith-1.0: Self-Scaffolding LLMs for Agentic Coding

Ornith-1.0: Self-Scaffolding LLMs for Agentic Coding This is an interesting new open weights (MIT licensed) model, the first model release from DeepReinforce. [...] with variants including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. Built on top of pretrained Gemma 4 and Qwen…

5
arXiv — Machine Learning research 1d ago

Unified Zero-Shot Time Series Forecasting: A Darts Foundation

arXiv:2606.27438v1 Announce Type: new Abstract: Since its initial release in 2020, Darts has become a widely used open-source Python library for time series analysis. A series of foundation models have recently claimed accuracy improvements in zero-shot forecasting, promising a…

15
arXiv — NLP / Computation & Language research 1d ago

EntMTP: Accelerating LLM Inference with Entropy Guided Multi Token Prediction

arXiv:2606.27550v1 Announce Type: new Abstract: Multi-token prediction has been shown to increase data density during training, improve downstream text-generation quality, and serves as the de facto approach for self-speculative decoding. Existing foundation and open source…

29
Hacker News — AI on Front Page community 1d ago

HackerRank open sourced its ATS. My resume scored 90/100. Oh wait 74. No – 88

Article URL: https://danunparsed.com/p/hackerrank-open-source-ats Comments URL: https://news.ycombinator.com/item?id=48713832 Points: 554 # Comments: 223

18
r/LocalLLaMA community 1d ago

The number 1 public enemy of open-source.

Dario's args: "Opensource you can see the source, here you cannot see inside the model" - yes you can that's literally the open weights part btw. - I cannot see the weights inside Claude, but I can GLM 5.2 - Models like Nemotron3 Ultra go further, all the data, training scripts,…

25
r/LocalLLaMA community 1d ago

Hypothetically speaking...

Would it not be possible to create crowd sourced, truly open sourced distilled LLMs with a simple wrapper around command line based AI services that exist today? I'm imagining a layer that goes around whatever application people currently use for coding/AI boyfriend that…

24
r/LocalLLaMA community 2d ago

Will Chinese Open Source Models be the only option soon?

US techbros do not just want to make money. They want total global control of everything. Releasing any more advanced AI interferes with that plan. Submitted by /u/GeographHero

38
Hacker News — AI on Front Page community 2d ago

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

Article URL: https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf Comments URL: https://news.ycombinator.com/item?id=48696585 Points: 219 # Comments: 43

19
Hacker News — AI on Front Page community 3d ago

The gap between open weights LLMs and closed source LLMs

Article URL: https://blog.doubleword.ai/frontier-os-llm Comments URL: https://news.ycombinator.com/item?id=48692058 Points: 217 # Comments: 178

32
r/LocalLLaMA community 3d ago

Local LLM Peeps

I am 80% done with a harness that works for local and API but is local first. The harness has some interesting logic around multiple agents which I’m holding back on until it is open source on GitHub. I have been local for 6 months and built out EVERYTHING I could think of to…

28
r/LocalLLaMA community 3d ago

Streaming medical STT running locally on a MacBook

Quick teaser of what I’ve been working on over the last few weeks: a streaming medical speech-to-text model that runs fully on-device. This demo is running locally on a MacBook through MLX. Still doing more evals, but planning to release the open weights next week.  …

22
r/MachineLearning community 3d ago

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control. Few reasons behind this are: - I wanna own the complete stack around my product. - Second I wanna fine-tune the model around my…

34
Hacker News — AI on Front Page community 3d ago

We All Depend on Open Source. We Will Defend It Together

Article URL: https://akrites.org/letter/ Comments URL: https://news.ycombinator.com/item?id=48682737 Points: 280 # Comments: 137

35
arXiv — Machine Learning research 4d ago

The Open Source Economic Index of AI Adoption and Capability

arXiv:2606.26118v1 Announce Type: cross Abstract: We work towards measuring both AI adoption and the capability of AI to perform discrete labor tasks across various occupations. To measure adoption, we develop an open-source economic index that uses publicly available user-LLM…

5
arXiv — NLP / Computation & Language research 4d ago

Where Do Models Find Happiness? Emotion Vectors in Open-Source LLMs

arXiv:2606.26987v1 Announce Type: new Abstract: Recent work identified emotion vectors in Claude Sonnet 4.5, which are internal representations that encode emotion concepts, causally influence behavior, and exhibit geometry mirroring human psychological structure. We test the…

29
arXiv — NLP / Computation & Language research 4d ago

Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning

arXiv:2606.27330v1 Announce Type: new Abstract: Multimodal web agents can assist humans in operating repetitive GUI tasks, where effective task planning is essential for decomposing complex tasks into executable actions. While small open source MLLMs are cost efficient and…

8
r/LocalLLaMA community 4d ago

Stop waiting for Qwen3.7 Openweights.

Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes, including 9B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks. Hugging Face:…

36
ThursdAI news-outlet 4d ago

GLM 5.2 total victory: the week open source won and nobody panicked

From CoreWeave: A chill week, but a total Open Source victory for GLM 5.2 + Sakana Fugu, Krea Open Sources, OpenAI makes inference chips with broadcom, Karpathy gets heat about the new Claude Tag...

35
r/LocalLLaMA community 4d ago

Built an open source local first Kanban workflow for running AI coding agents without babysitting every step

I’ve been building BatonBot, a local first app for running AI coding workflows with less babysitting. The problem I kept running into, especially with local models, is that coding agents can be useful but the workflow gets slow: start task → wait → check output → fix next issue…

10
r/LocalLLaMA community 4d ago

Qwen 3.6 27b GLM 5.2 fine-tune?

Hi everyone, Since both models are open weights and GLM seems to find that secret to frontier model reasoning, why don't we see any Qwen GLM finetune yet? Is it because GLM 5.2 is recent and finetune and datasets take time or the community is just not interested in the finetune?…

28
Hacker News — AI on Front Page community 4d ago

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

Hi HN, Nick here. We’re launching OpenKnowledge (https://openknowledge.ai/), a “what you see is what you get” markdown editor that has direct integrations with Claude, Codex, and other agents. Available as MacOS app or Web UI+CLI. Fully free/local and OSS. We built this…

20
Vercel — AI dev-tools 4d ago

AI SDK 7

AI SDK, with over 16 million weekly downloads, is the TypeScript SDK for building AI applications, features, frameworks, and agents across any model provider. It's the same layer eve, Vercel's open-source agent framework, is built on. AI SDK 7 adds production depth for agent…

15
r/LocalLLaMA community 5d ago

SDXL running locally in the browser on WebGPU, open-source

I needed simple local image generation without the usual setup. No virtual environments, no ComfyUI with a complex graph and installation as an exe. So i tried to push the whole thing into the browser and run it on WebGPU. It's a browser extension. You install it, then it loads…

13
r/LocalLLaMA community 5d ago

Sipp - an open-source library for in-browser inference built on llama.cpp

GitHub: https://github.com/noumena-labs/Sipp (submitted by /u/lordhiggsboson)

9
r/MachineLearning community 5d ago

Find the best open-source OCR models in one place at Papers with Code [P]

Hi, I've created an overview of the most important OCR benchmarks, along with the top open models, and links to their papers and code: https://paperswithcode.co/tasks/ocr. This week, new OCR models were released by Baidu and Mistral. Baidu released Unlimited OCR, a 3B-parameter…

27
Hugging Face Daily Papers research 5d ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Abstract: An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training data. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) Agentic language models dramatically…

34
arXiv — Machine Learning research 6d ago

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

arXiv:2606.24388v1 Announce Type: cross Abstract: We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and practical, extending existing benchmarks by…

38
arXiv — NLP / Computation & Language research 6d ago

ESBMC-PLC+: A Unified IEC~61131-3 Formal Verification Framework as a PLCverif Successor

arXiv:2606.23870v1 Announce Type: cross Abstract: PLCverif is the most mature open-source platform for PLC formal verification, developed at CERN and in production use since 2019. Yet it has two fundamental limitations: no support for Ladder Diagram (LD) programs, the dominant…

35
Hugging Face Daily Papers research 6d ago

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

Abstract: A unified open-source framework for discrete text-trigger optimization that standardizes the development and execution of optimization strategies across various domains and applications. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) Discrete text-trigger optimization --…

18
r/LocalLLaMA community 6d ago

650+ Apache-2.0 biomedical NER/de-id models that run on-device in MLX. Same fp32 weights, identical outputs: the clinical NER models run 30-40x faster than PyTorch-CPU on a 3-year-old M3 Max. Repro inside.

Disclosure first: I maintain OpenMed, so read this with that bias. I'm posting the numbers with the full methodology and a runnable script so you can reproduce or tear it apart. I'm here for the next couple of hours to answer methodology questions. What it is: an open-source…

25
Hacker News — AI on Front Page community 6d ago

Krea 2: SOTA open-weights 12B image model

Article URL: https://www.krea.ai/blog/krea-2-technical-report Comments URL: https://news.ycombinator.com/item?id=48646659 Points: 247 # Comments: 33

4
Hugging Face Daily Papers research 6d ago

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

Abstract: AOHP presents an Android-based operating system framework that treats AI agents as first-class entities, enhancing task completion rates and reducing execution costs through specialized agent-oriented mechanisms. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) AI agents…

16
r/LocalLLaMA community 7d ago

Boogu Base, Turbo, Edit - open-source unified image generation and editing model series

Boogu-Image-0.1 is a competitive Apache-2.0 open-source unified image generation and editing model family, including Base, Turbo, Edit, and other variants that provide stable, practical capabilities for high-quality text-to-image generation, fast generation, image editing,…

22
TechCrunch — AI news-outlet 7d ago

OpenAI launches new initiative to help find and patch open-source bugs

OpenAI is attempting to tackle the security issues of the open source software community.

25
r/LocalLLaMA community 7d ago

Why is NO one talking about Microsoft's open source Fast Context!!!

https://huggingface.co/microsoft/FastContext-1.0-4B-SFT https://github.com/microsoft/fastcontext FastContext-1.0 is a lightweight repository-exploration subagent for LLM coding agents. Instead of letting a single model both explore the repository and solve the task, FastContext…

38
r/MachineLearning community 7d ago

About ML research collab group post [D]

Hi, I'm thinking of building a small community of 10-15 people where we can help each other learn something new. The primary focus will be on ML research and open-source projects. If you're interested, DM me. Knowledge of machine learning is a plus, as I want to keep this a…

16
TechCrunch — AI news-outlet 7d ago

SpaceX inks compute deal with Reflection AI, an open-source AI lab

Reflection AI will pay $150 million a month beginning July 1, 2026 through 2029 for immediate access to Nvidia's latest GB300 AI chips and supporting hardware across SpaceX's Colossus 2 data center near Memphis, Tennessee.

33
r/MachineLearning community 7d ago

Some new updates to Papers with Code [P]

Hi folks, Niels here from the open-source team at Hugging Face. I continue working on a revival of paperswithcode.co as we're back to the "age of research" per Ilya Sutskever! Hence, it's important to discover each other's research and build on each other's work, so we can…

38
OpenAI official-blog 7d ago

Patch the Planet: a Daybreak initiative to support open source maintainers

OpenAI introduces Patch the Planet, a Daybreak initiative helping open-source maintainers find, validate, and fix vulnerabilities with AI and expert review.

23
r/MachineLearning community 8d ago

I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]

Submitted by /u/NonGameCatharsis

29
r/LocalLLaMA community 8d ago

Qwen is never going to open source Qwen 3.7, aren't they?

Well, this was predictable. After Qwen fired Junyang Lin, the next models are no longer open source. Labs that have released open source models more recently than Qwen: GLM-5.2, 2026-06-17 Kimi-K2.7-Code, 2026-06-12 MiniMax-M3, 2026-06-11 Step-3.7-Flash, 2026-05-29…

15
r/LocalLLaMA community 9d ago

Best image vision model runnable on RTX 6000 Pro

I'm looking at running OCR and classification on old historical scanned documents (some dating back to the 1950s). What are the current best vision-enabled models that are open source and runnable on an RTX 6000 Pro? Note: I've used Gemma 4 31B and have had good success with it. It's…

20
r/LocalLLaMA community 9d ago

It’s time to decentralize model distribution! Introducing Noema Atlas

TL;DR: Noema Atlas is a peer-to-peer network software using Iroh for local LLM weights, free and open source (Apache-2.0). Models come from whichever peers have them, with Hugging Face and mirrors as fallback (opt-in). Every file is identified by its content hash and a signed…

38
r/LocalLLaMA community 9d ago

I wrote a free 15-part series on LLM internals — real math, real tensor shapes, real hardware constraints. All grounded in Gemma 4 12B's actual config.

If you run open-source models and want to understand what's actually happening under the hood — I spent the last few months writing a 15-part series that covers the full stack from tokenization to production serving. Most articles are grounded in Gemma 4 12B as the running…

19
r/LocalLLaMA community 9d ago

Board where every tile is an agent

I've been hacking on a project which I find extremely useful and wanted to share. Imagine a board where every tile is an agent whose job is to maintain the tile. I tried to illustrate the idea with a video here. The project is open source on GitHub and you can also try it out here…

36
r/MachineLearning community 9d ago

Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]

If you've tried to study modern diffusion models by digging through the official diffusers library, you know it can be overwhelming with its complexity and abstractions. I wanted to simplify FLUX diffusion models, so I built minFLUX: a PyTorch implementation focused on its core…

38
