News / #ide Tag Ide 88 articles archived under #ide · RSS Sign in to follow r/LocalLLaMA community 15h ago Mellum2 local deployments Hey local community, I work at JetBrains with the team that trained Mellum2 models — 12B-2.5A LLMs. Those models are trained completely from scratch, targeting fast inference: our primary goal were H100/H200s prod deployments, but local deployments are good as well. We… 37 Hacker News — AI on Front Page community 4d ago Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion Hi HN, Nick here. We’re launching OpenKnowledge ( https://openknowledge.ai/ ), a “what you see is what you get” markdown editor that has direct integrations with Claude, Codex, and other agents. Available as MacOS app or Web UI+CLI. Fully free/local and OSS. We built this… 20 Hacker News — AI on Front Page community 5d ago LuaJIT 3.0 proposed syntax extensions Article URL: https://github.com/LuaJIT/LuaJIT/issues/1475 Comments URL: https://news.ycombinator.com/item?id=48667336 Points: 201 # Comments: 119 7 r/LocalLLaMA community 5d ago SDXL running locally in the browser on WebGPU, open-source I needed simple local image generation without the usual setup. No virtual environments, no ComfyUI with a complex graph and installation as an exe. So i tried to push the whole thing into the browser and run it on WebGPU. It's a browser extension. You install it, then it loads… 13 Hacker News — AI on Front Page community 6d ago Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX Hi all! TikZ is a widely-used LaTeX package for drawing figures in papers. It uses commands like \draw[->] (0,0) -- (1,2); to draw lines, shapes, text, etc. Academics usually code up their figures by hand, so there is lots of twiddling around with the coordinates and recompiling… 31 r/LocalLLaMA community 9d ago Qwen code companion on vscode marketplace - thoughts I just came across this extension in vscode few days ago and tried to use with LM studio hosted models and it really is pretty good compared to `continue`, `kilo`, `cline`, `roo` like I felt without much tweaks, gets straight to the point, if any tweaks required u could do… 36 arXiv — Machine Learning research 11d ago Latent Confounded Causal Discovery via Lie Bracket Geometry arXiv:2606.19610v1 Announce Type: new Abstract: Recent work on Kan-Do-Calculus (KDC) has established that the boundary between passive observation and active intervention in causal inference is a category-theoretic bi-adjunction, with interventions modeled by left Kan extensions… 16 arXiv — Machine Learning research 12d ago Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation arXiv:2606.19120v1 Announce Type: new Abstract: On-policy self-distillation (OPSD) trains a model on its own rollouts and uses a frozen copy to provide dense token-level targets conditioned on a reference target. This works well for LLM reasoning, but a direct extension to… 31 arXiv — Machine Learning research 12d ago INDEQS: Informed Neural controlled Differential EQuationS arXiv:2606.19138v1 Announce Type: new Abstract: Neural Controlled Differential Equations (NCDE) provide a powerful continuous-time framework for forecasting time series, but standard graph-based extensions typically learn spatial structure purely from data, even in settings… 38 Hacker News — AI on Front Page community 12d ago RFC 10008: The new HTTP Query Method Article URL: https://www.rfc-editor.org/info/rfc10008/ Comments URL: https://news.ycombinator.com/item?id=48568502 Points: 219 # Comments: 105 13 arXiv — Machine Learning research 13d ago A fairness-aware extension of Stochastic Multicriteria Acceptability Analysis for ranking arXiv:2606.17756v1 Announce Type: new Abstract: Fairness has become a central concern in ranking problems involving individuals or social groups, particularly under the Responsible Artificial Intelligence agenda. In Multi-Criteria Decision Analysis, Stochastic Multicriteria… 33 arXiv — NLP / Computation & Language research 13d ago Self-Generated Error Training for Token Editing in Diffusion Language Models arXiv:2606.17175v1 Announce Type: new Abstract: Token-to-token (T2T) editing lets LLaDA2.1 revise committed tokens during block-diffusion decoding. The released recipe trains this editor on random vocabulary corruptions, but at inference the editor sees the model's own fluent,… 25 r/LocalLLaMA community 13d ago GLM 5.2 API is live, weights are on HF, and ollama has it already GLM 5.2 dropped on Friday locked behind the GLM Coding Plan. That was annoying if you just wanted to test it without subscribing to another IDE tier. Two hours ago today they opened the API and pushed weights to HuggingFace under MIT. Ollama already has it. So now you can… 15 arXiv — NLP / Computation & Language research 14d ago LLM-Assisted Stance Detection in Scientific Discourse: A Test Case in Bayesian Cognitive Science arXiv:2606.15566v1 Announce Type: new Abstract: Qualitative coding is central to social science, but expert annotation is difficult to scale. LLMs offer a possible extension, yet require careful validation when the target construct is interpretive, theoretically loaded, and only… 25 r/LocalLLaMA community 14d ago I think we need a /LocalHarnessLLM or something ... LM Studio Hermes Qwen Code Odysseus Open Claw Open Code Claude Code (and then IDEs w/ agentic capabilities) Continue Rider VS Code And a dozen others I'm sure ... Would love a place to discuss these? If not a new subreddit, a new discord section in localllama discord? I've made… 24 arXiv — Machine Learning research 15d ago DTVEM-RE: A Hierarchical Random-Effects Extension of the Differential Time-Varying Effect Model for Person-Specific Multi-Lag Estimation in Intensive Longitudinal Data arXiv:2606.14116v1 Announce Type: new Abstract: The Differential Time-Varying Effect Model (DTVEM) of Jacobson et al. (2019) is a popular tool for finding the best time lag in intensive longitudinal data, but it assumes everyone shares the same lag structure. The original… 31 arXiv — Machine Learning research 15d ago Beyond task performance: Decoding bioacoustic embeddings with speech features arXiv:2606.14662v1 Announce Type: new Abstract: Pretrained audio embeddings are standard in bioacoustics, yet little is known about which acoustic features these models encode, nor which are useful for a given task. This hinders transparency and limits extension to rare species… 6 arXiv — NLP / Computation & Language research 15d ago CORA: Analyzing and bridging thinking-answer gap in Multimodal RLVR via Consistency-Oriented Reasoning Alignment arXiv:2606.14691v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has successfully elicited the reasoning capabilities of large language models, motivating its extension to multimodal scenarios. Existing methods primarily focus on improving… 34 r/LocalLLaMA community 16d ago Pi Setup that pretty much replaced Claude Code for me I've been using Pi with Qwen3.6-27B a lot as my daily driver for more than a month and this setup almost replaced Codex/CC for me entirely. I use it with the advisor extension, with the advisor usually being GPT-5.5 and it has been great for me so far. I sometimes use OpenCode… 35 arXiv — NLP / Computation & Language research 18d ago Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models arXiv:2606.13558v1 Announce Type: cross Abstract: Text-guided image editing with visual autoregressive (VAR) generators requires controlling both what the model samples and where the sampled change is written back into the image code. Existing VAR editors mainly operate on token… 12 The Information — AI news-outlet 18d ago KKR, Nvidia, Others Launch $10 Billion Data Center Company Private equity firm KKR, the Kuwait Investment Authority, Nvidia and power generation company Vistra launched a new company on Thursday to finance and help build AI data centers. Nvidia’s role as an anchor investor in Helix signifies another extension of the AI giant’s growing… 29 arXiv — Machine Learning research 19d ago Flow Matching with In-Context Priors for Out-of-Distribution Brain Dynamics arXiv:2606.11833v1 Announce Type: new Abstract: Flow matching and diffusion models enable conditional generation across domains ranging from images to proteins, with recent extensions to out-of-distribution contexts. Yet generative models of neural time series have largely… 27 arXiv — Machine Learning research 19d ago nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding arXiv:2606.12146v1 Announce Type: new Abstract: Rotary Position Embedding (RoPE) is widely adopted in Transformer models, yet its extension to high-dimensional domains lacks a unified theoretical formulation. Most existing approaches either apply rotations independently along… 8 r/MachineLearning community 19d ago How common are TMLR desk rejections with "not a suitable venue"? [D] Submitted a short theoretical paper to TMLR and got desk-rejected with "does not meet our editorial standards or allow us to assess claims and evidence" and "not a suitable venue for this work." Is this a common outcome for first submissions? Curious what typically drives this… 33 arXiv — Machine Learning research 20d ago Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment arXiv:2606.09928v1 Announce Type: new Abstract: The Forward-Forward (FF) algorithm offers a biologically inspired alternative to backpropagation by replacing gradient-based credit assignment with local, forward-only objectives. While recent extensions have adapted FF to… 32 arXiv — NLP / Computation & Language research 20d ago Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community arXiv:2606.10159v1 Announce Type: new Abstract: AI is increasingly used to support scientific peer review, from manuscript screening, reviewer assistance to editorial triage. Although such systems promise to reduce reviewer burden and accelerate publication, their robustness to… 21 llama.cpp releases dev-tools 20d ago b9580 vulkan: add v_dot2_f32_f16 support in matrix-matrix multiplication and Flash Attention ( #24123 ) vulkan: add support for valve fp16 dot2 extension use macro for dot2 path choice properly check for the feature add dot_product abstraction to reduce preprocessor branching… 10 arXiv — Machine Learning research 21d ago Learning Transfers: Kan Extensions for Neural Invariants arXiv:2606.07627v1 Announce Type: new Abstract: Transfer learning presumes that a representation learned on source tasks carries structure that remains usable on related target tasks. Standard evaluations probe this through target accuracy or distributional discrepancy, yet… 8 r/LocalLLaMA community 21d ago Jetbrains Mellum 2: a really good and performant model Oh Hey Folks, I took the Mellum 2 model for a spin, so I wanted to share my impressions here. Disclaimer: the tests presented here are not cientific nor have those nice names like perplexity,etc. These tests are somewhat more akin to what Im working in a daily basis or how… 28 r/LocalLLaMA community 23d ago Best Coding Harness for Qwen3.6 35B? I've been happily using GitHub Copilot for 7-8 months, primarily in Visual Studio and VS Code, mostly with the built-in flagship models and have felt like the output is worth the cost. Lately I've been playing with a lot of different local LLM models and decided to try using… 32 r/LocalLLaMA community 26d ago Gemma 4 12B first coding agent test on a 4080 Super Just threw the new Gemma 4 12B into VSCodium with the Pi Agent extension to see how it handles tools, and it nailed the test on the first try. I gave it a prompt to write a Python script that reads logs line-by-line, grabs the error modules, and dumps the counts to a JSON file.… 14 arXiv — Machine Learning research 27d ago Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation arXiv:2606.03130v1 Announce Type: new Abstract: Small open-source code models that power IDE autocomplete still emit hallucinated Fill-in-the-Middle (FIM) completions: syntactically natural calls to methods, parameters, variables, and imports that do not exist in the surrounding… 8 Simon Willison community 27d ago Microsoft's new MAI models Microsoft announced two new text LLMs this morning - MAI-Thinking-1 (reasoning, 35B parameters, available to "select early partners") and MAI-Code-1-Flash (5B parameters, "purpose-built for GitHub Copilot and VS Code to deliver high performance and lower cost [...] rolling out… 17 Hacker News — AI on Front Page community 27d ago 1-Click GitHub Token Stealing via a VSCode Bug Article URL: https://blog.ammaraskar.com/github-token-stealing/ Comments URL: https://news.ycombinator.com/item?id=48371562 Points: 220 # Comments: 30 4 r/LocalLLaMA community 27d ago JetBrains open-sources Mellum2 - anyone tried these?   submitted by   /u/DeltaSqueezer [link]   [comments] 19 Simon Willison community 28d ago Pasted File Editor Tool: Pasted File Editor I really like how you can paste a large volume of text into claude.ai (or the Claude desktop/mobile apps) and it will detect it as a large paste and turn it into a file attachment instead. I decided to have Codex desktop build me a version of that as a… 24 Hugging Face official-blog 28d ago Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains Back to Articles Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains Team Article Published June 1, 2026 Upvote 5 Nikita Pavlichenko pavlichenko JetBrains Mellum2 is a 12B-parameter Mixture-of-Experts model trained from scratch on natural language and code. The… 24 TechCrunch — AI news-outlet 28d ago DuckDuckGo makes its ‘no-AI’ search engine easier to access as its traffic booms Alternative search engine DuckDuckGo launches 'no AI' web extensions for Chrome and Firefox users. 34 r/LocalLLaMA community 28d ago Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog   submitted by   /u/dayanruben [link]   [comments] 34 Vercel — AI dev-tools 28d ago Chat SDK adds Velt support Chat SDK now supports Velt with the new vendor-official adapter . Build bots that read and reply within Velt comment threads, right where your team already works: documents, text editors, and canvases. Tag the bot, and it will answer in the same thread, grounding its reply with… 24 r/LocalLLaMA community 28d ago Mellum 2 12B A2.5B Coding focused small MoE from JetBrains. They claim coding performance around Qwen 3.5 9B for the reasoning model. Worse than Qwen 3.5 4B in in everything else. Models: https://huggingface.co/collections/JetBrains/mellum-2 Technical report: https://arxiv.org/abs/2605.31268  … 34 arXiv — NLP / Computation & Language research 29d ago Wind Turbine Maintenance Log Labelling Framework: LLM-Driven Data Correction and Enrichment via Semantic Extraction of Reliability Intelligence arXiv:2605.31281v1 Announce Type: new Abstract: As wind turbine fleets age, data-driven reliability engineering is essential to optimise their operation and maintenance for service life extension and levelised cost of energy reduction. Failure event descriptions within… 27 arXiv — Machine Learning research 1mo ago Conf-Gen: Conformal Uncertainty Quantification for Generative Models arXiv:2605.28920v1 Announce Type: new Abstract: Conformal prediction (CP) and its extension, conformal risk control (CRC), are established frameworks for quantifying uncertainty in supervised machine learning through formal guarantees. However, recent breakthroughs in artificial… 17 r/MachineLearning community 1mo ago Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P] Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both… 15 r/MachineLearning community 1mo ago Built a richer reading layer for arxiv (Chrome extension + web): OpenReview reviews, GitHub/HuggingFace links, citation graph, SPECTER2 neighbors, TLDRs. 3M papers, free, looking for feedback [P] Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both… 10 arXiv — NLP / Computation & Language research 1mo ago ConvMemory: A Lightweight Learned Memory Reranker, a Negative Attribution Result, and a Research-Preview Conflict Editor arXiv:2605.28062v1 Announce Type: new Abstract: We describe ConvMemory, a small 3.6M-parameter learned reranker for conversational long-term memory retrieval, trained with cross-encoder teacher supervision over fused dense and lexical features. On the LongMemEval memory family,… 30 arXiv — NLP / Computation & Language research 1mo ago Supervised Semantic Differential for Cross-Cultural Concept Analysis: A Case Study of Human Affect arXiv:2605.28225v1 Announce Type: new Abstract: Cross-cultural comparison of psychological meaning requires methods that go beyond word-level translation and examine how semantic dimensions are organized across languages. We introduce a cross-lingual extension of the Supervised… 26 arXiv — NLP / Computation & Language research 1mo ago The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models arXiv:2605.26711v1 Announce Type: new Abstract: We construct a binary mixed-regime process with one deterministic textual regime and one random regime governed by an unobserved latent state. Even an ideal infinite-capacity sequence predictor that exactly recovers the text-only… 34 arXiv — Machine Learning research 1mo ago Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions arXiv:2605.24113v1 Announce Type: new Abstract: Classical archetypal analysis is appealing for its interpretability, but its linear geometry can limit performance on data with strongly non-linear structure; at the same time, existing neural extensions improve flexibility while… 35 arXiv — NLP / Computation & Language research 1mo ago Raon-Speech Technical Report arXiv:2605.23912v1 Announce Type: new Abstract: We present Raon-Speech, a top-performing 9B-parameter speech language model (SpeechLM) for English and Korean speech understanding, answering, and generation, and Raon-SpeechChat, a high-performing full-duplex extension for natural… 11 Page 1 of 2 · 88 articles Older →