Tag

Ide

88 articles archived under #ide · RSS

r/LocalLLaMA community 16h ago

Mellum2 local deployments

Hey local community, I work at JetBrains with the team that trained Mellum2 models — 12B-2.5A LLMs. Those models are trained completely from scratch, targeting fast inference: our primary goal were H100/H200s prod deployments, but local deployments are good as well. We…

37
Hacker News — AI on Front Page community 4d ago

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

Hi HN, Nick here. We’re launching OpenKnowledge ( https://openknowledge.ai/ ), a “what you see is what you get” markdown editor that has direct integrations with Claude, Codex, and other agents. Available as MacOS app or Web UI+CLI. Fully free/local and OSS. We built this…

20
Hacker News — AI on Front Page community 5d ago

LuaJIT 3.0 proposed syntax extensions

Article URL: https://github.com/LuaJIT/LuaJIT/issues/1475 Comments URL: https://news.ycombinator.com/item?id=48667336 Points: 201 # Comments: 119

7
r/LocalLLaMA community 5d ago

SDXL running locally in the browser on WebGPU, open-source

I needed simple local image generation without the usual setup. No virtual environments, no ComfyUI with a complex graph and installation as an exe. So i tried to push the whole thing into the browser and run it on WebGPU. It's a browser extension. You install it, then it loads…

13
Hacker News — AI on Front Page community 6d ago

Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX

Hi all! TikZ is a widely-used LaTeX package for drawing figures in papers. It uses commands like \draw[->] (0,0) -- (1,2); to draw lines, shapes, text, etc. Academics usually code up their figures by hand, so there is lots of twiddling around with the coordinates and recompiling…

31
r/LocalLLaMA community 9d ago

Qwen code companion on vscode marketplace - thoughts

I just came across this extension in vscode few days ago and tried to use with LM studio hosted models and it really is pretty good compared to `continue`, `kilo`, `cline`, `roo` like I felt without much tweaks, gets straight to the point, if any tweaks required u could do…

36
arXiv — Machine Learning research 11d ago

Latent Confounded Causal Discovery via Lie Bracket Geometry

arXiv:2606.19610v1 Announce Type: new Abstract: Recent work on Kan-Do-Calculus (KDC) has established that the boundary between passive observation and active intervention in causal inference is a category-theoretic bi-adjunction, with interventions modeled by left Kan extensions…

16
arXiv — Machine Learning research 12d ago

Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation

arXiv:2606.19120v1 Announce Type: new Abstract: On-policy self-distillation (OPSD) trains a model on its own rollouts and uses a frozen copy to provide dense token-level targets conditioned on a reference target. This works well for LLM reasoning, but a direct extension to…

31
arXiv — Machine Learning research 12d ago

INDEQS: Informed Neural controlled Differential EQuationS

arXiv:2606.19138v1 Announce Type: new Abstract: Neural Controlled Differential Equations (NCDE) provide a powerful continuous-time framework for forecasting time series, but standard graph-based extensions typically learn spatial structure purely from data, even in settings…

38
Hacker News — AI on Front Page community 12d ago

RFC 10008: The new HTTP Query Method

Article URL: https://www.rfc-editor.org/info/rfc10008/ Comments URL: https://news.ycombinator.com/item?id=48568502 Points: 219 # Comments: 105

13
arXiv — Machine Learning research 13d ago

A fairness-aware extension of Stochastic Multicriteria Acceptability Analysis for ranking

arXiv:2606.17756v1 Announce Type: new Abstract: Fairness has become a central concern in ranking problems involving individuals or social groups, particularly under the Responsible Artificial Intelligence agenda. In Multi-Criteria Decision Analysis, Stochastic Multicriteria…

33
arXiv — NLP / Computation & Language research 13d ago

Self-Generated Error Training for Token Editing in Diffusion Language Models

arXiv:2606.17175v1 Announce Type: new Abstract: Token-to-token (T2T) editing lets LLaDA2.1 revise committed tokens during block-diffusion decoding. The released recipe trains this editor on random vocabulary corruptions, but at inference the editor sees the model's own fluent,…

25
r/LocalLLaMA community 13d ago

GLM 5.2 API is live, weights are on HF, and ollama has it already

GLM 5.2 dropped on Friday locked behind the GLM Coding Plan. That was annoying if you just wanted to test it without subscribing to another IDE tier. Two hours ago today they opened the API and pushed weights to HuggingFace under MIT. Ollama already has it. So now you can…

15
arXiv — NLP / Computation & Language research 14d ago

LLM-Assisted Stance Detection in Scientific Discourse: A Test Case in Bayesian Cognitive Science

arXiv:2606.15566v1 Announce Type: new Abstract: Qualitative coding is central to social science, but expert annotation is difficult to scale. LLMs offer a possible extension, yet require careful validation when the target construct is interpretive, theoretically loaded, and only…

25
r/LocalLLaMA community 14d ago

I think we need a /LocalHarnessLLM or something ...

LM Studio Hermes Qwen Code Odysseus Open Claw Open Code Claude Code (and then IDEs w/ agentic capabilities) Continue Rider VS Code And a dozen others I'm sure ... Would love a place to discuss these? If not a new subreddit, a new discord section in localllama discord? I've made…

24
arXiv — Machine Learning research 15d ago

DTVEM-RE: A Hierarchical Random-Effects Extension of the Differential Time-Varying Effect Model for Person-Specific Multi-Lag Estimation in Intensive Longitudinal Data

arXiv:2606.14116v1 Announce Type: new Abstract: The Differential Time-Varying Effect Model (DTVEM) of Jacobson et al. (2019) is a popular tool for finding the best time lag in intensive longitudinal data, but it assumes everyone shares the same lag structure. The original…

31
arXiv — Machine Learning research 15d ago

Beyond task performance: Decoding bioacoustic embeddings with speech features

arXiv:2606.14662v1 Announce Type: new Abstract: Pretrained audio embeddings are standard in bioacoustics, yet little is known about which acoustic features these models encode, nor which are useful for a given task. This hinders transparency and limits extension to rare species…

6
arXiv — NLP / Computation & Language research 15d ago

CORA: Analyzing and bridging thinking-answer gap in Multimodal RLVR via Consistency-Oriented Reasoning Alignment

arXiv:2606.14691v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has successfully elicited the reasoning capabilities of large language models, motivating its extension to multimodal scenarios. Existing methods primarily focus on improving…

34
r/LocalLLaMA community 16d ago

Pi Setup that pretty much replaced Claude Code for me

I've been using Pi with Qwen3.6-27B a lot as my daily driver for more than a month and this setup almost replaced Codex/CC for me entirely. I use it with the advisor extension, with the advisor usually being GPT-5.5 and it has been great for me so far. I sometimes use OpenCode…

35
arXiv — NLP / Computation & Language research 18d ago

Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models

arXiv:2606.13558v1 Announce Type: cross Abstract: Text-guided image editing with visual autoregressive (VAR) generators requires controlling both what the model samples and where the sampled change is written back into the image code. Existing VAR editors mainly operate on token…

12
The Information — AI news-outlet 18d ago

KKR, Nvidia, Others Launch $10 Billion Data Center Company

Private equity firm KKR, the Kuwait Investment Authority, Nvidia and power generation company Vistra launched a new company on Thursday to finance and help build AI data centers. Nvidia’s role as an anchor investor in Helix signifies another extension of the AI giant’s growing…

29
arXiv — Machine Learning research 19d ago

Flow Matching with In-Context Priors for Out-of-Distribution Brain Dynamics

arXiv:2606.11833v1 Announce Type: new Abstract: Flow matching and diffusion models enable conditional generation across domains ranging from images to proteins, with recent extensions to out-of-distribution contexts. Yet generative models of neural time series have largely…

27
arXiv — Machine Learning research 19d ago

nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding

arXiv:2606.12146v1 Announce Type: new Abstract: Rotary Position Embedding (RoPE) is widely adopted in Transformer models, yet its extension to high-dimensional domains lacks a unified theoretical formulation. Most existing approaches either apply rotations independently along…

8
r/MachineLearning community 19d ago

How common are TMLR desk rejections with "not a suitable venue"? [D]

Submitted a short theoretical paper to TMLR and got desk-rejected with "does not meet our editorial standards or allow us to assess claims and evidence" and "not a suitable venue for this work." Is this a common outcome for first submissions? Curious what typically drives this…

33
arXiv — Machine Learning research 20d ago

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

arXiv:2606.09928v1 Announce Type: new Abstract: The Forward-Forward (FF) algorithm offers a biologically inspired alternative to backpropagation by replacing gradient-based credit assignment with local, forward-only objectives. While recent extensions have adapted FF to…

32
arXiv — NLP / Computation & Language research 20d ago

Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community

arXiv:2606.10159v1 Announce Type: new Abstract: AI is increasingly used to support scientific peer review, from manuscript screening, reviewer assistance to editorial triage. Although such systems promise to reduce reviewer burden and accelerate publication, their robustness to…

21
llama.cpp releases dev-tools 20d ago

b9580

vulkan: add v_dot2_f32_f16 support in matrix-matrix multiplication and Flash Attention ( #24123 ) vulkan: add support for valve fp16 dot2 extension use macro for dot2 path choice properly check for the feature add dot_product abstraction to reduce preprocessor branching…

10
arXiv — Machine Learning research 21d ago

Learning Transfers: Kan Extensions for Neural Invariants

arXiv:2606.07627v1 Announce Type: new Abstract: Transfer learning presumes that a representation learned on source tasks carries structure that remains usable on related target tasks. Standard evaluations probe this through target accuracy or distributional discrepancy, yet…

8
r/LocalLLaMA community 21d ago

Jetbrains Mellum 2: a really good and performant model

Oh Hey Folks, I took the Mellum 2 model for a spin, so I wanted to share my impressions here. Disclaimer: the tests presented here are not cientific nor have those nice names like perplexity,etc. These tests are somewhat more akin to what Im working in a daily basis or how…

28
r/LocalLLaMA community 23d ago

Best Coding Harness for Qwen3.6 35B?

I've been happily using GitHub Copilot for 7-8 months, primarily in Visual Studio and VS Code, mostly with the built-in flagship models and have felt like the output is worth the cost. Lately I've been playing with a lot of different local LLM models and decided to try using…

32
r/LocalLLaMA community 26d ago

Gemma 4 12B first coding agent test on a 4080 Super

Just threw the new Gemma 4 12B into VSCodium with the Pi Agent extension to see how it handles tools, and it nailed the test on the first try. I gave it a prompt to write a Python script that reads logs line-by-line, grabs the error modules, and dumps the counts to a JSON file.…

14
arXiv — Machine Learning research 27d ago

Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation

arXiv:2606.03130v1 Announce Type: new Abstract: Small open-source code models that power IDE autocomplete still emit hallucinated Fill-in-the-Middle (FIM) completions: syntactically natural calls to methods, parameters, variables, and imports that do not exist in the surrounding…

8
Simon Willison community 27d ago

Microsoft's new MAI models

Microsoft announced two new text LLMs this morning - MAI-Thinking-1 (reasoning, 35B parameters, available to "select early partners") and MAI-Code-1-Flash (5B parameters, "purpose-built for GitHub Copilot and VS Code to deliver high performance and lower cost [...] rolling out…

17
Hacker News — AI on Front Page community 27d ago

1-Click GitHub Token Stealing via a VSCode Bug

Article URL: https://blog.ammaraskar.com/github-token-stealing/ Comments URL: https://news.ycombinator.com/item?id=48371562 Points: 220 # Comments: 30

4
r/LocalLLaMA community 27d ago

JetBrains open-sources Mellum2 - anyone tried these?

  submitted by   /u/DeltaSqueezer [link]   [comments]

19
Simon Willison community 28d ago

Pasted File Editor

Tool: Pasted File Editor I really like how you can paste a large volume of text into claude.ai (or the Claude desktop/mobile apps) and it will detect it as a large paste and turn it into a file attachment instead. I decided to have Codex desktop build me a version of that as a…

24
Hugging Face official-blog 28d ago

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Back to Articles Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains Team Article Published June 1, 2026 Upvote 5 Nikita Pavlichenko pavlichenko JetBrains Mellum2 is a 12B-parameter Mixture-of-Experts model trained from scratch on natural language and code. The…

24
TechCrunch — AI news-outlet 28d ago

DuckDuckGo makes its ‘no-AI’ search engine easier to access as its traffic booms

Alternative search engine DuckDuckGo launches 'no AI' web extensions for Chrome and Firefox users.

34
r/LocalLLaMA community 28d ago

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog

  submitted by   /u/dayanruben [link]   [comments]

34
Vercel — AI dev-tools 28d ago

Chat SDK adds Velt support

Chat SDK now supports Velt with the new vendor-official adapter . Build bots that read and reply within Velt comment threads, right where your team already works: documents, text editors, and canvases. Tag the bot, and it will answer in the same thread, grounding its reply with…

24
r/LocalLLaMA community 28d ago

Mellum 2 12B A2.5B

Coding focused small MoE from JetBrains. They claim coding performance around Qwen 3.5 9B for the reasoning model. Worse than Qwen 3.5 4B in in everything else. Models: https://huggingface.co/collections/JetBrains/mellum-2 Technical report: https://arxiv.org/abs/2605.31268  …

34
arXiv — NLP / Computation & Language research 29d ago

Wind Turbine Maintenance Log Labelling Framework: LLM-Driven Data Correction and Enrichment via Semantic Extraction of Reliability Intelligence

arXiv:2605.31281v1 Announce Type: new Abstract: As wind turbine fleets age, data-driven reliability engineering is essential to optimise their operation and maintenance for service life extension and levelised cost of energy reduction. Failure event descriptions within…

27
arXiv — Machine Learning research 1mo ago

Conf-Gen: Conformal Uncertainty Quantification for Generative Models

arXiv:2605.28920v1 Announce Type: new Abstract: Conformal prediction (CP) and its extension, conformal risk control (CRC), are established frameworks for quantifying uncertainty in supervised machine learning through formal guarantees. However, recent breakthroughs in artificial…

17
r/MachineLearning community 1mo ago

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both…

15
r/MachineLearning community 1mo ago

Built a richer reading layer for arxiv (Chrome extension + web): OpenReview reviews, GitHub/HuggingFace links, citation graph, SPECTER2 neighbors, TLDRs. 3M papers, free, looking for feedback [P]

Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both…

10
arXiv — NLP / Computation & Language research 1mo ago

ConvMemory: A Lightweight Learned Memory Reranker, a Negative Attribution Result, and a Research-Preview Conflict Editor

arXiv:2605.28062v1 Announce Type: new Abstract: We describe ConvMemory, a small 3.6M-parameter learned reranker for conversational long-term memory retrieval, trained with cross-encoder teacher supervision over fused dense and lexical features. On the LongMemEval memory family,…

30
arXiv — NLP / Computation & Language research 1mo ago

Supervised Semantic Differential for Cross-Cultural Concept Analysis: A Case Study of Human Affect

arXiv:2605.28225v1 Announce Type: new Abstract: Cross-cultural comparison of psychological meaning requires methods that go beyond word-level translation and examine how semantic dimensions are organized across languages. We introduce a cross-lingual extension of the Supervised…

26
arXiv — NLP / Computation & Language research 1mo ago

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

arXiv:2605.26711v1 Announce Type: new Abstract: We construct a binary mixed-regime process with one deterministic textual regime and one random regime governed by an unobserved latent state. Even an ideal infinite-capacity sequence predictor that exactly recovers the text-only…

34
arXiv — Machine Learning research 1mo ago

Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions

arXiv:2605.24113v1 Announce Type: new Abstract: Classical archetypal analysis is appealing for its interpretability, but its linear geometry can limit performance on data with strongly non-linear structure; at the same time, existing neural extensions improve flexibility while…

35
arXiv — NLP / Computation & Language research 1mo ago

Raon-Speech Technical Report

arXiv:2605.23912v1 Announce Type: new Abstract: We present Raon-Speech, a top-performing 9B-parameter speech language model (SpeechLM) for English and Korean speech understanding, answering, and generation, and Raon-SpeechChat, a high-performing full-duplex extension for natural…

11

Mellum2 local deployments

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

LuaJIT 3.0 proposed syntax extensions

SDXL running locally in the browser on WebGPU, open-source

Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX

Qwen code companion on vscode marketplace - thoughts

Latent Confounded Causal Discovery via Lie Bracket Geometry

Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation

INDEQS: Informed Neural controlled Differential EQuationS

RFC 10008: The new HTTP Query Method

A fairness-aware extension of Stochastic Multicriteria Acceptability Analysis for ranking

Self-Generated Error Training for Token Editing in Diffusion Language Models

GLM 5.2 API is live, weights are on HF, and ollama has it already

LLM-Assisted Stance Detection in Scientific Discourse: A Test Case in Bayesian Cognitive Science

I think we need a /LocalHarnessLLM or something ...

DTVEM-RE: A Hierarchical Random-Effects Extension of the Differential Time-Varying Effect Model for Person-Specific Multi-Lag Estimation in Intensive Longitudinal Data

Beyond task performance: Decoding bioacoustic embeddings with speech features

CORA: Analyzing and bridging thinking-answer gap in Multimodal RLVR via Consistency-Oriented Reasoning Alignment

Pi Setup that pretty much replaced Claude Code for me

Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models

KKR, Nvidia, Others Launch $10 Billion Data Center Company

Flow Matching with In-Context Priors for Out-of-Distribution Brain Dynamics

nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding

How common are TMLR desk rejections with "not a suitable venue"? [D]

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community

b9580

Learning Transfers: Kan Extensions for Neural Invariants

Jetbrains Mellum 2: a really good and performant model

Best Coding Harness for Qwen3.6 35B?

Gemma 4 12B first coding agent test on a 4080 Super

Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation

Microsoft's new MAI models

1-Click GitHub Token Stealing via a VSCode Bug

JetBrains open-sources Mellum2 - anyone tried these?

Pasted File Editor

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

DuckDuckGo makes its &#8216;no-AI&#8217; search engine easier to access as its traffic booms

Mellum2 Goes Open Source: A Fast Model for AI Workflows | The JetBrains AI Blog

Chat SDK adds Velt support

Mellum 2 12B A2.5B

Wind Turbine Maintenance Log Labelling Framework: LLM-Driven Data Correction and Enrichment via Semantic Extraction of Reliability Intelligence

Conf-Gen: Conformal Uncertainty Quantification for Generative Models

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Built a richer reading layer for arxiv (Chrome extension + web): OpenReview reviews, GitHub/HuggingFace links, citation graph, SPECTER2 neighbors, TLDRs. 3M papers, free, looking for feedback [P]

ConvMemory: A Lightweight Learned Memory Reranker, a Negative Attribution Result, and a Research-Preview Conflict Editor

Supervised Semantic Differential for Cross-Cultural Concept Analysis: A Case Study of Human Affect

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions

Raon-Speech Technical Report

DuckDuckGo makes its ‘no-AI’ search engine easier to access as its traffic booms