Tag: Developer Tool. 500 articles archived under #developer-tool. r/LocalLLaMA community 29d ago Llama Studio v0.2.0 I have made an update to my llama-server WebUI based on some awesome feedback and interaction with the community. 1) JSON model config replaced by per-model shell scripts. Run from CLI, paste from unsloth, email to your buddy or post to reddit: Using real shell scripts to store… 17 Hacker News — AI on Front Page community 29d ago Creatine raise brain energy levels and slow Alzheimer's cognitive decline by 30% Article URL: https://thesciverse.org/scientists-found-that-the-creatine-supplement-millions-take-for-muscle-gains-is-quietly-raising-brain-energy-levels-and-slowing-early-alzheimers-cognitive-decline-by-30/ Comments URL: https://news.ycombinator.com/item?id=48346947 Points: 230… 15 Vercel — AI dev-tools 29d ago Chat SDK adds Lark and Feishu support Chat SDK now supports Lark and Feishu via a new vendor-official adapter. Build bots that post, edit, and delete messages, stream replies via Lark's native cardkit typewriter API, send interactive cards, and react with emojis across both Lark and Feishu conversations. The… 20 r/LocalLLaMA community 1mo ago Step-3.7-Flash-NVFP4 thinking for many minutes Anyone else seeing Step-3.7-Flash-NVFP4 thinking for many minutes? I'm using it with Cline and can see it thinking for in some cases 14 minutes with vLLM reporting generation of 90 tokens/s every 10s. submitted by /u/NaiRogers 19 llama.cpp releases dev-tools 1mo ago b9414 mtmd: Add DeepSeekOCR 2 Support (#20975) mtmd: DeepSeek-OCR 2 support, with multi-tile dynamic resolution introduced clip_image_f32::add_viewsep address PR review drop redundant ggml_cpy ops in both deepseekocr versions build drop no-op ggml_cont in build_sam assert… 30 TechCrunch — AI news-outlet 1mo ago What happens when companies become too AI-pilled? 
The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 Marcus on AI community 1mo ago What happens next, after the decline of tokenmaxxing? Two very different sets of predictions 25 TechCrunch — AI news-outlet 1mo ago Does your CEO have AI psychosis? Aaron Levie thinks most of them do. The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI… 25 MIT Technology Review — AI news-outlet 1mo ago How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment Pope Leo XIV’s new encyclical on artificial intelligence includes a statement that warrants serious attention from technologists and policymakers: “Technology is never neutral.” Magnifica Humanitas (“Magnificent Humanity”) is a clarion call to all people to act with courage and… 8 Hacker News — AI on Front Page community 1mo ago Volkswagen blocks Home Assistant by requiring client assertion Article URL: https://github.com/robinostlund/homeassistant-volkswagencarnet/issues/967 Comments URL: https://news.ycombinator.com/item?id=48319509 Points: 221 # Comments: 112 32 arXiv — Machine Learning research 1mo ago Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data arXiv:2605.29058v1 Announce Type: new Abstract: Bayesian Networks (BNs) are of interest from an explainable AI viewpoint, offering transparent probabilistic models for decision support. 
Baymex is a recently introduced multi-objective evolutionary algorithm for learning… 24 arXiv — Machine Learning research 1mo ago Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions arXiv:2605.29172v1 Announce Type: new Abstract: Seasonal climate predictions support planning and risk management by offering early information of the most likely-to-occur climate conditions in the coming months, and associated uncertainties. Ensemble forecasts enable this by… 20 arXiv — Machine Learning research 1mo ago SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction arXiv:2605.29236v1 Announce Type: new Abstract: Alarm fatigue in intensive care units (ICUs) is a well documented patient safety crisis. Clinical monitors generate 350 or more alarms per patient per day, out of which 72-99% are clinically irrelevant. Staff desensitization to… 29 arXiv — Machine Learning research 1mo ago Causal Label Recovery in Payment Networks arXiv:2605.29272v1 Announce Type: new Abstract: Fraud detection models in payment networks train on chargeback labels that are systematically biased. Every label must survive three sequential gates: authorization (declined transactions generate no labels), issuer reporting… 36 arXiv — NLP / Computation & Language research 1mo ago Specialty-Specific Medical Language Model for Immune-Mediated Diseases arXiv:2605.28838v1 Announce Type: new Abstract: Extracting detailed clinical information from free-text medical narratives remains a practical challenge for researchers and healthcare systems. 
Terminology for immune-mediated and infectious diseases is especially inconsistent… 29 arXiv — NLP / Computation & Language research 1mo ago Hallucination Detection-Guided Preference Optimization for Clinical Summarization arXiv:2605.28910v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise on summarization tasks, but they often produce hallucinations, which are unsupported or incorrect statements that limit their reliability in specialized healthcare applications. We… 21 llama.cpp releases dev-tools 1mo ago b9393 mtmd: fix gemma 4 audio rms norm eps (#23815) mtmd: fix gemma 4 audio rms norm eps Update tools/mtmd/clip.cpp Co-authored-by: Sigbjørn Skjæret macOS/iOS: macOS Apple Silicon (arm64) macOS… 34 The Information — AI news-outlet 1mo ago Blue Origin New Glenn Rocket Explodes During Test Jeff Bezos’ space company Blue Origin suffered a serious setback Thursday evening when its New Glenn rocket exploded on a launch pad in Florida during a test. Video clips of the incident show a giant fireball engulfing the rocket and surrounding structures. No one was on board… 18 r/LocalLLaMA community 1mo ago Claude cli >= 2.1.154 breaks local use with vLLM by introducing "ctx", "msg" and "system" roles for API messages. This 1-line patch to vLLM fixes it. diff --git a/vllm/entrypoints/anthropic/protocol.py b/vllm/entrypoints/anthropic/protocol.py index 3ebc17117..2d5726d73 100644 --- a/vllm/entrypoints/anthropic/protocol.py +++ b/vllm/entrypoints/anthropic/protocol.py @@ -65,7 +65,7 @@ class AnthropicContentBlock(BaseModel):… 29 r/MachineLearning community 1mo ago Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R] Are agents aging after deployment?: https://arxiv.org/abs/2605.26302 On a new longitudinal deployment benchmark, switching the Claude Code CLI agent from Sonnet 4.6 to Opus 4.7 dropped PyTest pass rate by ~15%. 
This (to me) is a counterintuitive-enough result to pay attention… 6 Don't Worry About the Vase community 1mo ago AI #170: Lack of Executive Order Last week ended on a cliffhanger of sorts. 28 arXiv — Machine Learning research 1mo ago Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility arXiv:2605.27467v1 Announce Type: new Abstract: Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks… 19 arXiv — Machine Learning research 1mo ago Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals arXiv:2605.27583v1 Announce Type: new Abstract: Electrocardiograms (ECGs) are widely used non-invasive measurements of cardiac activity and play a central role in clinical diagnosis. Recent multimodal approaches align ECG signals with clinical reports to incorporate diagnostic… 4 arXiv — Machine Learning research 1mo ago Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients? arXiv:2605.27733v1 Announce Type: new Abstract: Training instabilities such as loss spikes are frequently the result of stochastic gradient noise. Because of rare expressions in language training data, and multiple layer composition, the noise impact is heavy-tailed and survives… 33 arXiv — Machine Learning research 1mo ago Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning arXiv:2605.27954v1 Announce Type: new Abstract: Agentic large language models are increasingly used to solve real-world tasks by reasoning over goals, invoking tools, and interacting with external environments. 
Reinforcement learning provides a natural framework for improving… 38 arXiv — Machine Learning research 1mo ago Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur… 13 arXiv — NLP / Computation & Language research 1mo ago BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking arXiv:2605.27380v1 Announce Type: new Abstract: Cross-lingual biomedical entity linking (BEL) maps mentions in any language to unique identifiers in a biomedical knowledge base (KB), supporting clinical and biomedical NLP applications. However, expert-annotated training data for… 32 arXiv — NLP / Computation & Language research 1mo ago StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation arXiv:2605.27393v1 Announce Type: new Abstract: Large language models (LLMs) can generate fluent dialogue, but prior works lack situational grounding, dynamic strategy control, and evaluation aligned with clinical standards in motivational interviewing (MI). We introduce… 7 arXiv — NLP / Computation & Language research 1mo ago Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. 
This gap is often explained as a failure to understand… 28 arXiv — NLP / Computation & Language research 1mo ago Challenges in Explaining Pretrained Clinical Text Classifiers arXiv:2605.28060v1 Announce Type: new Abstract: Explaining the predictions of neural models in clinical NLP remains a significant challenge, especially for complex tasks involving long, unstructured medical texts. While post-hoc methods like LIME and SHAP are widely used, they… 19 r/MachineLearning community 1mo ago noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P] If you've ever tried to pick an STT vendor for a phone-based voice agent or call center product, you've probably hit this wall: you have plenty of real production audio, but it's unlabeled, so you can't compute WER on it. And the annotated public datasets (FLEURS, CommonVoice,… 31 TechCrunch — AI news-outlet 1mo ago ClickHouse triples annualized revenue to $250M, charting a path toward an IPO The database provider is eyeing a public debut within the next few years. 32 r/LocalLLaMA community 1mo ago AI is not for everyone This may be a controversial take, but AI is not for everyone. I've made a post here before about the vibecoded garbage I see on this subreddit every time I click on it but there seems to be a larger issue. AI isn't just a set and forget karma farm. You actually have to put work… 14 The Information — AI news-outlet 1mo ago Micron Passes $1 Trillion as AI Memory Demand Sends Shares Soaring Micron Technology crossed $1 trillion in market value for the first time Tuesday, as shares climbed 19% on rising demand for memory chips used in AI systems. It was Micron’s largest single-day gain since 2011. 
The rally came after UBS sharply raised its price target for Micron… 33 arXiv — Machine Learning research 1mo ago GEM: Geometric Entropy Mixing for Optimal LLM Data Curation arXiv:2605.26121v1 Announce Type: new Abstract: LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering… 27 arXiv — Machine Learning research 1mo ago On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series arXiv:2605.26194v1 Announce Type: new Abstract: Clinical time-series learning is routinely constrained by small, heterogeneous cohorts and protocol drift, while its downstream use spans both classification (e.g., pathology diagnosis) and regression (e.g., temporal forecasting).… 30 arXiv — Machine Learning research 1mo ago MuCon: Clipped Muon Updates for LLM Training arXiv:2605.26459v1 Announce Type: new Abstract: Muon-style optimizers take a matrix-valued momentum or preconditioned update $B = U \operatorname{diag}(\sigma_1,\ldots,\sigma_r) V^\top$ and replace it with its canonical partial polar factor $\operatorname{Pol}(B) = U V^\top$.… 31 arXiv — Machine Learning research 1mo ago Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts (MoE) architecture is highly promising for resource-constrained on-device deployments, yet training these models from scratch incurs prohibitive costs. Current methods attempt to alleviate this by upcycling dense… 32 arXiv — Machine Learning research 1mo ago Separate Aggregation of Split Network for Personalized Federated Learning arXiv:2605.26571v1 Announce Type: new Abstract: Federated learning enables collaborative model training without sharing raw data, but its performance can degrade substantially 
under heterogeneous client data distributions. A single global model often cannot satisfy diverse… 33 arXiv — Machine Learning research 1mo ago Image Feature Fusion-based Federated Client Unlearning (FCU) arXiv:2605.26715v1 Announce Type: new Abstract: Major data protection regulations all mention the "right to be forgotten," and that's what pushed federated unlearning (FU) techniques forward. But one stubborn issue remains: catastrophic forgetting--you erase the target… 9 arXiv — Machine Learning research 1mo ago Adversarial Training for Robust Coverage Network under Worst-case Facility Losses arXiv:2605.26763v1 Announce Type: new Abstract: The Maximal Covering Location-Interdiction Problem (MCLIP) is a classic bi-level optimization problem, which is fundamental to resilient infrastructure planning yet remains computationally intractable. Specifically, the upper level… 5 arXiv — Machine Learning research 1mo ago Ratio-Variance Regularized Policy Optimization arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. 
We demonstrate that… 29 arXiv — NLP / Computation & Language research 1mo ago The Daily Dose: Workflow-Integrated Large Language Model Automation for Clinical Summarization and Trial Identification in Radiation Oncology arXiv:2605.26346v1 Announce Type: new Abstract: Objective: To describe the design and early clinical evaluation of The Daily Dose (TDD), an LLM-driven, automated clinical summarization and clinical-trial identification system integrated into routine radiation oncology practice.… 7 arXiv — NLP / Computation & Language research 1mo ago Curation and Extraction of Drug-Related Entities from Reddit Platform arXiv:2605.26445v1 Announce Type: new Abstract: Physicians learn primarily about illicit drugs from clinical overdose cases, limiting their understanding of real-world usage. Meanwhile, drug users share first-hand experiences online, offering insights into dosage and effects of… 31 arXiv — NLP / Computation & Language research 1mo ago Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records arXiv:2605.26463v1 Announce Type: new Abstract: Data consistency between unstructured clinical notes and structured tables in Electronic Health Records (EHRs) is essential for patient safety and clinical decision-making. However, existing work on note-table consistency… 7 arXiv — NLP / Computation & Language research 1mo ago Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline arXiv:2605.26560v1 Announce Type: new Abstract: Objective. Outpatient notes carry follow-up instructions pairing actions with future times ("MRI brain in two weeks"). 
Extracting (action, date) pairs supports scheduling and audit, but generative extractors miss the date because… 19 Vercel — AI dev-tools 1mo ago Experimental native binaries for Vercel CLI The Vercel CLI now ships an optional experimental native binary that starts faster, is even more secure, and requires no Node.js runtime dependency. Binaries are code-signed, allowing your OS to verify that they came from Vercel and haven't been modified. Additionally, on macOS,… 30 r/LocalLLaMA community 1mo ago Turning local agents into self-optimizing agents I was experimenting with a self-optimizing agentic pipeline to climb the benchmark leaderboard (TerminalBench). On a 10-task subset, I got the performance to rise from ~30% → ~90%. That loop worked, so I asked: can the same reflect-and-rewrite step run continuously against… 17 Hugging Face Daily Papers research 1mo ago ECHO: Terminal Agents Learn World Models for Free Abstract Environment cross-entropy hybrid objective combines policy-gradient loss with auxiliary environment observation prediction to provide dense supervision from terminal feedback, improving agent performance and self-improvement capabilities. AI-generated summary CLI agents… 23