Developer Tool

500 articles archived under #developer-tool

r/LocalLLaMA community 29d ago

Llama Studio v0.2.0

I have made an update to my llama-server WebUI based on some awesome feedback and interaction with the community. 1) JSON model config replaced by per-model shell scripts. Run from CLI, paste from unsloth, email to your buddy or post to reddit: Using real shell scripts to store…

17
Hacker News — AI on Front Page community 29d ago

Creatine raise brain energy levels and slow Alzheimer's cognitive decline by 30%

Article URL: https://thesciverse.org/scientists-found-that-the-creatine-supplement-millions-take-for-muscle-gains-is-quietly-raising-brain-energy-levels-and-slowing-early-alzheimers-cognitive-decline-by-30/ Comments URL: https://news.ycombinator.com/item?id=48346947 Points: 230…

15
Vercel — AI dev-tools 29d ago

Chat SDK adds Lark and Feishu support

Chat SDK now supports Lark and Feishu via a new vendor-official adapter. Build bots that post, edit, and delete messages, stream replies via Lark's native cardkit typewriter API, send interactive cards, and react with emojis across both Lark and Feishu conversations. The…

20
r/LocalLLaMA community 1mo ago

Step-3.7-Flash-NVFP4 thinking for many minutes

Anyone else seeing Step-3.7-Flash-NVFP4 thinking for many minutes? I'm using it with Cline and in some cases can see it thinking for 14 minutes, with vLLM reporting generation of 90 tokens/s every 10s. Submitted by /u/NaiRogers

19
llama.cpp releases dev-tools 1mo ago

b9414

mtmd: Add DeepSeekOCR 2 Support ( #20975 ) mtmd: DeepSeek-OCR 2 support, with multi-tile dynamic resolution introduced clip_image_f32::add_viewsep address PR review drop redundant ggml_cpy ops in both deepseekocr versions build drop no-op ggml_cont in build_sam assert…

30
TechCrunch — AI news-outlet 1mo ago

What happens when companies become too AI-pilled?

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
Marcus on AI community 1mo ago

What happens next, after the decline of tokenmaxxing?

Two very different sets of predictions

25
TechCrunch — AI news-outlet 1mo ago

Does your CEO have AI psychosis? Aaron Levie thinks most of them do.

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI…

25
MIT Technology Review — AI news-outlet 1mo ago

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

Pope Leo XIV’s new encyclical on artificial intelligence includes a statement that warrants serious attention from technologists and policymakers: “Technology is never neutral.” Magnifica Humanitas (“Magnificent Humanity”) is a clarion call to all people to act with courage and…

8
Hacker News — AI on Front Page community 1mo ago

Volkswagen blocks Home Assistant by requiring client assertion

Article URL: https://github.com/robinostlund/homeassistant-volkswagencarnet/issues/967 Comments URL: https://news.ycombinator.com/item?id=48319509 Points: 221 # Comments: 112

32
arXiv — Machine Learning research 1mo ago

Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data

arXiv:2605.29058v1 Announce Type: new Abstract: Bayesian Networks (BNs) are of interest from an explainable AI viewpoint, offering transparent probabilistic models for decision support. Baymex is a recently introduced multi-objective evolutionary algorithm for learning…

24
arXiv — Machine Learning research 1mo ago

Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions

arXiv:2605.29172v1 Announce Type: new Abstract: Seasonal climate predictions support planning and risk management by offering early information of the most likely-to-occur climate conditions in the coming months, and associated uncertainties. Ensemble forecasts enable this by…

20
arXiv — Machine Learning research 1mo ago

SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction

arXiv:2605.29236v1 Announce Type: new Abstract: Alarm fatigue in intensive care units (ICUs) is a well documented patient safety crisis. Clinical monitors generate 350 or more alarms per patient per day, out of which 72-99% are clinically irrelevant. Staff desensitization to…

29
arXiv — Machine Learning research 1mo ago

Causal Label Recovery in Payment Networks

arXiv:2605.29272v1 Announce Type: new Abstract: Fraud detection models in payment networks train on chargeback labels that are systematically biased. Every label must survive three sequential gates: authorization (declined transactions generate no labels), issuer reporting…

36
arXiv — NLP / Computation & Language research 1mo ago

Specialty-Specific Medical Language Model for Immune-Mediated Diseases

arXiv:2605.28838v1 Announce Type: new Abstract: Extracting detailed clinical information from free-text medical narratives remains a practical challenge for researchers and healthcare systems. Terminology for immune-mediated and infectious diseases is especially inconsistent…

29
arXiv — NLP / Computation & Language research 1mo ago

Hallucination Detection-Guided Preference Optimization for Clinical Summarization

arXiv:2605.28910v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise on summarization tasks, but they often produce hallucinations, which are unsupported or incorrect statements that limit their reliability in specialized healthcare applications. We…

21
llama.cpp releases dev-tools 1mo ago

b9393

mtmd: fix gemma 4 audio rms norm eps (#23815). Update tools/mtmd/clip.cpp. Co-authored-by: Sigbjørn Skjæret. macOS/iOS: macOS Apple Silicon (arm64) macOS…

34
The Information — AI news-outlet 1mo ago

Blue Origin New Glenn Rocket Explodes During Test

Jeff Bezos’ space company Blue Origin suffered a serious setback Thursday evening when its New Glenn rocket exploded on a launch pad in Florida during a test. Video clips of the incident show a giant fireball engulfing the rocket and surrounding structures. No one was on board…

18
r/LocalLLaMA community 1mo ago

Claude cli >= 2.1.154 breaks local use with vLLM by introducing "ctx", "msg" and "system" roles for API messages. This 1-line patch to vLLM fixes it.

diff --git a/vllm/entrypoints/anthropic/protocol.py b/vllm/entrypoints/anthropic/protocol.py
index 3ebc17117..2d5726d73 100644
--- a/vllm/entrypoints/anthropic/protocol.py
+++ b/vllm/entrypoints/anthropic/protocol.py
@@ -65,7 +65,7 @@ class AnthropicContentBlock(BaseModel):…

29
r/MachineLearning community 1mo ago

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems [R]

Are agents aging after deployment? https://arxiv.org/abs/2605.26302 On a new longitudinal deployment benchmark, switching the Claude Code CLI agent from Sonnet 4.6 to Opus 4.7 dropped PyTest pass rate by ~15%. This (to me) is a counterintuitive-enough result to pay attention…

6
Don't Worry About the Vase community 1mo ago

AI #170: Lack of Executive Order

Last week ended on a cliffhanger of sorts.

28
arXiv — Machine Learning research 1mo ago

Comparative Analysis of Liquid Neural Networks and LSTM for Sequential Pattern Recognition: Robustness, Efficiency, and Clinical Utility

arXiv:2605.27467v1 Announce Type: new Abstract: Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks…

19
arXiv — Machine Learning research 1mo ago

Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals

arXiv:2605.27583v1 Announce Type: new Abstract: Electrocardiograms (ECGs) are widely used non-invasive measurements of cardiac activity and play a central role in clinical diagnosis. Recent multimodal approaches align ECG signals with clinical reports to incorporate diagnostic…

4
arXiv — Machine Learning research 1mo ago

Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?

arXiv:2605.27733v1 Announce Type: new Abstract: Training instabilities such as loss spikes are frequently the result of stochastic gradient noise. Because of rare expressions in language training data, and multiple layer composition, the noise impact is heavy-tailed and survives…

33
arXiv — Machine Learning research 1mo ago

Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning

arXiv:2605.27954v1 Announce Type: new Abstract: Agentic large language models are increasingly used to solve real-world tasks by reasoning over goals, invoking tools, and interacting with external environments. Reinforcement learning provides a natural framework for improving…

38
arXiv — Machine Learning research 1mo ago

Dimensionality Reduction for Robust Federated Learning: A Theoretical Analysis and Convergence Guarantee

arXiv:2605.28335v1 Announce Type: new Abstract: Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but it is highly vulnerable to Byzantine attacks. Existing robust approaches can neutralize these threats but incur…

13
arXiv — NLP / Computation & Language research 1mo ago

BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking

arXiv:2605.27380v1 Announce Type: new Abstract: Cross-lingual biomedical entity linking (BEL) maps mentions in any language to unique identifiers in a biomedical knowledge base (KB), supporting clinical and biomedical NLP applications. However, expert-annotated training data for…

32
arXiv — NLP / Computation & Language research 1mo ago

StoryMI: Steerable Multi-Agent Therapeutic Dialogue Generation

arXiv:2605.27393v1 Announce Type: new Abstract: Large language models (LLMs) can generate fluent dialogue, but prior works lack situational grounding, dynamic strategy control, and evaluation aligned with clinical standards in motivational interviewing (MI). We introduce…

7
arXiv — NLP / Computation & Language research 1mo ago

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

arXiv:2605.27715v1 Announce Type: new Abstract: Large reasoning models (LRMs) achieve strong mathematical reasoning performance in English, but remain much less reliable in many low- and medium-resource languages. This gap is often explained as a failure to understand…

28
arXiv — NLP / Computation & Language research 1mo ago

Challenges in Explaining Pretrained Clinical Text Classifiers

arXiv:2605.28060v1 Announce Type: new Abstract: Explaining the predictions of neural models in clinical NLP remains a significant challenge, especially for complex tasks involving long, unstructured medical texts. While post-hoc methods like LIME and SHAP are widely used, they…

19
r/MachineLearning community 1mo ago

noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]

If you've ever tried to pick an STT vendor for a phone-based voice agent or call center product, you've probably hit this wall: you have plenty of real production audio, but it's unlabeled, so you can't compute WER on it. And the annotated public datasets (FLEURS, CommonVoice,…

31
TechCrunch — AI news-outlet 1mo ago

ClickHouse triples annualized revenue to $250M, charting a path toward an IPO

The database provider is eyeing a public debut within the next few years.

8
TechCrunch — AI news-outlet 1mo ago

ClickHouse triples annualized revenue to $250M, charting a path toward an IPO

The database provider is eyeing a public debut within the next few years.

32
r/LocalLLaMA community 1mo ago

AI is not for everyone

This may be a controversial take, but AI is not for everyone. I've made a post here before about the vibecoded garbage I see on this subreddit every time I click on it but there seems to be a larger issue. AI isn't just a set and forget karma farm. You actually have to put work…

14
The Information — AI news-outlet 1mo ago

Micron Passes $1 Trillion as AI Memory Demand Sends Shares Soaring

Micron Technology crossed $1 trillion in market value for the first time Tuesday, as shares climbed 19% on rising demand for memory chips used in AI systems. It was Micron’s largest single-day gain since 2011. The rally came after UBS sharply raised its price target for Micron…

33
arXiv — Machine Learning research 1mo ago

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

arXiv:2605.26121v1 Announce Type: new Abstract: LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering…

27
arXiv — Machine Learning research 1mo ago

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

arXiv:2605.26194v1 Announce Type: new Abstract: Clinical time-series learning is routinely constrained by small, heterogeneous cohorts and protocol drift, while its downstream use spans both classification (e.g., pathology diagnosis) and regression (e.g., temporal forecasting).…

30
arXiv — Machine Learning research 1mo ago

MuCon: Clipped Muon Updates for LLM Training

arXiv:2605.26459v1 Announce Type: new Abstract: Muon-style optimizers take a matrix-valued momentum or preconditioned update $B = U \operatorname{diag}(\sigma_1,\ldots,\sigma_r) V^\top$ and replace it with its canonical partial polar factor $\operatorname{Pol}(B) = U V^\top$.…

31
arXiv — Machine Learning research 1mo ago

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

arXiv:2605.26496v1 Announce Type: new Abstract: The Mixture of Experts (MoE) architecture is highly promising for resource-constrained on-device deployments, yet training these models from scratch incurs prohibitive costs. Current methods attempt to alleviate this by upcycling dense…

32
arXiv — Machine Learning research 1mo ago

Separate Aggregation of Split Network for Personalized Federated Learning

arXiv:2605.26571v1 Announce Type: new Abstract: Federated learning enables collaborative model training without sharing raw data, but its performance can degrade substantially under heterogeneous client data distributions. A single global model often cannot satisfy diverse…

33
arXiv — Machine Learning research 1mo ago

Image Feature Fusion-based Federated Client Unlearning (FCU)

arXiv:2605.26715v1 Announce Type: new Abstract: Major data protection regulations all mention the "right to be forgotten," and that's what pushed federated unlearning (FU) techniques forward. But one stubborn issue remains: catastrophic forgetting--you erase the target…

9
arXiv — Machine Learning research 1mo ago

Adversarial Training for Robust Coverage Network under Worst-case Facility Losses

arXiv:2605.26763v1 Announce Type: new Abstract: The Maximal Covering Location-Interdiction Problem (MCLIP) is a classic bi-level optimization problem, which is fundamental to resilient infrastructure planning yet remains computationally intractable. Specifically, the upper level…

5
arXiv — Machine Learning research 1mo ago

Ratio-Variance Regularized Policy Optimization

arXiv:2605.26784v1 Announce Type: new Abstract: Standard on-policy reinforcement learning relies on heuristic clipping to enforce trust regions, but this mechanism imposes a severe cost by indiscriminately truncating high-return yet high-divergence updates. We demonstrate that…

29
arXiv — NLP / Computation & Language research 1mo ago

The Daily Dose: Workflow-Integrated Large Language Model Automation for Clinical Summarization and Trial Identification in Radiation Oncology

arXiv:2605.26346v1 Announce Type: new Abstract: Objective: To describe the design and early clinical evaluation of The Daily Dose (TDD), an LLM-driven, automated clinical summarization and clinical-trial identification system integrated into routine radiation oncology practice.…

7
arXiv — NLP / Computation & Language research 1mo ago

Curation and Extraction of Drug-Related Entities from Reddit Platform

arXiv:2605.26445v1 Announce Type: new Abstract: Physicians learn primarily about illicit drugs from clinical overdose cases, limiting their understanding of real-world usage. Meanwhile, drug users share first-hand experiences online, offering insights into dosage and effects of…

31
arXiv — NLP / Computation & Language research 1mo ago

Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records

arXiv:2605.26463v1 Announce Type: new Abstract: Data consistency between unstructured clinical notes and structured tables in Electronic Health Records (EHRs) is essential for patient safety and clinical decision-making. However, existing work on note-table consistency…

7
arXiv — NLP / Computation & Language research 1mo ago

Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline

arXiv:2605.26560v1 Announce Type: new Abstract: Objective. Outpatient notes carry follow-up instructions pairing actions with future times ("MRI brain in two weeks"). Extracting (action, date) pairs supports scheduling and audit, but generative extractors miss the date because…

19
Vercel — AI dev-tools 1mo ago

Experimental native binaries for Vercel CLI

The Vercel CLI now ships an optional experimental native binary that starts faster, is even more secure, and requires no Node.js runtime dependency. Binaries are code-signed, allowing your OS to verify that they came from Vercel and haven't been modified. Additionally, on macOS,…

30
r/LocalLLaMA community 1mo ago

Turning local agents into self-optimizing agents

I was experimenting with a self-optimizing agentic pipeline to climb the benchmark leaderboard (TerminalBench). On a 10-task subset, I got the performance to rise from ~30% → ~90%. That loop worked, so I asked: can the same reflect-and-rewrite step run continuously against…

17
Hugging Face Daily Papers research 1mo ago

ECHO: Terminal Agents Learn World Models for Free

Abstract (AI-generated summary): Environment cross-entropy hybrid objective combines policy-gradient loss with auxiliary environment observation prediction to provide dense supervision from terminal feedback, improving agent performance and self-improvement capabilities. CLI agents…

23
