Tag

Developer Tool

500 articles archived under #developer-tool · RSS

arXiv — Machine Learning research 20d ago

DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction

arXiv:2606.10243v1 Announce Type: new Abstract: Offsite conversion rate (OCVR) prediction is an important ranking problem in computational recommendation systems. This task presents a modeling challenge: click signals are abundant and exhibit short temporal horizons, whereas…

25
arXiv — NLP / Computation & Language research 20d ago

Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs

arXiv:2606.10722v1 Announce Type: new Abstract: We study dense-to-sparse continual training as a way to construct channel-sparse large language models from dense checkpoints. Starting from a Qwen2.5-8B dense backbone, we continue training at 32K context and introduce a…

24
arXiv — NLP / Computation & Language research 20d ago

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

arXiv:2606.10796v1 Announce Type: new Abstract: Automatic Depression Detection (ADD) from clinical interviews is a pivotal task in computational mental health, yet it remains challenging due to two critical obstacles: 1) difficulty in modeling complex but sparsely distributed…

5
arXiv — NLP / Computation & Language research 20d ago

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

arXiv:2606.10279v1 Announce Type: cross Abstract: Supervised fine-tuning with synthetic rationale data is widely assumed to improve language model performance on clinical prediction tasks by teaching models not just what to predict but why. We test this assumption on five-year…

28
TechCrunch — AI news-outlet 20d ago

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button

Anthropic's Claude Fable 5 is going to be a big hit with the web's vibe coders.

27
llama.cpp releases dev-tools 20d ago

b9586: webui: implement pinned conversations support (#21387)

webui: implement pinned conversations support webui: linter/prettier pass Fix the unused handleMobileSidebarItemClick from the component. the search should find pinned conversations as well Co-authored-by: Pascal [email protected] Co-authored-by: Pascal…

24
Anthropic SDK (Python) releases dev-tools 20d ago

v0.108.0

0.108.0 (2026-06-09) Full Changelog: v0.107.1...v0.108.0 Features api: add support for claude-mythos-5 and claude-fable-5, with support for server-side fallbacks on refusal ( 6b76649 ) client: adds client-side fallbacks middleware for API providers that do not support…

12
GitHub Blog — AI & ML official-blog 20d ago

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Custom agents let GitHub Copilot CLI understand your stack and team workflows, turning one-off terminal prompts into repeatable, reviewable processes. The post From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI appeared first on The GitHub Blog .

20
NVIDIA Developer Blog official-blog 20d ago

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodipine,...

9
Hugging Face Daily Papers research 21d ago

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

Abstract SkeMex is a self-evolving framework that enhances medical agents through structured skill memory, improving long-term clinical reasoning by distinguishing useful experiences and governing memory retention based on contextual utility. Generated by…

32
arXiv — Machine Learning research 21d ago

TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation

arXiv:2606.07569v1 Announce Type: new Abstract: Accurate carbon emission monitoring is critical for climate policy and emerging regulatory mechanisms such as the EU Carbon Border Adjustment Mechanism, yet city-level high-frequency monitoring data remain extremely scarce,…

31
arXiv — Machine Learning research 21d ago

HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning

arXiv:2606.07621v1 Announce Type: new Abstract: Edge services increasingly use federated learning to personalize on-device models while keeping sensitive data local. In practice, deployments must handle heterogeneity in both client resources and local data distributions.…

24
arXiv — Machine Learning research 21d ago

BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing

arXiv:2606.07692v1 Announce Type: new Abstract: Foundation models for wearable biosignals have matched or exceeded supervised specialists across a range of clinical tasks, yet all rely on modalities that require deliberate user action--wearing a device or visiting a sleep lab.…

14
arXiv — Machine Learning research 21d ago

EvoCSFL: Surrogate-Assisted Evolutionary Client Selection for Efficient and Robust Federated Learning

arXiv:2606.07702v1 Announce Type: new Abstract: The heterogeneity of client data and systems makes it difficult to achieve satisfactory convergence speed and robustness in federated learning with random client selection. To address this issue, this paper proposes a…

29
arXiv — Machine Learning research 21d ago

Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling

arXiv:2606.07898v1 Announce Type: new Abstract: High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key…

16
arXiv — Machine Learning research 21d ago

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

arXiv:2606.08037v1 Announce Type: new Abstract: Electrocardiogram (ECG) classification models often suffer from severe label scarcity, making semi-supervised learning (SSL) an attractive strategy for reducing annotation costs. In clinical settings, however, unlabeled pools…

26
r/LocalLLaMA community 21d ago

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

I fine-tuned NVIDIA's Parakeet TDT 0.6B v2 for clinical speech and am releasing the weights as Omi Med STT v1 (CC-BY-4.0). Disclosure: I'm the founder of Omi Health and built this. Happy to dig into the training mix, benchmark, failure cases, quantization, or anything else. The…

14
r/LocalLLaMA community 21d ago

Here's a llama.cpp CLI Command builder.

No accounts or sign up. No email requirements. No pop-ups and no cookies. No ads. Info is saved locally in your browser so you dont lose any progress. Its got every single flag and argument that could be found in the documentation. Tool tips are added to everything. Every field…

19
Vercel — AI dev-tools 21d ago

Domain Search is now available through the Vercel CLI

You can now use the Vercel CLI to search domains. Using the vercel domains search command, you can supply a domain name and retrieve availability and price results for all TLDs that Vercel supports. You can also filter by TLD, apply sorting, and filter out unavailable domains.…

7
Vercel — AI dev-tools 21d ago

How Fern runs multi-tenant docs for Webflow and ElevenLabs on Vercel

Fern on Vercel 3x faster time to first byte Page load times reduced by 80% 6 million+ page views per month from 1 million+ unique visitors 65% of the platform migrated from Pages Router to App Router in 7 days Fern helps companies ship developer documentation and SDKs, running…

4
llama.cpp releases dev-tools 21d ago

b9562

mtmd : add video input support ( #24269 ) wip ok: lazy bitmap API remember to free lazy text wip add mtmd_helper_video support video input on server (base64 input) add MTMD_VIDEO config add timestamp update CLI cli: allow auto-completion for video add --video arg fix build…

22
llama.cpp releases dev-tools 21d ago

b9559

cli: fix spinner not show during prompt processing ( #24283 ) macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu…

10
r/LocalLLaMA community 21d ago

llama-launcher Release

Hello everyone, I've been working on a point and click GUI to make tinkering with llama-server flags much quicker and easier, I thought I'd share for anyone else who might be interested. It's also great for anyone new to llama.cpp that is looking to get into it and doesn't want…

7
r/MachineLearning community 21d ago

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

I've been building agents for about a year and recently shipped one for a client running ~140 MCP-exposed tools at peak. Along the way I made the canonical mistake. I used cosine similarity over tool description embeddings to pick which tools the model could see per turn. Worked…

9
r/LocalLLaMA community 21d ago

Meddies PII: An Open Multilingual De-identification Model for Clinical Text

A clinical AI model does not need to know who the patient is to reason clinically. It needs the symptoms, medications, lab results, diagnosis history, and treatment course. The problem is that in real medical records, those facts usually sit next to identifiers: names, record…

38
Ars Technica — AI news-outlet 21d ago

The weather and climate science AI revolution isn’t revolutionary

Machine learning has its limits—how is it being used?

21
arXiv — Machine Learning research 22d ago

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

arXiv:2606.06647v1 Announce Type: new Abstract: Objective. EEG foundation models (FMs) report strong accuracy on clinical resting-state EEG. However, high accuracy under subject-disjoint cross-validation remains ambiguous: it can reflect a genuine clinical biomarker, or…

38
arXiv — Machine Learning research 22d ago

SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling

arXiv:2606.06820v1 Announce Type: new Abstract: Agentic Large Language Model (LLM) systems decompose complex tasks into workflow Directed Acyclic Graphs (DAGs) whose primitives must be scheduled on heterogeneous clusters. Existing deep reinforcement learning (DRL) schedulers are…

26
arXiv — Machine Learning research 22d ago

Constructing VAE Latent Spaces with Prescribed Topology

arXiv:2606.07058v1 Announce Type: new Abstract: Variational autoencoders (VAEs) learn low-dimensional latent representations of high-dimensional data. When the data lies on a manifold with non-Euclidean topology, the standard Gaussian prior introduces a topological mismatch that…

17
arXiv — Machine Learning research 22d ago

The discovery of the effects of women employment participation on the fertility of developing countries: A panel data approach

arXiv:2606.07093v1 Announce Type: new Abstract: The fertility trend in developing countries has experienced a significant decline in the last few decades; at the same time, the role of women in the workplace has improved. To have a better insight of the causality of the rate of…

19
arXiv — Machine Learning research 22d ago

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

arXiv:2606.07141v1 Announce Type: new Abstract: Language models trained for clinical disease inference are trained on patient data, which may include sensitive and private information, and data owners may request the removal of their data from a trained model due to privacy or…

12
arXiv — Machine Learning research 22d ago

A robust PPG foundation model using multimodal physiological supervision

arXiv:2606.07365v1 Announce Type: new Abstract: Photoplethysmography (PPG), a non-invasive measure of changes in blood volume, is widely used in both wearable devices and clinical settings. Recent PPG foundation models either use open-source ICU datasets with pretraining…

8
arXiv — Machine Learning research 22d ago

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

arXiv:2606.06509v1 Announce Type: cross Abstract: Numerous medical imaging problems must be solved under limited labels and constrained compute, yet it remains unclear whether performance gains are driven mainly by more expressive models or by better representation of clinically…

17
arXiv — NLP / Computation & Language research 22d ago

When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

arXiv:2606.07237v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly used in healthcare for tasks such as clinical question answering, diagnosis support, and report summarization. Despite their promise, these models remain highly sensitive to subtle…

17
arXiv — NLP / Computation & Language research 22d ago

LLM-Guided Evolution for Medical Decision Pipelines

arXiv:2606.07342v1 Announce Type: new Abstract: Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt and pipeline engineering. We study LLM-guided MAP-Elites evolution as an inference-time alternative for discovering…

9
arXiv — NLP / Computation & Language research 22d ago

TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment

arXiv:2606.07451v1 Announce Type: cross Abstract: Vision-language models such as CLIP are highly useful for diverse tasks due to their shared image-text embedding space. Despite this, the image and text embeddings are often poorly aligned, affecting downstream performance.…

6
arXiv — NLP / Computation & Language research 22d ago

CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction

arXiv:2508.03668v3 Announce Type: replace Abstract: Click-Through Rate (CTR) prediction, a core task in recommendation systems, estimates user click likelihood using historical behavioral data. Modeling user behavior sequences as text to leverage Language Models (LMs) for this…

5
r/LocalLLaMA community 22d ago

Dockerized Nemotron 3.5 ASR — Switched from Parakeet, better multilingual support + streaming (4.5x realtime speed on cpu)

I was originally using Parakeet for my speech recognition pipeline but decided to give Nemotron 3.5 a shot. After testing it on some multilingual audio clips, it's been working great so far. What sold me: - Better language support (40+ locales from one model) - Native streaming…

17
r/LocalLLaMA community 23d ago

Another 1-click admin account takeover in pewdiepie's AI tool (language in video nsfw)

  submitted by   /u/theonejvo [link]   [comments]

10
Hugging Face official-blog 23d ago

Job Searcher

Back to Articles Job Searcher Team Article Published June 6, 2026 Upvote 2 Emre emrekuruu build-small-hackathon Job hunting as a new grad is a full-time job by itself. You sift through hundreds of postings every week to find a handful worth applying to. You click "Easy Apply"…

15
Simon Willison community 24d ago

micropython-wasm 0.1a2

Release: micropython-wasm 0.1a2 I added a CLI to micropython-wasm ( issue #7 ), inspired by the first draft of the blog entry when I realized it would be a great way to illustrate the Try it yourself section. Tags: python , sandboxing , webassembly , micropython

4
Anthropic SDK (Python) releases dev-tools 24d ago

v0.106.0

0.106.0 (2026-06-05) Full Changelog: v0.105.2...v0.106.0 Features api: mark Claude Opus 4.1 as deprecated ( 85068cc ) Bug Fixes client: make Foundry client copy() and with_options() work ( 94146ac ) transform schema: preserve $defs when schema root is a $ref ( #1642 ) ( fc58e06…

19
Hugging Face Daily Papers research 24d ago

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

Abstract BRepCLIP enables multimodal representation learning for CAD models by aligning boundary representation geometry with language and image embeddings through contrastive pretraining, achieving superior retrieval and classification performance compared to point-based…

7
Hugging Face Daily Papers research 24d ago

MAOAM: Unified Object and Material Selection with Vision-Language Models

Abstract A unified vision-language model framework enables precise object and material selection through text or click interactions, supporting diverse editing workflows with improved robustness. Generated by Qwen/Qwen2.5-Coder-32B-Instruct Selection is a core operation in…

11
llama.cpp releases dev-tools 24d ago

b9530

cli: fix model params not propagated ( #23893 ) Fixes #23847 macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu…

21
arXiv — Machine Learning research 25d ago

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

arXiv:2606.05435v1 Announce Type: new Abstract: Differentially private stochastic gradient descent (DP-SGD) has become the standard framework for privacy-preserving machine learning, yet its reliance on a fixed gradient clipping threshold to limit sensitivity remains a…

12
arXiv — Machine Learning research 25d ago

Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping

arXiv:2606.05731v1 Announce Type: new Abstract: In-season crop type mapping is critical for food security in the face of increasingly extreme climate-related threats to crops. Currently, the USDA Cropland Data Layer provides crop type labels at 30m resolution and is available…

8
arXiv — Machine Learning research 25d ago

HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care

arXiv:2606.05994v1 Announce Type: new Abstract: Medical knowledge graphs (MKGs) infused with clinical knowledge have been increasingly used to model electronic health records (EHRs) to support interpretable predictions in healthcare domain. However, existing MKG-based approaches…

31
arXiv — Machine Learning research 25d ago

Your GFlowNet Secretly Learns an Optimal Transport Plan

arXiv:2606.06272v1 Announce Type: new Abstract: Generative Flow Networks (GFlowNets) are a framework for sampling structured objects via stochastic trajectories in a directed graph. In this work, we establish a theoretical connection between non-acyclic GFlowNets and optimal…

5
arXiv — NLP / Computation & Language research 25d ago

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

arXiv:2606.05561v1 Announce Type: new Abstract: Speech-based mental health screening offers scalable depression detection, yet clinical deployment faces a significant barrier: users' privacy concerns about demographic information exposure. Current techniques struggle to resolve…

34

DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction

Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

Anthropic&#8217;s Fable 5 can make weirdly fun video games with the click of a button

b9586: webui: implement pinned conversations support (#21387)

v0.108.0

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation

HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning

BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing

EvoCSFL: Surrogate-Assisted Evolutionary Client Selection for Efficient and Robust Federated Learning

Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

Here's a llama.cpp CLI Command builder.

Domain Search is now available through the Vercel CLI

How Fern runs multi-tenant docs for Webflow and ElevenLabs on Vercel

b9562

b9559

llama-launcher Release

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

Meddies PII: An Open Multilingual De-identification Model for Clinical Text

The weather and climate science AI revolution isn’t revolutionary

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling

Constructing VAE Latent Spaces with Prescribed Topology

The discovery of the effects of women employment participation on the fertility of developing countries: A panel data approach

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

A robust PPG foundation model using multimodal physiological supervision

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

LLM-Guided Evolution for Medical Decision Pipelines

TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment

CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction

Dockerized Nemotron 3.5 ASR — Switched from Parakeet, better multilingual support + streaming (4.5x realtime speed on cpu)

Another 1-click admin account takeover in pewdiepie's AI tool (language in video nsfw)

Job Searcher

micropython-wasm 0.1a2

v0.106.0

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

MAOAM: Unified Object and Material Selection with Vision-Language Models

b9530

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping

HoT-SSM:Higher-order Temporal Knowledge Graph Reasoning with State Space Models for Health Care

Your GFlowNet Secretly Learns an Optimal Transport Plan

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button