Tag

Security

500 articles archived under #security · RSS

The Information — AI news-outlet 1mo ago

Salesforce and Snowflake Earnings to Focus Attention on AI’s Software Impact

As we return from the long weekend, we’re preparing for a resumption this week of one of tech’s big debates: whether AI is killing enterprise software. Salesforce, Snowflake and Asana are each reporting earnings for the first fiscal quarter in the next few days, providing us…

4
r/LocalLLaMA community 1mo ago

Anyone use QwQ-32B? It's over a year old? Has Qwen 3.6 27b basically replaced it?

I seen this one mentioned but it was a source from about 14 months ago. In the age of the Qwen 3.6 and Gemma 4- is there still a use for QwQ 32B? Does anyone still favour it over the new stuff? If so, do you use it for coding? something else? Thanks   submitted by  …

29
r/MachineLearning community 1mo ago

Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P]

I’ve been thinking about a problem in current agent systems: Most agents are becoming very good at execution, but the decision layer before execution is still unclear. Coding agents, research agents, tool loops, sandboxes, workflows, and harnesses are all improving quickly. Once…

38
r/MachineLearning community 1mo ago

I’m building an open-source decision layer above AI agents [P]

Hi everyone, I’m Jia, the creator of Spice. I’ve been working on an open-source project called Spice. The simplest way to describe it is: Spice is a decision layer above agents. Most agent systems today are very focused on execution, They are getting better at doing tasks after…

30
r/LocalLLaMA community 1mo ago

Next year we're getting 0.5T model from Grok

Tweet : https://xcancel.com/elonmusk/status/2058796067592736866#m Right now it joined "Grok-3 Opensource Release" club.   submitted by   /u/pmttyji [link]   [comments]

13
arXiv — Machine Learning research 1mo ago

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

arXiv:2605.23037v1 Announce Type: new Abstract: Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are…

8
arXiv — Machine Learning research 1mo ago

Steered Generation via Gradient-Based Optimization on Sparse Query Features

arXiv:2605.23040v1 Announce Type: new Abstract: Latent steering exploits internal representations of Large Language Models (LLMs) to guide generation, yet interventions on dense states can entangle distinct semantic features. In this paper, we investigate attention query…

9
arXiv — Machine Learning research 1mo ago

Dreaming Smoothly and Sample Efficiently with Gradient Penalized Latent Dynamics

arXiv:2605.23089v1 Announce Type: new Abstract: Model-based reinforcement learning improves sample efficiency by learning a world model. However, existing latent world models such as DreamerV3 do not explicitly enforce local smoothness in their learned transition dynamics,…

15
arXiv — Machine Learning research 1mo ago

Convex Low-resource Accent-Robust Language Detection in Speech Recognition

arXiv:2605.23235v1 Announce Type: new Abstract: Globalization and multiculturalism continue to produce increasingly diverse speech varieties. Yet current spoken dialogue systems frequently fail on under-represented dialects and accents, often misidentifying the input language…

29
arXiv — Machine Learning research 1mo ago

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

arXiv:2605.23410v1 Announce Type: new Abstract: The explosive growth of open-source model repositories has created a Model Jungle, where checkpoints are frequently shared without adequate documentation or metadata. While weight-space learning offers a pathway to identify and…

20
arXiv — Machine Learning research 1mo ago

Sample-wise Targeted Adversarial Attacks on Test-time Adaptation

arXiv:2605.23411v1 Announce Type: new Abstract: Test-time adaptation (TTA) effectively counters distribution shifts but exposes models to adversarial manipulation via the unlabeled test stream. Existing class-wise targeted attacks remain impractical for stealthy exploitation in…

12
arXiv — Machine Learning research 1mo ago

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

arXiv:2605.23415v1 Announce Type: new Abstract: Reinforcement learning has long struggled with poor sample efficiency. One promising approach to mitigate this problem is leveraging group-invariant Markov Decision Processes ($G$-invariant MDPs). Existing works in this direction…

15
arXiv — Machine Learning research 1mo ago

An Open-Source Training Dataset for Foundation Models for Black-box Optimization

arXiv:2605.23417v1 Announce Type: new Abstract: Most black-box optimization methods require extensive hyperparameter tuning, often limiting their ability to generalize across different optimization domains. Foundation models for black-box optimization that learn optimization…

21
arXiv — Machine Learning research 1mo ago

Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance

arXiv:2605.23453v1 Announce Type: new Abstract: We conducted a reproducibility-oriented re-evaluation of prior migraine classification studies, correcting for data leakage and metric bias. We then introduced (i) a clinically motivated aggregation of two hemiplegic subtypes…

23
arXiv — NLP / Computation & Language research 1mo ago

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

arXiv:2605.22843v1 Announce Type: new Abstract: Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services. In real-world scenarios, performance is often…

18
arXiv — NLP / Computation & Language research 1mo ago

Graph Alignment Topology as an Inductive Bias for Grounding Detection

arXiv:2605.22963v1 Announce Type: new Abstract: Large Language Models (LLMs) are optimized to produce distributionally plausible continuations rather than to explicitly verify whether generated propositions are entailed by source documents. This inductive bias enables…

12
arXiv — NLP / Computation & Language research 1mo ago

Articulatory strategy as a source of variation in acoustic vowel dynamics

arXiv:2605.23416v1 Announce Type: new Abstract: Acoustic vowel dynamics have some speaker-identifying characteristics, which have been ascribed to individual properties of articulatory strategies: formant transitions have a particular shape because speakers move their…

6
arXiv — NLP / Computation & Language research 1mo ago

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

arXiv:2605.23618v1 Announce Type: new Abstract: We benchmark Google Embeddings (GE2), a Vertex-AI-hosted bi-encoder with 2,048-token context and explicit task-type conditioning, against five open-source alternatives: BGE-M3, E5-large, Multilingual-E5-large (mE5-L), LaBSE, and…

7
arXiv — NLP / Computation & Language research 1mo ago

AI-Friendly LaTeX: Using LaTeX Code as a Knowledge Source for Retrieval-Augmented Generation

arXiv:2605.22923v1 Announce Type: cross Abstract: Large language models can answer questions about textbooks, lecture notes, and programming exercises more reliably when their answers are grounded in an explicit knowledge source. Retrieval-augmented generation (RAG) is a common…

30
arXiv — NLP / Computation & Language research 1mo ago

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

arXiv:2505.13893v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have intensified efforts to fuse heterogeneous open-source models into a unified system that inherits their complementary strengths. Existing logit-based fusion methods maintain…

35
arXiv — NLP / Computation & Language research 1mo ago

Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches

arXiv:2512.12677v2 Announce Type: replace Abstract: We explore efficient strategies to fine-tune decoder-only Large Language Models (LLMs) for downstream text classification under resource constraints. Two approaches are investigated: (1) attaching a classification head to a…

4
r/LocalLLaMA community 1mo ago

opensource music reccomendation / playlist, similar to spotify radio / YT music mix?

Any recommendations for this? Initially, i was thinking that LLMs probably not the right thing for this (assuming your source data is all listening metrics), HOWEVER, if you combine a) user listening data; AND b) user comments / text data / reccs/ reviews / forum posts / social…

6
r/LocalLLaMA community 1mo ago

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

A few weeks ago, after finishing FastDMS , I started toying around writing some RDNA3 kernels again to see how fast I could get Qwen 3.6 MoE running. It turned out well enough, so over the past couple weeks, I turned those experiments into hipEngine , a new open source (AGPLv3)…

13
Hacker News — AI on Front Page community 1mo ago

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

Article URL: https://audiomass.co/?multitrack=1 Comments URL: https://news.ycombinator.com/item?id=48258015 Points: 338 # Comments: 68

29
r/MachineLearning community 1mo ago

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

At our work we use CUDA in Rust since the company switched to it recently. Rust has pretty good Driver API bindings but it made me wonder why the hell we cant have something decent in Go without cgo. I mostly build ML tools in the last month and Go is my main language for pretty…

30
r/MachineLearning community 1mo ago

PapersWithCode new features - week 1 [P]

Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co , a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and…

23
r/LocalLLaMA community 1mo ago

Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)

Hi, (TLDR.): Qwen in its MTP version has tool call bugs and outputs everything into tool/thinking blocks - mangeling the output - canceling the +speed with repeated wrong tool calls! DCSS works well with non MTP qwen even on smaller qwants. im Testing the new MTP models and…

19
Hacker News — AI on Front Page community 1mo ago

Microsoft open-sources "the earliest DOS source code discovered to date"

https://opensource.microsoft.com/blog/2026/04/28/continuing-... Comments URL: https://news.ycombinator.com/item?id=48253386 Points: 224 # Comments: 55

33
r/LocalLLaMA community 1mo ago

Embeddings for NVIDIA's Nemotron Personas

I extracted embedding vectors for nvidia/Nemotron-Personas dataset. It's an incredible resource consisting of millions of synthetic personas with detailed backgrounds (names, ages, occupations, hobbies, and more), but finding specific personas or clustering them is difficult. To…

5
r/MachineLearning community 1mo ago

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

Hi guys, been exploring here for a while, wanted to share something we've been working on. It's called Spice , an open-source decision layer above agents. We have tons of great execution agents now — Claude Code, Codex, hermes, etc. They're good at doing stuff. But they're…

6
r/LocalLLaMA community 1mo ago

meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face

🚀 Model Introduction We are excited to announce the release of LongCat-Video-Avatar 1.5, an upgraded open-source framework that prioritizes extreme empirical optimization and production-readiness for audio-driven human video generation. Built upon the LongCat-Video foundation…

21
r/LocalLLaMA community 1mo ago

I fine-tuned Cohere Transcribe to support diarization and timestamps

Hi I'll keep it short: Cohere-transcribe is currently the best open source speech to text model (and possibly even better than other proprietary models). BUT it doesn't support diarization (speaker identification) and timestamps, even though there are tokens for it in the…

36
Hacker News — AI on Front Page community 1mo ago

CISA tries to contain data leak

Article URL: https://krebsonsecurity.com/2026/05/lawmakers-demand-answers-as-cisa-tries-to-contain-data-leak/ Comments URL: https://news.ycombinator.com/item?id=48238429 Points: 233 # Comments: 53

27
r/LocalLLaMA community 1mo ago

trained a prompt injection detector using ml-intern and DeepSeek v4 Flash, runs in the browser

Trained a prompt injection classifier using ml-intern + DeepSeek v4 Flash. DistilBERT, F1 99%, ONNX int8, ~65 MB, runs in browser with Transformers.js v3. You can try it here: https://huggingface.co/spaces/av-codes/prompt-injection-detector --- I've been interested in prompt…

5
r/LocalLLaMA community 1mo ago

DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals

https://www.bloomberg.com/news/articles/2026-05-22/deepseek-founder-declares-agi-goal-as-10-billion-round-advances   submitted by   /u/External_Mood4719 [link]   [comments]

17
arXiv — Machine Learning research 1mo ago

AgForce Enables Antigen-conditioned Generative Antibody Design

arXiv:2605.21610v1 Announce Type: new Abstract: Antibody design methods condition on antigen structure to generate complementarity-determining regions (CDR), yet a systematic evaluation of baseline methods reveals that they largely ignore the antigen input. We identify three…

9
arXiv — Machine Learning research 1mo ago

Aerodynamic force reconstruction using physics-informed Gaussian processes

arXiv:2605.22111v1 Announce Type: new Abstract: Accurate modeling of aerodynamic loads is essential for understanding and predicting the responses of complex structural systems. However, these models often rely on simplifications of the true physical forces, introducing…

23
arXiv — NLP / Computation & Language research 1mo ago

ArabDiscrim: A Decade-Long Arabic Facebook Corpus on Racism and Discrimination

arXiv:2605.22081v1 Announce Type: new Abstract: We present ArabDiscrim, a decade-long lexical resource and corpus of 293K public Arabic Facebook posts (2014--2024) discussing racism and discrimination. Unlike existing Twitter-centric datasets, ArabDiscrim integrates…

35
arXiv — NLP / Computation & Language research 1mo ago

Evaluation of Chunking Strategies for Effective Text Embedding in Low-Resource Language on Agricultural Documents

arXiv:2605.22203v1 Announce Type: new Abstract: In this study, we compare the performance of four text chunking approaches: Recursive, Khmer-Aware, Sentence-Based, and LLM-Based within a Retrieval-Augmented Generation (RAG) framework applied to Khmer agricultural documents. The…

15
llama.cpp releases dev-tools 1mo ago

b9275

metal : optimize concat kernel and fix set kernel threads ( #23411 ) metal : fix GGML_OP_SET kernel threads tests : extend test_cpy to support different src/dst shapes Extend test_cpy to support different source and destination tensor shapes for CPY operations (reshaping), where…

37
r/LocalLLaMA community 1mo ago

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

My paper got published today at Arxiv. It raises questions about how language models behave when the framing of a request shifts. Small open-source AI models can be moved from honest to dishonest behaviour by little more than a change in tone. Asked to solve coding problems…

4
TechCrunch — AI news-outlet 1mo ago

With aluminum prices up 20%, recycling startups bet on AI to cash in

Recycling startups are using AI to improve the recovery of critical minerals like aluminum, aiming to build a massive source of the metal.

16
r/LocalLLaMA community 1mo ago

'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.

This has turned out to be useful to many of my friends so I thought I'd share here as well. I created a tool and documentation page for most major open-souce project's adherence to 'OpenAI compatibility' after seeing inconsistencies between engines like vLLM and llama.cpp. Now…

18
arXiv — Machine Learning research 1mo ago

OmniISR: A Unified Framework for Centralized and Federated Learning via Intermediate Supervision and Regularization

arXiv:2605.20276v1 Announce Type: new Abstract: The global deployment of edge intelligence operates across heterogeneous legal frameworks. While some regions permit centralized learning (CL) via cloud data aggregation, others enforce strict data localization, necessitating…

12
arXiv — Machine Learning research 1mo ago

ZEBRA: Zero-shot Budgeted Resource Allocation for LLM Orchestration

arXiv:2605.20485v1 Announce Type: new Abstract: As autonomous agents increasingly execute end-to-end tasks under fixed monetary budgets, the pressing open question shifts from whether the budget is respected, to how to spend it effectively. Existing budget-aware methods…

23
arXiv — Machine Learning research 1mo ago

REFLECTOR: Internalizing Step-wise Reflection against Indirect Jailbreak

arXiv:2605.20654v1 Announce Type: new Abstract: While Large Language Models (LLMs) demonstrate remarkable capabilities, they remain susceptible to sophisticated, multi-step jailbreak attacks that circumvent conventional surface-level safety alignment by exploiting the internal…

36
arXiv — Machine Learning research 1mo ago

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

arXiv:2605.20749v1 Announce Type: new Abstract: Gated Linear Units (GLU) and their variants are widely adopted in modern open-source large language model architectures and consistently outperform their non-gated counterparts, yet the underlying reasons for this advantage remain…

34
arXiv — Machine Learning research 1mo ago

Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment

arXiv:2605.20780v1 Announce Type: new Abstract: Physics-informed diffusion models typically enforce PDE constraints only on final outputs, leaving intermediate representations unconstrained and prone to shortcut learning under shifted boundary conditions. We introduce…

8
arXiv — NLP / Computation & Language research 1mo ago

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

arXiv:2605.20591v1 Announce Type: new Abstract: Medical large language models (LLMs), including custom medical GPTs (MedGPTs) and open-source models, are increasingly deployed on web platforms to provide clinical guidance. However, they pose risks of hallucination, policy…

33
arXiv — NLP / Computation & Language research 1mo ago

Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

arXiv:2605.21227v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used for writing assistance in small contact languages, yet it is unclear whether they respect community norms around lexical borrowing and neology. We introduce LexNeo-Bench, a…

15

Salesforce and Snowflake Earnings to Focus Attention on AI’s Software Impact

Anyone use QwQ-32B? It's over a year old? Has Qwen 3.6 27b basically replaced it?

Reconstructing the agent methodology: Decoupling decision-making and execution - open source [P]

I’m building an open-source decision layer above AI agents [P]

Next year we're getting 0.5T model from Grok

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

Steered Generation via Gradient-Based Optimization on Sparse Query Features

Dreaming Smoothly and Sample Efficiently with Gradient Penalized Latent Dynamics

Convex Low-resource Accent-Robust Language Detection in Speech Recognition

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

Sample-wise Targeted Adversarial Attacks on Test-time Adaptation

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

An Open-Source Training Dataset for Foundation Models for Black-box Optimization

Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

Graph Alignment Topology as an Inductive Bias for Grounding Detection

Articulatory strategy as a source of variation in acoustic vowel dynamics

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

AI-Friendly LaTeX: Using LaTeX Code as a Knowledge Source for Retrieval-Augmented Generation

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches

opensource music reccomendation / playlist, similar to spotify radio / YT music mix?

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

Working on a cgo-free CUDA binding in Go for ML stuff Week 3 - open source [P]

PapersWithCode new features - week 1 [P]

Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)

Microsoft open-sources "the earliest DOS source code discovered to date"

Embeddings for NVIDIA's Nemotron Personas

Spice: We built an open-sourced decision layer that sits above your AI agents (controls agent actions before execution) [P]

meituan-longcat/LongCat-Video-Avatar-1.5 · Hugging Face

I fine-tuned Cohere Transcribe to support diarization and timestamps

CISA tries to contain data leak

trained a prompt injection detector using ml-intern and DeepSeek v4 Flash, runs in the browser

DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals

AgForce Enables Antigen-conditioned Generative Antibody Design

Aerodynamic force reconstruction using physics-informed Gaussian processes

ArabDiscrim: A Decade-Long Arabic Facebook Corpus on Racism and Discrimination

Evaluation of Chunking Strategies for Effective Text Embedding in Low-Resource Language on Agricultural Documents

b9275

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

With aluminum prices up 20%, recycling startups bet on AI to cash in

'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.

OmniISR: A Unified Framework for Centralized and Federated Learning via Intermediate Supervision and Regularization

ZEBRA: Zero-shot Budgeted Resource Allocation for LLM Orchestration

REFLECTOR: Internalizing Step-wise Reflection against Indirect Jailbreak

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models