Tag

Security

500 articles archived under #security · RSS

arXiv — NLP / Computation & Language research 4d ago

Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents

arXiv:2606.26479v1 Announce Type: cross Abstract: Recent work (2024 to 2026) has converged on a strategy for defending tool-using LLM agents against indirect prompt injection: rather than training the model to refuse malicious instructions, enforce security outside the model…

38
arXiv — NLP / Computation & Language research 4d ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

arXiv:2506.15681v4 Announce Type: replace Abstract: Recent advancements in vision-language models (VLMs) have leveraged large language models (LLMs) to achieve performance on par with closed-source systems like GPT-4V. However, deploying these models in real-world scenarios,…

16
r/LocalLLaMA community 4d ago

Stop waiting for Qwen3.7 Openweights.

Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes, including 9B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks. Hugging Face:…

36
ThursdAI news-outlet 4d ago

GLM 5.2 total victory: the week open source won and nobody panicked

From CoreWeave: A chill week, but a total Open Source victory for GLM 5.2 + Sakana Fugu, Krea Open Sources, OpenAI makes inference chips with broadcom, Karpathy gets heat about the new Claude Tag...

35
r/LocalLLaMA community 4d ago

Built an open source local first Kanban workflow for running AI coding agents without babysitting every step

I’ve been building BatonBot, a local first app for running AI coding workflows with less babysitting. The problem I kept running into, especially with local models, is that coding agents can be useful but the workflow gets slow: start task → wait → check output → fix next issue…

10
r/MachineLearning community 4d ago

For ECCV, Springer Meteor. How are we supposed to upload the files? [D]

Source files + final paper PDF: a ZIP containing the source files and final paper.pdf. Where does the supplemental material get uploaded? The email says to include it in a "supplementary_materiel" folder. This is all very confusing. Can someone clarify?   submitted…

15
NVIDIA Developer Blog official-blog 4d ago

Streamlining Resource Binding with End-to-End Support for Vulkan Descriptor Heaps

Shaders are GPU programs that process visual data—such as rays, pixels, geometry, and textures—to produce specific rendering effects. Shaders find necessary...

32
r/MachineLearning community 4d ago

ECCV 2026 camera-ready deadline: June 27 or June 30? [D]

In the recent Springer/Meteor email, it says: The deadline for the upload of the camera-ready manuscripts and source files is 30 June. This is a hard deadline and will not be extended. However, in the same email, the Meteor submission line for my paper says: submission due: June…

35
Hacker News — AI on Front Page community 4d ago

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

Hi HN, Nick here. We’re launching OpenKnowledge (https://openknowledge.ai/), a “what you see is what you get” markdown editor that has direct integrations with Claude, Codex, and other agents. Available as a macOS app or Web UI+CLI. Fully free/local and OSS. We built this…

20
Vercel — AI dev-tools 4d ago

AI SDK 7

AI SDK, with over 16 million weekly downloads, is the TypeScript SDK for building AI applications, features, frameworks, and agents across any model provider. It's the same layer eve, Vercel's open-source agent framework, is built on. AI SDK 7 adds production depth for agent…

15
Hacker News — AI on Front Page community 4d ago

LastPass notifies users of yet another data breach

Article URL: https://9to5mac.com/2026/06/23/lastpass-notifies-users-of-yet-another-data-breach/ · Comments URL: https://news.ycombinator.com/item?id=48671468 · Points: 229 · Comments: 106

18
Hugging Face Daily Papers research 4d ago

Distill Once, Adapt Life-Long: Exploring Dataset Distillation for Continual Test-Time Adaptation

Abstract: DO-ALL is a test-time adaptation framework that uses dataset distillation to create synthetic anchors for stable long-term model performance without retaining source data. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) Continual Test-Time Adaptation (CTTA) aims to…

20
r/LocalLLaMA community 4d ago

Could you help me test MTP for GLM-4.7-Flash?

Some of you may remember old models from GLM: GLM Air or GLM Flash. I know they’re outdated, but I have a soft spot for them, so I am currently working on enabling MTP for them in llama.cpp. If you know how to compile llama.cpp from source and have the hardware to run…

23
arXiv — Machine Learning research 5d ago

Learning Subset-Shared Invariances for Domain Generalization with Mixture-of-Experts

arXiv:2606.25665v1 Announce Type: new Abstract: Domain generalization (DG) aims to learn a model from one or more source domains that generalizes to an unseen target domain without accessing target data during training. A common approach enforces invariance of representations…

29
arXiv — NLP / Computation & Language research 5d ago

Error-Aware TF-IDF Retrieval-Augmented Generation for ASR Error Correction

arXiv:2606.24915v1 Announce Type: new Abstract: End-to-end automatic speech recognition systems frequently hallucinate rare entities and domain-specific terms, especially in low-resource languages. While retrieval-augmented generation frameworks can mitigate these errors using…

18
arXiv — NLP / Computation & Language research 5d ago

Neural Machine Translation for Low-Resource Tangkhul–English

arXiv:2606.25365v1 Announce Type: new Abstract: We present a study on low-resource machine translation for the Tangkhul-English (nmf-en) language pair. Tangkhul is a severely under-resourced Tibeto-Burman language spoken primarily in Manipur, India, with virtually no prior…

16
arXiv — NLP / Computation & Language research 5d ago

Optimizing Abstractive Summarization With Fine-Tuned PEGASUS

arXiv:2606.25462v1 Announce Type: new Abstract: Abstractive text summarization is the technique of generating a short and concise summary comprising the salient ideas of a source text without making a subset of the salient sentences from the source text. The introduction of…

22
arXiv — NLP / Computation & Language research 5d ago

How Reliable Is Your Jailbreak Judge? Calibration and Adversarial Robustness of Automated ASR Scoring

arXiv:2606.25487v1 Announce Type: new Abstract: Almost every paper on LLM jailbreaks and prompt injection reports an attack-success rate (ASR), and that number is assigned not by people but by an automated judge: either a safety classifier trained for the task, or a general chat…

23
arXiv — NLP / Computation & Language research 5d ago

SARA: Unlocking Multilingual Knowledge in Mixture-of-Experts via Semantically Anchored Routing Alignment

arXiv:2606.25821v1 Announce Type: new Abstract: Sparse Mixture-of-Experts (MoE) architectures have emerged as an increasingly influential paradigm as they offer a strategic balance between parameter scalability and computational efficiency. However, low-resource languages, which…

21
arXiv — NLP / Computation & Language research 5d ago

Dziri Voicebot: An End-to-End Low-Resource Speech-to-Speech Conversational System for Algerian Dialect

arXiv:2606.26003v1 Announce Type: new Abstract: Automatic speech and language technologies are still heavily biased toward high-resource languages, limiting their applicability to dialectal and low-resource settings such as Algerian Dialect. This language presents additional…

28
arXiv — NLP / Computation & Language research 5d ago

The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar

arXiv:2606.26015v1 Announce Type: new Abstract: Text detoxification, the automated detection and mitigation of abusive and harmful content, is essential for ensuring the safety of online communities and protecting users. However, low resource languages such as Tatar have…

10
arXiv — NLP / Computation & Language research 5d ago

How Large Language Models Source Brand Reputation Across Languages and Markets

arXiv:2606.25787v1 Announce Type: cross Abstract: When a large language model (LLM) answers a question about a company, it grounds the answer in retrieved web sources, and those sources decide what the model says. Most analysis of AI brand visibility looks at the answer text.…

37
Simon Willison community 5d ago

simonw/browser-compat-db

simonw/browser-compat-db Inspired by Mozilla's new MDN MCP service - source code here - I decided to try converting their comprehensive mdn/browser-compat-data repository full of browser compatibility data into a SQLite database. This new GitHub Repo includes a Claude Code for…

11
r/LocalLLaMA community 5d ago

SDXL running locally in the browser on WebGPU, open-source

I needed simple local image generation without the usual setup. No virtual environments, no ComfyUI with a complex graph and installation as an exe. So I tried to push the whole thing into the browser and run it on WebGPU. It's a browser extension. You install it, then it loads…

13
r/LocalLLaMA community 5d ago

Sipp - an open-source library for in-browser inference built on llama.cpp

GitHub: https://github.com/noumena-labs/Sipp   submitted by /u/lordhiggsboson

9
r/MachineLearning community 5d ago

Find the best open-source OCR models in one place at Papers with Code [P]

Hi, I've created an overview of the most important OCR benchmarks, along with the top open models, and links to their paper and code: https://paperswithcode.co/tasks/ocr . This week, new OCR models were released by Baidu and Mistral. Baidu released Unlimited OCR, a 3B-parameter…

27
r/LocalLLaMA community 5d ago

New EU model (Domyn) will be 400b.

The source is in Italian, but it is a well-respected newspaper (comparable to the Financial Times): https://www.ilsole24ore.com/art/frontier-grand-challenge-domyn-guidera-progetto-dell-ai-sovrana-AIgNTNoD?refresh_ce=1 They are a startup that has already created a closed 260b model (Domyn Large) for…

38
Hugging Face Daily Papers research 5d ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Abstract: An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training data. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) Agentic language models dramatically…

34
arXiv — Machine Learning research 6d ago

Low-power analogue neural networks with trainable nonlinear connections for continuous control

arXiv:2606.23742v1 Announce Type: new Abstract: Physical neural networks promise low-power machine learning by computing directly with analogue device physics, but most architectures force nonlinear device responses to act as scalar weights. Inspired by Kolmogorov-Arnold…

28
arXiv — Machine Learning research 6d ago

Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios

arXiv:2606.23758v1 Announce Type: new Abstract: Domain generalization learns from multiple source domains to generalize to unseen target domains. However, it often neglects the realistic case of label mismatch between source and target. Open set domain generalization is then…

35
arXiv — Machine Learning research 6d ago

Data Augmentation: A Fourier Analysis Perspective

arXiv:2606.24418v1 Announce Type: new Abstract: Data augmentation is a simple and model-agnostic approach for exploiting known invariances in learning problems. Given a group acting on the input space, one augments the training set with transformed copies of each sample. Because…

37
arXiv — Machine Learning research 6d ago

MotifGen: Spatiotemporal interpolation of misaligned satellite images via multi-source generative modeling, in an application to tropical cyclones

arXiv:2606.24263v1 Announce Type: cross Abstract: Microwave satellite imagery plays a crucial role in monitoring tropical cyclone precipitation and intensity worldwide, but suffers from long revisit times, potentially missing rapid storm evolution phases. While this raises the…

27
arXiv — Machine Learning research 6d ago

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

arXiv:2606.24388v1 Announce Type: cross Abstract: We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and practical, extending existing benchmarks by…

38
arXiv — Machine Learning research 6d ago

ASALT: Adaptive State Alignment for Lateral Transfer in Multi-agent Reinforcement Learning

arXiv:2606.24601v1 Announce Type: cross Abstract: Multi-agent reinforcement learning (MARL) addresses the problem of training multiple agents that pursue collaborative, competitive, or mixed objectives. Prior work has investigated transfer learning between source and target…

29
arXiv — NLP / Computation & Language research 6d ago

QuechuaTok: Morphological Boundary Accuracy as a Necessary Metric for Tokenizer Evaluation in Agglutinative Low-Resource Languages

arXiv:2606.23943v1 Announce Type: new Abstract: Tokenization is a foundational step in NLP pipelines, yet standard evaluation metrics such as fertility rate fail to capture morphological correctness for agglutinative languages. We present QuechuaTok, a systematic benchmark…

32
arXiv — NLP / Computation & Language research 6d ago

Poster: Exploring the Limits of Audio-Based Detection of Turkish Phone Call Scams

arXiv:2606.24523v1 Announce Type: new Abstract: Scam phone calls exploit vulnerable communities worldwide, yet research on detection has focused almost exclusively on English and other high-resource languages. In low-resource settings such as Turkish, detection is especially…

11
arXiv — NLP / Computation & Language research 6d ago

AI-PAVE-Br: Leveraging Large Language Models for Enhanced Product Attribute Value Extraction through a Golden Set Approach

arXiv:2606.24655v1 Announce Type: new Abstract: The explosive growth and complexity of product data within the dynamic Brazilian e-commerce landscape demand robust and specialized methods for structured information extraction. Traditional approaches to Product Attribute Value…

5
arXiv — NLP / Computation & Language research 6d ago

Paying to Know: Micro-Transaction Markets for Verified Product Information in Agentic E-Commerce

arXiv:2606.24783v1 Announce Type: new Abstract: Commercial NLP treats the shopping chatbot as a recommender or a conversion tool: its job is to match a user to a catalogue entry and close a sale. We argue that the arrival of agent-native micro-payment rails (e.g., x402, AP2)…

23
arXiv — NLP / Computation & Language research 6d ago

Less is More: Quality-Aware Training Data Selection for Scientific Summarization

arXiv:2606.24828v1 Announce Type: new Abstract: Scientific long-document summarization datasets commonly treat author-written abstracts as gold reference summaries, although their quality and alignment with the source article vary. At the same time, publicly available scientific…

38
arXiv — NLP / Computation & Language research 6d ago

ESBMC-PLC+: A Unified IEC 61131-3 Formal Verification Framework as a PLCverif Successor

arXiv:2606.23870v1 Announce Type: cross Abstract: PLCverif is the most mature open-source platform for PLC formal verification, developed at CERN and in production use since 2019. Yet it has two fundamental limitations: no support for Ladder Diagram (LD) programs, the dominant…

35
Hacker News — AI on Front Page community 6d ago

Meta Pauses Employee-Tracking Program Following Internal Data Leak

Article URL: https://www.wired.com/story/meta-pauses-employee-tracking-program-following-internal-security-breach/ · Comments URL: https://news.ycombinator.com/item?id=48653575 · Points: 214 · Comments: 139

38
Hacker News — AI on Front Page community 6d ago

Vulnerability reports are not special anymore

Article URL: https://words.filippo.io/vuln-reports/ · Comments URL: https://news.ycombinator.com/item?id=48653216 · Points: 208 · Comments: 114

29
Hugging Face Daily Papers research 6d ago

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

Abstract: A unified open-source framework for discrete text-trigger optimization that standardizes the development and execution of optimization strategies across various domains and applications. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) Discrete text-trigger optimization --…

18
r/MachineLearning community 6d ago

What's your biggest pain point when choosing between cloud GPU providers for LLM inference? [R]

Trying to understand how other people make this decision. Do you compare $/hr, $/token, throughput, reliability? Is there a tool or resource you rely on, or are you just doing the math manually? Asking because I'm an ML engineer who's been doing this in spreadsheets and…

14
r/LocalLLaMA community 6d ago

Human Evaluation of GLM-5.2

I've seen plenty of benchmarks that put GLM-5.2 below many of the closed source alternatives but at their heels. I thought to myself, next version GLM will totally be where the best frontiers are at now. The last few days I've been testing it on a real world project, and it's…

6
Hugging Face Daily Papers research 6d ago

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

Abstract: AOHP presents an Android-based operating system framework that treats AI agents as first-class entities, enhancing task completion rates and reducing execution costs through specialized agent-oriented mechanisms. (Generated by Qwen/Qwen2.5-Coder-32B-Instruct.) AI agents…

16
r/LocalLLaMA community 7d ago

Boogu Base, Turbo, Edit - open-source unified image generation and editing model series

Boogu-Image-0.1 is a competitive Apache-2.0 open-source unified image generation and editing model family, including Base, Turbo, Edit, and other variants that provide stable, practical capabilities for high-quality text-to-image generation, fast generation, image editing,…

22
TechCrunch — AI news-outlet 7d ago

OpenAI launches new initiative to help find and patch open-source bugs

OpenAI is attempting to tackle the security issues of the open source software community.

25
r/LocalLLaMA community 7d ago

Why is NO one talking about Microsoft's open source Fast Context!!!

https://huggingface.co/microsoft/FastContext-1.0-4B-SFT https://github.com/microsoft/fastcontext FastContext-1.0 is a lightweight repository-exploration subagent for LLM coding agents. Instead of letting a single model both explore the repository and solve the task, FastContext…

38
Simon Willison community 7d ago

Prompt Injection as Role Confusion

First, I absolutely love this: This is a blog-style writeup of the paper. I wish every paper would come with one of these. Academic writing is pretty dry - the impact of a paper can be so much higher if you publish a readable version to…

19
