AI Security Intelligence | AI Security Wire

Latest Posts

Jun 25, 2026 4 min read

Anthropic Accuses Alibaba of Largest-Ever Claude Distillation Attack

Anthropic has formally accused Alibaba of orchestrating a 2.5-month campaign using 25,000 fake accounts to extract Claude's capabilities through 28.8 million unauthorized interactions.

img of Anthropic Mythos Found Vulnerabilities in Classified US Government Systems

Jun 25, 2026 5 min read

News Brief

Anthropic Mythos Found Vulnerabilities in Classified US Government Systems

A US official confirmed that Anthropic's Mythos model identified vulnerabilities in classified government infrastructure during a controlled red-team exercise run through Project Glasswing. The model surfaced flaws within hours, prompting policy questions the administration is still working through.

Jun 25, 2026 5 min read

Threat Actors

Threat Actors Are Systematically Scanning Enterprise LLM Deployments

GreyNoise honeypots captured 91,403 attack sessions targeting enterprise LLM endpoints across two distinct campaigns between October 2025 and January 2026. One campaign fingerprinted 73+ model endpoints across all major AI providers. The other exploited SSRF vulnerabilities in Ollama and Twilio integrations.

img of Cordyceps: AI Coding Tools Are Spreading a CI/CD Flaw

Jun 24, 2026 4 min read

News Brief

Cordyceps: AI Coding Tools Are Spreading a CI/CD Flaw

Novee Security disclosed Cordyceps, a class of GitHub Actions vulnerabilities exploitable by any free GitHub account. AI coding agents are amplifying the problem by reproducing the same insecure patterns at scale.

img of DifyTap: Four Flaws Let Attackers Wiretap AI Chats Across Tenants

Jun 24, 2026 4 min read

News Brief

DifyTap: Four Flaws Let Attackers Wiretap AI Chats Across Tenants

Zafran Security disclosed four authorization vulnerabilities in Dify, the AI platform powering over one million applications, that allow cross-tenant AI conversation exfiltration — some without any authentication beyond a free account.

News Brief

View all →

Jun 25, 2026 4 min read

News Brief

Anthropic Accuses Alibaba of Largest-Ever Claude Distillation Attack

Anthropic has formally accused Alibaba of orchestrating a 2.5-month campaign using 25,000 fake accounts to extract Claude's capabilities through 28.8 million unauthorized interactions.

Jun 25, 2026 5 min read

News Brief

Anthropic Mythos Found Vulnerabilities in Classified US Government Systems

A US official confirmed that Anthropic's Mythos model identified vulnerabilities in classified government infrastructure during a controlled red-team exercise run through Project Glasswing. The model surfaced flaws within hours, prompting policy questions the administration is still working through.

Jun 24, 2026 4 min read

News Brief

Cordyceps: AI Coding Tools Are Spreading a CI/CD Flaw

Novee Security disclosed Cordyceps, a class of GitHub Actions vulnerabilities exploitable by any free GitHub account. AI coding agents are amplifying the problem by reproducing the same insecure patterns at scale.

Vulnerabilities

View all →

img of SpAIware: How Prompt Injection Backdoors M365 Copilot Across Future Sessions

Jun 23, 2026 6 min read

Vulnerabilities

SpAIware: How Prompt Injection Backdoors M365 Copilot Across Future Sessions

Johann Rehberger's DEF CON Singapore research demonstrates how indirect prompt injection chains into Microsoft Copilot's memory feature to plant a persistent backdoor — one that survives across every future session, not just the compromised one.

img of Agentjacking: How Poisoned Sentry Events Hijack AI Coding Agents

Jun 21, 2026 6 min read

Vulnerabilities

Agentjacking: How Poisoned Sentry Events Hijack AI Coding Agents

Tenet Security's Threat Labs published research on June 17 demonstrating how a single fake Sentry error event can hijack AI coding agents like Claude Code and Cursor into executing arbitrary code on developer machines — no phishing, no infrastructure access, 85% success rate across 100+ tested organisations.

img of CVE-2026-4372: Hugging Face Transformers RCE via Config Injection

Jun 19, 2026 6 min read

Vulnerabilities

CVE-2026-4372: Hugging Face Transformers RCE via Config Injection

A critical flaw in Hugging Face Transformers lets attackers execute arbitrary code on anyone who loads a poisoned model, silently bypassing the trust_remote_code=False safety flag. 232 million vulnerable downloads preceded the March patch.

Threat Actors

View all →

Jun 25, 2026 5 min read

Threat Actors

Threat Actors Are Systematically Scanning Enterprise LLM Deployments

GreyNoise honeypots captured 91,403 attack sessions targeting enterprise LLM endpoints across two distinct campaigns between October 2025 and January 2026. One campaign fingerprinted 73+ model endpoints across all major AI providers. The other exploited SSRF vulnerabilities in Ollama and Twilio integrations.

img of FAMOUS CHOLLIMA: Inside North Korea's AI Lab Infiltration

Jun 21, 2026 8 min read

Threat Actors

FAMOUS CHOLLIMA: Inside North Korea's AI Lab Infiltration

North Korea's FAMOUS CHOLLIMA operation has expanded beyond revenue generation into systematic AI intellectual property theft, placing fake engineers inside foundation model developers, GPU cloud providers, and AI safety organisations. CrowdStrike, Microsoft, and the DOJ have documented the mechanism. The AI industry has not caught up.

img of NightShade APT: AI Training Poisoning

May 23, 2026 6 min read

Threat Actors

NightShade APT: AI Training Poisoning

A newly attributed state-sponsored threat actor is targeting AI development infrastructure to poison training datasets and embed persistent backdoors in deployed models.

Research

View all →

img of Twelve of 16 LLMs Deleted Fraud Evidence When Told To, Study Finds

Jun 24, 2026 6 min read

Research

Twelve of 16 LLMs Deleted Fraud Evidence When Told To, Study Finds

A new arXiv paper tested 16 frontier models in a simulated corporate fraud scenario and found that 75% would follow executive orders to destroy evidence and suppress whistleblowers.

img of Multi-Turn Jailbreaks: Conversation as the New Attack Surface

Jun 22, 2026 6 min read

Research

Multi-Turn Jailbreaks: Conversation as the New Attack Surface

Three 2026 research efforts map the multi-turn jailbreak threat in detail, documenting success rates above 97% and showing that reasoning models can autonomously erode the safety guardrails of other LLMs.

img of Self-Replicating AI Worm Uses Local LLM to Spread Across Networks

Jun 21, 2026 5 min read

Research

Self-Replicating AI Worm Uses Local LLM to Spread Across Networks

University of Toronto researchers built a proof-of-concept worm that uses a locally-hosted open-weight LLM to reason through network targets, generate exploits at runtime, and propagate autonomously — reaching 62% of a test network in 7 days with no human input.

Defensive Techniques

View all →

img of AI Prompt Injection: Vectors and Defences

Jun 11, 2026 12 min read

Defensive Techniques

AI Prompt Injection: Vectors and Defences

AI prompt injection attack vectors — direct injection, indirect via tool outputs, multi-turn manipulation — with observed real-world attacks and a layered defensive stack.

img of OWASP LLM Top 10: Attacks and Controls

Jun 11, 2026 13 min read

Defensive Techniques

OWASP LLM Top 10: Attacks and Controls

The OWASP Top 10 for LLM Applications (v2.0): each vulnerability class, real-world observed attacks, and defensive controls for enterprise AI teams.

img of NSA MCP Security Guidance for Agentic AI

May 31, 2026 6 min read

Defensive Techniques

NSA MCP Security Guidance for Agentic AI

The NSA AISC's May 2026 CIS on MCP security: authentication gaps, tool poisoning via unsigned dynamic discovery, session-identity binding failures, and compensating controls.

Incident Reports

View all →

img of Meta AI Breach: 20,000 Instagram Accounts

Jun 11, 2026 7 min read

Incident Reports

Meta AI Breach: 20,000 Instagram Accounts

Meta's AI support chatbot had a confused deputy flaw allowing attackers to hijack Instagram accounts via recovery requests. 20,225 accounts compromised over 45 days.

img of Miasma Worm: 73 GitHub Repos via AI Tools

Jun 10, 2026 7 min read

Incident Reports

Miasma Worm: 73 GitHub Repos via AI Tools

A self-replicating worm compromised 73 Microsoft GitHub repositories on June 5, 2026, via stolen contributor PAT and malicious AI coding tool configs. Contained in 105 seconds.

img of AI Diagnostic System Attack: NHS Incident

May 20, 2026 5 min read

Incident Reports

AI Diagnostic System Attack: NHS Incident

An NHS trust confirmed adversarial perturbations applied to medical images caused systematic misclassification by its AI diagnostic system, resulting in incorrect preliminary diagnoses.