Research Library

Reading notes and references

Each item includes a short note so readers can understand why it matters before opening the original source material.

Attacking AI - Jason Haddix - NDC Security 2026 video thumbnail

YouTube April 16, 2026 video

Attacking AI - Jason Haddix - NDC Security 2026

Attacking AI is a one of a kind session releasing case studies, tactics, and methodology from Arcanum’s AI assessments in 2024 and 2025. While most AI assessment material focuses on academic AI red team content, “Attacking AI” is focused on the task of assessing AI enabled systems.

Prompt Injection

Open notes Watch on YouTube

Krebs on Security April 14, 2026 news

Patch Tuesday, April 2026 Edition

Krebs on Security covers April 2026 patching activity, including a record-sized Microsoft release and active exploitation notes.

Read summary Source link

NIST April 7, 2026 framework

AI Risk Management Framework

NIST’s AI RMF hub now highlights the upcoming Trustworthy AI in Critical Infrastructure profile alongside the playbook and related implementation resources.

AI Compliance Model Evaluation

Read summary Source link

Microsoft Security Blog March 12, 2026 guide

Detecting and analyzing prompt abuse in AI tools

Microsoft Incident Response walks through how to detect prompt abuse operationally, tying prompt injection risk back to logging, telemetry, and incident response workflows.

Prompt Injection Agent Security

Read summary Source link

OpenAI March 11, 2026 analysis

Designing AI agents to resist prompt injection

OpenAI frames prompt injection as an evolving agent-security problem that increasingly resembles social engineering rather than a simple string-matching issue.

Prompt Injection Agent Security

Read summary Source link

OpenAI March 9, 2026 news

OpenAI to acquire Promptfoo

OpenAI announced plans to acquire Promptfoo, highlighting automated AI security testing, red teaming, and evaluation as core enterprise requirements.

AI Red Teaming Prompt Engineering Prompt Injection

Read summary Source link

MITRE Center for Threat-Informed Defense February 9, 2026 framework

MITRE ATLAS OpenClaw Investigation Discovers New and Likeliest Techniques

MITRE maps incident patterns in an open-source agentic ecosystem to ATLAS techniques, showing how AI-first systems create distinct execution paths for attackers.

Agent Security AI Red Teaming

Read summary Source link

European Commission January 27, 2026 framework

AI Act

The European Commission’s AI Act hub centralizes the EU’s risk-based AI compliance framework, implementation material, and links to governance, enforcement, and standardisation resources.

Read summary Source link

OpenAI December 22, 2025 analysis

Continuously hardening ChatGPT Atlas against prompt injection attacks

OpenAI describes using automated red teaming and reinforcement learning to discover agent prompt injection attacks before they appear in the wild.

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

Google Cloud Blog December 4, 2025 guide

Building a Production-Ready AI Security Foundation

Google Cloud outlines a defense-in-depth view of AI security spanning application controls, data protections, and infrastructure isolation.

Agent Security Prompt Injection Adversarial ML

Read summary Source link

OpenAI November 7, 2025 guide

Understanding prompt injections: a frontier security challenge

An accessible explanation of prompt injection risk in real AI products, including how third-party content can redirect or manipulate agent behavior.

Prompt Injection Prompt Engineering

Read summary Source link

Google Cloud Blog June 12, 2025 analysis

Cloud CISO Perspectives: How Google secures AI Agents

Google’s CISO perspective on why agents need a new security paradigm and what changes when models can observe, plan, and act.

Read summary Source link

NIST March 24, 2025 framework

Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations

NIST finalizes AI 100-2e2025, providing a terminology and taxonomy for adversarial machine learning across predictive and generative AI systems.

Adversarial ML Model Evaluation AI Compliance

Read summary Source link

Anthropic March 19, 2025 analysis

Progress from our Frontier Red Team

Anthropic shares lessons from frontier red teaming and discusses where models are showing early-warning signs of higher-risk cyber and biology capabilities.

AI Red Teaming Model Evaluation

Read summary Source link

Google Cloud Blog March 5, 2025 news

Announcing AI Protection: Security for the AI era

Google introduced AI Protection and Model Armor to address prompt injection, jailbreaks, data loss, and multicloud AI workload security.

Prompt Injection Agent Security

Read summary Source link

OpenAI February 25, 2025 framework

Deep research System Card

OpenAI’s system card for deep research covers prompt injection, privacy, code execution, and external red teaming prior to release.

Model Evaluation Prompt Injection AI Compliance

Read summary Source link

OpenAI January 23, 2025 framework

Operator System Card

The Operator system card documents red teaming and mitigation choices for a computer-using agent, with prompt injections listed as a central risk area.

Agent Security Model Evaluation Prompt Injection AI Compliance

Read summary Source link

Microsoft Cloud Blog January 14, 2025 analysis

Enhancing AI safety: Insights and lessons from red teaming

Microsoft summarizes lessons from red teaming more than one hundred generative AI products, emphasizing system-level testing, human expertise, and automation.

AI Red Teaming Prompt Injection Agent Security

Read summary Source link

Microsoft Security Blog January 13, 2025 guide

3 takeaways from red teaming 100 generative AI products

Microsoft Security highlights practical red-team lessons, including prompt injections against multimodal systems and the need to stay grounded in basic cyber hygiene.

AI Red Teaming Prompt Injection

Read summary Source link

OWASP January 1, 2025 framework

OWASP Top 10 for Large Language Model Applications

OWASP’s GenAI security project remains a practical baseline for teams building or assessing LLM applications and agentic systems.

Prompt Injection Agent Security Adversarial ML

Read summary Source link