Loading page...
Loading page...
Author profile
Co-founder & CEO, General Analysis
25 works

Enterprise deployment guide for Claude Code security across managed settings, identity, dev containers, proxy controls, MCP, hooks, OpenTelemetry, CI/CD, and governance.
May 22, 2026 · 24 min read

A practical guide to Claude Code observability and control with OpenTelemetry, including metrics, events, traces, tool decisions, hooks, MCP activity, SIEM routing, privacy controls, and enterprise rollout.
May 22, 2026 · 23 min read

A practical guide to Claude Code settings, permission rules, Bash tool controls, hooks, MCP allowlists, telemetry, and safe defaults for developer teams.
May 21, 2026 · 21 min read

A practical 2026 buyer guide to automated penetration testing platforms, autonomous pentesting, automated security validation, CTEM, DAST, BAS, and AI security testing.
May 21, 2026 · 18 min read

A practical 2026 buyer guide to AI security platforms across AI red teaming, agentic AI security, prompt injection protection, runtime controls, AI posture management, model supply chain security, and AI TRiSM.
May 21, 2026 · 16 min read

Claude Cowork can reach local files, browser sessions, plugins, MCP servers, scheduled tasks, connectors, and approved desktop apps. This guide explains the main Claude Cowork risks and the security controls enterprises should put in place before broad rollout.
May 20, 2026 · 13 min read

Security best practices for Anthropic Claude Code across permissions, Bash, hooks, MCP, sandboxing, proxy controls, telemetry, and CI/CD workflows.
May 20, 2026 · 22 min read

A practical enterprise guide to securing Claude Code with permissions, sandboxed Bash, dev containers, managed settings, MCP allowlists, hooks, proxy controls, OpenTelemetry, and CI/CD release gates.
May 19, 2026 · 22 min read

A practical 2026 comparison of AI red teaming and adversarial testing tools across automated red teaming, LLM security testing, prompt injection coverage, agentic AI testing, multi-step tool-chain attacks, framework support, and enterprise readiness.
May 19, 2026 · 18 min read

A concise summary of the General Analysis technical whitepaper on securing Claude Code, OpenAI Codex, Cursor, Windsurf, Devin, GitHub Copilot, and Claude Cowork.
May 11, 2026 · 6 min read

The Model Context Protocol expanded what AI agents can reach, and expanded the attack surface across at least nine distinct vectors. A primary-source threat model for MCP servers, with concrete controls, real CVEs, and the GA Supabase exploit walked end to end.
May 2, 2026 · 16 min read

Claude Cowork and Claude Code share an agentic architecture but ship very different enterprise controls. A primary-source comparison of sandbox, network, audit-log, MCP, and decision-framework differences for security teams.
May 1, 2026 · 10 min read

Claude Cowork brings Claude Code-style agentic work to local files, browsers, apps, plugins, and scheduled tasks. Here is how to put a middleman proxy, browser controls, computer-use limits, and enterprise monitoring around it before using it on real work.
April 30, 2026 · 16 min read
General Analysis has raised $10M in seed funding to build the enterprise security layer for agentic systems.
April 29, 2026 · 4 min read

In this post, we show how an attacker can exploit Supabase’s MCP integration to leak a developer’s private SQL tables. Model Context Protocol (MCP) has emerged as a standard way for LLMs to interact with external tools. While this unlocks new capabilities, it also introduces new risk surfaces.
April 10, 2026 · 8 min read

50+ customer service agents offer a combined $10,000,000+ in fabricated benefits.
March 22, 2026 · 10 min read

Open-source release of the GA Guard series, a family of safety classifiers that have been providing comprehensive protection for enterprise AI deployments for the past year.
October 1, 2025 · 7 min read

We reveal a powerful metadata-spoofing attack that exploits Claude's iMessage integration to mint unlimited Stripe coupons or invoke any MCP tool with arbitrary parameters, without alerting the user.
July 16, 2025 · 7 min read

We present the Redact & Recover (RnR) Jailbreak, a novel attack that exploits partial compliance behaviors in frontier LLMs to bypass safety guardrails through a two-phase decomposition strategy.
July 7, 2025 · 8 min read

Our compact policy moderation models achieve human-level performance at <1% per-review cost, outperforming GPT-4o and o4‑mini on F1 while running faster and cheaper.
May 25, 2025 · 8 min read

A head-to-head robustness evaluation of Llama 4 (Maverick, Scout) versus GPT‑4.1, GPT‑4o, Sonnet 3.7, etc. using TAP‑R, Crescendo, and Redact‑and‑Recover across HarmBench and AdvBench.
May 10, 2025 · 10 min read

We are excited to announce our partnership with Together AI to stress-test the safety of open-source (and closed) language models.
May 6, 2025 · 2 min read

We have created a comprehensive overview of the most influential LLM jailbreaking methods.
March 21, 2025 · 40 min read

We utilized LegalBench as a diversity source to enhance the diversity of our generation of red teaming questions. We show that diversity transfer from a domain-specific knowledge base is a simple and practical way to build a solid red teaming benchmark.
February 19, 2025 · 5 min read

In this work we explore automated red teaming, applied to GPT-4o in the legal domain. Using a Llama3 8B model as an attacker, we generate more than 50,000 adversarial questions that cause GPT-4o to hallucinate responses in over 35% of cases.
January 23, 2025 · 5 min read