What are the two new security features Anthropic introduced for Claude?

Anthropic introduced a self-hosted sandbox for tool execution and a security guidance plugin.

Claude Code: Safer Coding Starts Now

Q: How does the self-hosted sandbox maintain security boundaries during code execution?

It utilizes operating-system-level controls to isolate both the filesystem and the network.

Q: When does the security guidance plugin perform vulnerability scans?

Scans occur during file edits, after AI-generated changes, and at commit time.

Claude Code Gets Safer: Self-Hosted Sandboxing and Vulnerability Scanning for AI-Assisted Coding

28 mag 2026 (Aggiornato il 28 mag 2026) - Scritto da Christian Tico

165

Condividi questo articolo:

Artificial Intelligence

Anthropic and Claude are trademarks of Anthropic PBC; this article is an independent editorial piece.

Sponsorizzato

Dimentichi tutto quello che studi? Sblocca il sistema Flashcard

Studiare senza ripassi attivi ti fa dimenticare subito le nozioni. Usa le flashcard integrate nella dashboard o tramite bot per ripassare ovunque.

Usa Flashcard

Anthropic Adds a Self-Hosted Claude Sandbox and Security Plugin for Safer Coding

Anthropic has introduced two new security-focused additions to Claude: a self-hosted sandbox for controlled tool execution and a security guidance plugin that helps developers detect and fix vulnerabilities while coding. Together, these features are designed to make Claude more autonomous for developers while keeping code, files, and network access inside tighter security boundaries.

What Anthropic Announced

Anthropic’s latest update centers on making Claude Code safer to use in real development environments. The self-hosted sandbox lets organizations run tool execution inside infrastructure they control, while Anthropic keeps orchestration on its side. The security guidance plugin, available for Claude Code, scans code for vulnerabilities during edits, after AI-generated changes, and at commit time.

Self-hosted sandbox, tool execution runs in user-controlled infrastructure rather than Anthropic-managed containers.
Security guidance plugin, the plugin helps detect risky code patterns and suggests fixes as developers write.
Public beta and preview rollout, both features are being introduced as early-stage capabilities for developers and teams.

How the Self-Hosted Sandbox Works

Anthropic says the sandbox is built to reduce permission prompts while preserving security boundaries. In Claude Code, sandboxing uses operating-system-level controls to isolate both the filesystem and the network. That means Claude can be restricted to approved directories and approved servers, which helps prevent accidental or prompt-injected access to sensitive files or external destinations.

For self-hosted sandboxes, the execution environment can live on infrastructure controlled by the customer or a managed provider, while Claude’s orchestration remains on Anthropic’s infrastructure. Anthropic’s documentation says this approach is useful when data must stay inside a network boundary, when internal services need to be accessed, or when organizations want their own compliance and audit controls.

Filesystem isolation, Claude can be limited to specific directories.
Network isolation, Claude can be limited to approved network hosts.
Customer control, organizations can define compute size, runtime image, and network policy.
Security benefit, sensitive files and repositories do not leave the user’s perimeter.

What the Security Plugin Does

The new security guidance plugin is aimed at helping developers catch vulnerabilities earlier in the coding process. Anthropic says it scans for vulnerabilities on file edits, after AI-generated changes, and before commits, using the surrounding code context and full diffs to spot dangerous patterns.

According to Anthropic, the capability is designed to help teams find and fix issues that traditional methods can miss. The company says the feature has already been used extensively internally, which suggests it was tested in real development workflows before broader release.

Key tasks the plugin supports

Detecting risky code patterns during edits
Reviewing AI-generated changes for security issues
Scanning diffs before commit time
Suggesting targeted fixes for human review

Why This Matters for Developers

The two releases point to the same goal, giving developers more automation without sacrificing control. For teams using AI coding tools, that balance matters because more autonomy can also mean more risk if tool access, file access, or network access are not constrained.

Anthropic’s sandboxing model is especially relevant for organizations that work with proprietary code or regulated data. By keeping execution inside customer-controlled infrastructure, the company is positioning Claude as a tool that can be used more safely in enterprise workflows.

How It Fits Anthropic’s Broader Claude Strategy

Anthropic has been steadily expanding Claude Code with more developer-focused capabilities. The sandboxing feature and security plugin fit into that broader strategy by making Claude more practical for software engineering tasks, especially in environments where security review is part of the development process.

Anthropic’s documentation also frames self-hosted sandboxes as separate from other access models, meaning organizations can combine execution controls with other connection methods as needed. That gives teams more flexibility when designing secure AI-assisted workflows.

What Developers and Security Teams Should Watch

These releases are promising, but they also raise practical questions that teams will need to evaluate during adoption. Security and platform teams will want to understand how much policy control they have, how logging works, what approval flows remain, and how the sandbox behaves with internal tooling and private repositories.

Access control, how tightly can directories and network hosts be restricted?
Auditability, what logs are available for review and compliance?
Operational fit, does the sandbox integrate cleanly with existing developer tooling?
Security coverage, how well does the plugin perform on real-world codebases?

Conclusion

Anthropic’s self-hosted sandbox and security guidance plugin show a clear push toward safer AI-assisted software development. The sandbox gives organizations more control over where Claude runs, while the plugin adds security checks directly into the coding workflow, making it easier to catch vulnerabilities earlier and keep development moving with fewer risky blind spots.

Pensiero dell'autore

The real shift is not safer coding, but safer delegation: Anthropic is turning Claude from a helpful assistant into a governed operator, where security is enforced by architecture rather than by developer vigilance alone. That matters because the next bottleneck in AI coding is no longer generation quality, but whether organizations can trust the machine to act without silently expanding its own blast radius.

Christian Tico

Metti alla prova le tue conoscenze

When does the security guidance plugin perform vulnerability scans?

Claude Code: Safer Coding Starts Now

Claude Code Gets Safer: Self-Hosted Sandboxing and Vulnerability Scanning for AI-Assisted Coding

Anthropic Adds a Self-Hosted Claude Sandbox and Security Plugin for Safer Coding

What Anthropic Announced

How the Self-Hosted Sandbox Works

What the Security Plugin Does

Key tasks the plugin supports

Why This Matters for Developers

How It Fits Anthropic’s Broader Claude Strategy

What Developers and Security Teams Should Watch

Conclusion

Leggi anche

Anthropic launches Claude AI connectors for Adobe and other

Claude Opus 4.7: Code Like a Pro Now

Claude Code: What Actually Leaked

Alessia Bot AI Mode: Enable Smarter Chat Now

Claude Code Channels: Code From Your Phone