Claude Agent SDK Full Workshop

Channel Anthropic

Published January 5, 2026

Duration 1:52:25

Anthropic Claude Agent SDK Workshop Tutorial AI Agents Claude Code

TL;DR

Thariq Shihipar from Anthropic delivers a comprehensive workshop on building AI agents using the Claude Agent SDK. The core philosophy centers on "Bash is all you need" - leveraging Unix primitives (bash, file system) as the foundation for powerful agents. The workshop covers the agent loop (gather context, take action, verify work), security through "Swiss cheese defense" layers, and demonstrates how to build production agents using Claude Code as a prototyping platform before deploying with the SDK.

Key Takeaways

Agents vs Workflows: An agent is defined by having a model in the driver's seat making decisions, unlike workflows where humans orchestrate the steps
Bash is the most powerful tool: The bash tool serves as "first code mode" - composable, familiar, and incredibly versatile for agent actions
The Agent Loop: Gather context, take action, verify work - this cycle repeats until the task is complete
Swiss Cheese Defense: Security comes from multiple layers - model alignment, harness permissioning, and sandboxing
Code Generation for Non-Coding Tasks: Use code generation even for non-technical problems - it provides structure, determinism, and verifiability
Skills for Progressive Context: Load context progressively through skills rather than dumping everything into the system prompt
File System as Memory: Use the file system to persist state, notes, and context between agent sessions
Sub-agents for Context Management: Spawn sub-agents to handle complex subtasks without polluting the main context window
Prototype with Claude Code: Build and test your agent using Claude Code before productionizing with the SDK
Simple but Not Easy: Your agent code should be minimal and elegant, but achieving that requires deep understanding of the domain

Summary

What is the Claude Agent SDK?

The Claude Agent SDK is built on top of Claude Code, providing a framework for building production-ready AI agents. The key insight is that models are "grown, not designed" - meaning developers need to understand and work with the model's natural capabilities rather than fighting against them. The SDK handles the infrastructure complexity (sub-agents, bash execution, context management) so developers can focus on domain-specific problems.

The Anthropic Way to Build Agents

The workshop introduces the "Anthropic way" of building agents, which emphasizes simplicity and leveraging what models already know. Rather than creating complex tool ecosystems, the philosophy is to use Unix primitives - bash and the file system - as the foundation. Bash provides a composable, well-understood interface that models are deeply familiar with from training data.

Tools vs Bash vs Code Generation

Three main approaches for agent actions:

Structured Tools: Most reliable, best for static operations with clear input/output schemas
Bash: Highly composable, good for combining operations, familiar to models
Code Generation: Most flexible, best for highly dynamic operations requiring custom logic

The choice depends on how dynamic the operation is. Static operations favor tools; highly dynamic operations favor code generation. Bash sits in the middle, offering composability for relatively static scripts.

The Agent Loop

Every agent follows a core loop:

Gather Context: Search, read files, query APIs - get the information needed
Take Action: Write files, execute commands, make changes
Verify Work: Run tests, check outputs, validate results

This loop continues until the task is complete. Verification is crucial - use deterministic checks (linting, tests) wherever possible, and save model-based verification for cases where rules cannot be written.

Security: Swiss Cheese Defense

Security cannot rely on a single layer. The "Swiss cheese defense" model uses multiple overlapping protections:

Model Alignment: The model's training to avoid harmful actions
Harness Permissioning: Explicit allow-lists and permission systems in the agent code
Sandboxing: Container isolation to limit potential damage

Each layer has holes, but overlapping them provides robust protection.

Context Engineering and Skills

Rather than front-loading all context into the system prompt, use "progressive context disclosure" through skills. Skills are markdown files that get loaded on-demand, providing relevant context exactly when needed. This keeps the initial context clean while making rich information available.

File System as Memory

The file system serves as persistent memory for agents. Write notes, state, and context to files so agents can resume work across sessions. This is analogous to how Claude Code uses CLAUDE.md files - they provide persistent context that survives context window resets.

Sub-agents for Complex Tasks

Sub-agents are crucial for managing context and parallelizing work. Use them when:

A task requires significant work but the main agent only needs the result
You want to avoid polluting the main context with intermediate steps
Tasks can be parallelized (e.g., reading multiple spreadsheet sheets simultaneously)

Sub-agents can also serve as adversarial verifiers, checking the main agent's work without "sympathetic" context pollution.

Prototyping with Claude Code

The recommended workflow is to prototype with Claude Code first. Give Claude Code access to your APIs and data, iterate on the approach, and once it works well, translate to the Agent SDK for production. Claude Code serves as the perfect prototyping environment because it uses the same underlying primitives.

State and Reversibility

Consider how reversible your agent's actions are when choosing use cases. Code is highly reversible (git history), while computer use is not (ordering the wrong item cannot be undone). Agents work best in domains with reversible state or the ability to create checkpoints.

Live Demonstration: Pokemon Agent

The workshop includes a live demonstration building a Pokemon agent that can query the PokeAPI, search competitive data from Smogon, and help build Pokemon teams. The example showcases:

Generating TypeScript libraries from API documentation
Using code generation to query and filter data
Progressive context loading for domain-specific knowledge
The agent SDK's minimal boilerplate approach

Notable Quotes

"Bash is all you need. The bash tool is just like one of the most powerful tools because it is so composable."

"Models are grown and not designed. We're sort of understanding their capabilities like riding a horse - giving signals, calming it down, figuring out how to push it faster."

"Simple is not the same as easy. The amount of code in your agent should not be huge, but it does need to be elegant. It needs to be what the model wants."

"If you're using other SDKs and you think about tools first, I would challenge you to try bash first and see if that's all you need."

"The agent should gather context as much as possible. Give it the tools to find its own work. Think about it like someone locked in a room - would you want a stack of papers or a computer?"

References

Anthropic Cookbook - Official examples and patterns
PokeAPI - RESTful Pokemon API used in the demonstration
Smogon - Competitive Pokemon data source
Cloudflare Workers - Mentioned for sandbox deployment of agents
Adam Wolf's QCON talk on the Claude Code bash tool implementation

Claude Agent SDK [Full Workshop] - Thariq Shihipar, Anthropic