Subagents: Parallel Execution and Context Isolation

Channel Visual Studio Code

Date February 9, 2026

Duration 29:41

TL;DR

Harald Kirschner from the VS Code team explains how subagents work in VS Code's Copilot—isolated agent loops with their own context windows that can run tasks in parallel without bloating the main agent's context. Subagents enable efficient delegation of specialized tasks like code research, security reviews, and parallel implementation, returning only summarized results to the orchestrating agent.

Key Takeaways

Subagents are isolated agent loops: Each subagent has its own context window, runs independently, and returns only a summary to the parent agent—protecting the main context from information overload
Parallel execution is the killer feature: Multiple subagents can run simultaneously (e.g., one for security review, one for architecture, one for code quality) because they operate in isolated contexts
Context engineering happens automatically: Using plan mode already triggers subagent usage for exploration tasks, so developers get context isolation benefits without explicit configuration
Custom agents can be subagents: Create specialized agents (e.g., "DeepCodeResearch") with specific workflows and the main agent will automatically invoke them based on their description
Model selection per subagent: Different subagents can use different models—fast mini models for simple automation, larger reasoning models for planning, code-optimized models for implementation
Ephemeral memory by design: Subagents start with zero context (no memory from previous conversations), which keeps them focused but requires good initial instructions

Summary

The Agent Loop Fundamentals

The agentic loop in VS Code starts with a user question, a system prompt defining behavior, and access to tools. The agent reasons about the task, calls tools (like search or file reads), and accumulates results in a conversation—building up the context window. VS Code exposes powerful search tools and allows parallel tool calls, so the agent can efficiently gather information before synthesizing an answer.

How Subagents Differ

A subagent is essentially a "delegate"—like asking a colleague to research a topic and report back. It runs its own agent loop with isolated context, performs the assigned task (file searches, code exploration, hypothesis testing), and returns only a condensed summary. The parent agent sees just the task request and the result, not all the intermediate exploration.

Practical Subagent Use Cases

Code Review Parallelization: Instead of one agent reading all files and running out of context, spawn multiple subagents:

Security-focused subagent
Architecture review subagent
"Slop detector" subagent (checks if AI generated unnecessary utilities)

Each returns focused findings without polluting the main context.

Plan Mode Benefits: When using plan mode, exploration already runs in subagents automatically. The planning agent gives research tasks to subagents, which return findings while the main planning context stays clean.

Day-to-Day Development Tips

Use plan mode: You automatically get subagent benefits for context isolation
Annotate parallel work: When writing specs or plans, note what can run in parallel—the agent will spawn parallel subagents
Let it happen naturally: The team is working toward automatic detection of parallelizable work (e.g., independent frontend and backend tasks)

Custom Agents as Subagents

Create custom agents with specific descriptions like "use when understanding cross-repo dependencies." The main agent will automatically invoke them when the task matches. Key differences from skills:

Skills: Inject into main context, subject to other context noise
Custom agents: Run in isolated context with complete focus on their defined workflow

Model Optimization Strategy

Harald's "Loop" orchestrator pattern demonstrates strategic model selection:

Fast context gathering: Mini model quickly scans files, writes findings to a scratch file
Deep planning: Larger reasoning model (Opus, GPT-5.2 Codex) analyzes the scratch pad
Parallel implementation: Fast code-writing models churn through the detailed plan
Quality review: Larger model reviews all changes, catches divergence from plan

This approach optimizes for speed, cost, and quality—avoiding the "everything in Opus" anti-pattern.

Notable Quotes

"The subagent basically is its own agent loop with its own context. And most often, the best way to describe it is you want to delegate something."

"All you get back is like, this is what I found. And then maybe some confidence with it as well. So that's the sub-agent solution, that context isolation to do a specific task."

"We have this context indicator now. So you see in a conversation, how much context the agent has built up. So it's much easier to understand how that agent loop works."

"If you have something that has to be rock solid and really deterministic and a workflow you really want to get down to the right steps, then that would be a custom agent."

"I just want to see it happening. I don't want Opus building a beautiful coded UI. I just want to figure out what is that critical thing I'm missing and iterate fast."

Chapters

Time	Topic
00:00	Introduction and Overview
03:01	Understanding Agents and Context
08:04	Sub-Agents and Context Isolation
13:51	Daily Development Practices
17:52	Custom Agents and Orchestration
24:22	Model Selection and Performance

References & Resources

From Description

Mentioned in Video

Tools & Products: VS Code, Copilot CLI, GitHub Copilot, Plan Mode, MCP servers, Claude, GPT-5.2 Codex, Opus, Gemini
Concepts: Agentic loop, context window, tool calls, parallel execution, context isolation, custom agents, skills, agents.md
People: James (host), Harald Kirschner (VS Code team)