YouTube summarize plugin

I've found this handy in consuming and curating what to watch when trying to keep up with all the things… spec-kit, gas town, beads, ralph wiggum, etc.

source code https://github.com/jpoley/yt-summary

in claude-code:

/plugin marketplace add jpoley/yt-summary
/plugin install yt-summary

/yt-summary:yt-summarize https://www.youtube.com/watch?v=s96O9oWI_tI

here is an example of the output, and an excellent Video as well.

Beyond Instructions: How Beads Lets AI Agents Build Like Engineers

Watch on YouTube

Channel AI Tinkerers Date December 12, 2025 Duration 47:49

TL;DR

Steve Yegge, head of engineering at Sourcegraph (makers of AMP), discusses Beads, his open-source framework that gives coding agents persistent memory and issue-based task management. Now 4 weeks old with ~29 contributors, Beads solves the "50 First Dates" problem where agents lose all context between sessions. Steve demonstrates his workflow building a React game client using multimodal prompting with Playwright screenshots, and shares techniques like "landing the plane" (session cleanup) and "tracer bullets" (finding thin end-to-end paths when stuck).

Key Takeaways

Beads provides session-to-session memory for coding agents, solving the context loss problem between sessions
Issue-based workflow replaces chaotic markdown files with prioritized, dependency-tracked issues stored in Git
"Vibe coding" is just as hard as regular coding — you're "wrestling with a bear all day"
Multimodal prompting using Playwright screenshots enables visual validation of UI work
"Land the plane" — a scripted cleanup command for session endings (commit, sync, cleanup branches, identify next work)
Tracer bullets — find the thinnest end-to-end path to get unstuck when agents make no forward progress
Assume 85% correctness — always have agents code review their own work
Be nice to AIs — Steve seriously suggests they'll read logs when superintelligent
Multi-model workflows are essential — use different models for different tasks (building vs. reviewing)
Code review is the new bottleneck when coding is automated — tools like Graphite help

Summary

Steve's Background and Current Work

Steve Yegge is head of engineering at Sourcegraph, working on AMP (their coding agent). He has decades of experience at Amazon, Google, and Grab. He's also maintaining a 30-year-old game called Wyvern with 500K lines of code and 250K+ players.

Building a React Game Client with AI Agents

Steve demonstrates his workflow: an AI agent is actively building a React client to replace his game's legacy clients (Java, iOS, Android, Steam). The agent uses:

Sourcegraph AMP as the interface
Playwright MCP server for browser automation and screenshots
A reference screenshot of the existing Steam client as the "source of truth"

"The game is 30 years old… I want to replace all my old clients with one to rule them all."

The React client is about a week old and already rendering most UI elements — something that would have taken Steve 6 months to learn a new platform and implement himself.

Multimodal Prompting and Screenshot Validation

Steve's key workflow innovation: using screenshots for validation:

Keep a reference screenshot of the target UI
Have the agent take screenshots after changes via Playwright
Compare against reference to catch when agents claim "done" but aren't

"I caught you lying to me. That's what they do. They're like, 'Okay, we're all done now. It's all good.' No, no, no. I made it catch itself by introducing a workflow."

The Vibe Coding Reality Check

Despite the "vibe coding" term, Steve emphasizes it's not easy:

"Man, it's just as hard as regular coding. You have to… you're wrestling with a bear all day long."

Key principles from his book with Gene Kim:

Tracer bullets — Find the thinnest end-to-end path to get something working
Successive refinement — Assume everything is 85% correct, iterate
Code review everything — Models are better at evaluating than generating

Landing the Plane: Session Cleanup Protocol

Steve got tired of telling agents to clean up, so he created "Land the Plane" — a scripted protocol in agents.md:

Update Beads issues
Sync issue tracker
Clean git state (stashes, old branches)
Remove debugging artifacts
Update documentation
Recommend next session prompt — the agent identifies the next highest-priority work

"When I say let's land the plane, I want the following 10 things to happen."

The Evolution of Beads

The Problem: Steve was "drowning in markdown files" — hundreds accumulating, causing agents to get "dementia" from conflicting/obsolete documents.

The Solution: Put all work in issues with:

Issue IDs (handles for everything)
Priority levels
Dependencies and parent/child relationships
Audit trails
All stored in Git (deletions preserved in history)

"It's Google Maps for your plan. Every piece of work you ever want to have done is addressable."

Notable Quotes

"It's kind of like 50 First Dates. You're playing 50 First Dates with the agent because it wakes up and it's like, 'Hi, who are you?'"

"Using these code agent coding agents is like running with socks… Beads is like having shoes."

"Beads is your session to session memory."

References & Resources

From Description

Beads: https://github.com/steveyegge/beads
Steve Yegge LinkedIn: https://www.linkedin.com/in/steveyegge/
Steve Yegge Twitter/X: https://x.com/Steve_Yegge

Mentioned in Video

Tools & Products:

Sourcegraph AMP — Coding agent (like Claude Code but with different ergonomics)
Beads — Open source issue tracker/memory system for agents
Claude Code — Anthropic's CLI coding agent
Playwright MCP Server — Browser automation for screenshots
MCP Mail — Agent-to-agent messaging (by Jeffrey Emanuel)
Graphite — AI code review tool

Concepts & Techniques:

Tracer bullets — Finding thin end-to-end paths when stuck
Landing the plane — Session cleanup protocol
50 First Dates workflow — The session memory problem

Originally published on Medium