Knowing which generation your agent is in, which failure modes your current evals are blind to, and what to build next is the difference between shipping with confidence and flying blind. Agent architectures have evolved through six generations: prompt, chain, ReAct loop, workflow graph, modern agent loop, and AI harness. Each one quietly breaks the eval strategy of the generation before it. A prompt-quality rubric won't catch a bad tool call; a trace scorer won't catch memory poisoning. Using a single SRE incident response agent threaded through every generation, this talk shows exactly where each architecture outgrows its evals and what you need to close the gap.
AI Architects: Show my Workflow sessions at AI Engineer World's Fair 2026 in San Francisco.
Tuesday, June 30, 2026
11:10 AM - 11:30 AM · 20m
Leadership 2 · Room 3020
Capacity: 550 attendees

Ameya Bhatawdekar
Braintrust
Ameya Bhatawdekar is speaking at AI Engineer World's Fair 2026.