AI Architects: Show my Workflow

Your Agent Evolved. Your Evals Didn't.

TalkIntermediate

Knowing which generation your agent is in, which failure modes your current evals are blind to, and what to build next is the difference between shipping with confidence and flying blind. Agent architectures have evolved through six generations; prompt, chain, ReAct loop, workflow graph, modern agent loop, AI harness. And each one quietly breaks the eval strategy of the generation before it. A prompt-quality rubric won't catch a bad tool call; a trace scorer won't catch memory poisoning. Using a single SRE incident response agent threaded through every generation, this talk shows exactly where each architecture outgrows its evals and what you need to close the gap.

About the AI Architects: Show my Workflow Track

AI Architects: Show my Workflow sessions at AI Engineer World's Fair 2026 in San Francisco.

When

Tuesday, June 30, 2026

11:10 AM - 11:30 AM·20m

Where

Leadership 2 · Room 3020

Capacity: 550 attendees

Speaker

Ameya Bhatawdekar

Braintrust

Ameya Bhatawdekar is speaking at AI Engineer World's Fair 2026.

AI Engineer World's Fair 2026