AI Engineer World's Fair 2026

From Agent Prototype to Production: Evaluation-Led Development in Microsoft Foundry

Talk · Intermediate

AI agents can look impressive in a demo but behave inconsistently in production. The real challenge is knowing how to improve quality systematically: defining what success looks like, testing agent behavior, identifying failure modes, and making the right changes to prompts, models, tools, and orchestration. Using Claude in Microsoft Foundry, we’ll walk through how teams can define success criteria, compare outputs, test agent behavior, diagnose failures, and iterate through the Foundry control plane. You’ll leave with a repeatable approach for moving agents from experimentation to production with greater confidence while meeting enterprise expectations for data control, governance, procurement, and cloud alignment.

About the Expo Stage 3 Track

Expo Stage 3 sessions at AI Engineer World's Fair 2026 in San Francisco.
