AI Engineer WF 2026
ScheduleSpeakers
Sign In
Sign In
Speakers/Miguel González Fernández
Miguel González Fernández

Miguel González Fernández

Browserbase

Miguel González Fernández is speaking at AI Engineer World's Fair 2026.

Sessions (1)

The Art of Building Verifiers for Computer Use Agents
11:40 AM·Expo Stage 1

Every team building browser agents has the same problem: you can't trust your own evals. Browser tasks are too open-ended for deterministic checks, so teams use LLM verifiers as judges, and the judges are wrong constantly. WebVoyager misses 45% of failures. WebJudge misses 22%. Used as RL reward, you're not training a better agent, you're training a more confident liar. This talk walks through the Universal Verifier, open-sourced with Microsoft Research: false positive rate near zero, Cohen's kappa matching human-human agreement. Four design principles, one open benchmark, and an honest account of where auto-research worked and where it plateaued. Speakers: Miguel González Fernández — Browserbase; Corby Rosset — Microsoft Research.

Expo Stage 1intermediatetalk