Human-agent collaboration is changing, becoming more visual. Models can perceive, point, and verify, but most agents still rely on us typing a paragraph to explain what we're looking at. Meet perception agents: computer use agents that see screens how you see screens. They understand, reason, and verify their own work. They let you point, draw, and describe, just as people collaborate in real life. We call this shared perception, and at AGI Lab we just open-sourced the first two primitives of our perception agent harness: visual verification and visual annotation. In this workshop, you'll get hands-on with both, build one sample use case end-to-end, then take the primitives back to your day-to-day in a mini hackathon. Best ideas win prizes. Speakers: Emile Baizel — Amazon AGI Lab; Shruti Arora — Amazon AGI Lab.
Workshops Day 1 sessions at AI Engineer World's Fair 2026 in San Francisco.
Monday, June 29, 2026
2:20 PM - 4:20 PM·2h
Track 8 · Room 2020
Capacity: 250 attendees
Sign in to add this talk to your schedule.

Emile Baizel
Amazon AGI Lab
Co-presenter for the Amazon AGI Lab workshop "Build with Perception Agents" at AI Engineer World's Fair 2026.

Shruti Arora
Amazon AGI Lab
Co-presenter for the Amazon AGI Lab workshop "Build with Perception Agents" at AI Engineer World's Fair 2026.