AI Engineer World's Fair 2026

Build with Perception Agents

TalkIntermediate

Human-agent collaboration is changing, becoming more visual. Models can perceive, point, and verify, but most agents still rely on us typing a paragraph to explain what we're looking at. Meet perception agents: computer use agents that see screens how you see screens. They understand, reason, and verify their own work. They let you point, draw, and describe, just as people collaborate in real life. We call this shared perception, and at AGI Lab we just open-sourced the first two primitives of our perception agent harness: visual verification and visual annotation. In this workshop, you'll get hands-on with both, build one sample use case end-to-end, then take the primitives back to your day-to-day in a mini hackathon. Best ideas win prizes. Speakers: Emile Baizel — Amazon AGI Lab; Shruti Arora — Amazon AGI Lab.

About the Workshops Day 1 Track

Workshops Day 1 sessions at AI Engineer World's Fair 2026 in San Francisco.

Build with Perception Agents

About the Workshops Day 1 Track

When

Where

Speakers (2)

Build with Perception Agents

About the Workshops Day 1 Track

When

Where

Speakers (2)