Claws & Personal Agents

Your Voice Agent is Just a Walkie-Talkie

TalkIntermediate

Everyone says cascaded voice pipelines are dead and native speech models are the future. Yet production environments are still dominated by STT-LLM-TTS stacks. Reconciling the natural flow of native audio with the elite reasoning of a cascaded agent remains an unsolved systems problem. This talk dissects the brutal technical trade-offs behind that counterintuitive reality. We will break down why your voice agent is still stuck behaving like a walkie-talkie and map out the specific technical roadmap required to build full-duplex AI that actually works.

About the Claws & Personal Agents Track

Claws & Personal Agents sessions at AI Engineer World's Fair 2026 in San Francisco.

When

Tuesday, June 30, 2026

12:05 PM - 12:25 PM·20m

Where

Track 6 · Room 2014

Capacity: 250 attendees

Speaker

Neil Zeghidour

CEO

Gradium

@neilzegh

Neil Zeghidour is the co-founder and CEO of Gradium. Neil founded Gradium after a decade of building and leading frontier generative audio teams at Meta and Google DeepMind. Being frustrated by slow and brittle voice assistants , he built the engineering teams that developed the first neural audio codecs and introduced the first audio LLMs, such as AudioLM, at Google. He later created Kyutai to launch Moshi, the world's first real-time, full-duplex conversational AI , and Hibiki, the first simultaneous speech-to-speech translation system. Today, Gradium is focused on helping developers build natural, real-time voice agents by providing ultra-low latency streaming APIs that transition these breakthroughs from the research lab to production.

AI Engineer World's Fair 2026