AI Engineer WF 2026
ScheduleSpeakers
Sign In
Sign In
Speakers/Philip Kiely
Philip Kiely

Philip Kiely

Developer Relations

Baseten

@philip_kiely

Philip Kiely leads Developer Relations at Baseten. Prior to joining Baseten in 2022, he worked across software engineering and technical writing for a variety of startups. Outside of work, you'll find Philip practicing martial arts, reading a new book, or cheering for his adopted bay area sports teams.

Sessions (1)

What's New in Inference Engineering
1:30 PM·Track 9 · Room 2016

More than 30,000 engineers have learned the fundamentals of inference since Inference Engineering was published. But the field keeps accelerating, so it's time for the first public addendum to the book. The past four months have seen a renewed focus on training-dependent inference optimization across the "big three" performance techniques of speculation, caching, and quantization. This talk provides structured guidance for training DFlash and EAGLE 3 draft models to accelerate LLM decode, introduces the concept of KV compaction, and explains the hype behind TurboQuant.

Inferenceintermediatetalk