AI Engineer WF 2026
ScheduleSpeakers
Sign In
Sign In
Speakers/TBA
TBA

TBA

Speaker

Speaker to be announced.

Sessions (121)

SonarQube + OpenAI: Wiring Your Team for Agentic Development
1:15 PM·Track 6 · Room 2014

As AI agents take on increasingly complex development tasks, the critical challenge has shifted from generation to verification. A growing body of evidence suggests that as models grow more capable, failures become more frequent and more convincing, making cognitive surrender among human reviewers an acute risk. This talk introduces Sonar's Agent Centric Development Cycle (AC/DC), a three-stage continuous loop of Guide, Verify, and Solve, as the engineering discipline teams need to build now. Teams that embrace AC/DC guide agents within their organizational standards before they write a line of code, verify output in real-time, and solve issues automatically without manual triage. This session will also feature a live demo of the SonarQube OpenAI plugin, showing how a well-guided agent produces code that is faster to verify and cheaper to fix.

Track 6intermediatesponsor
Build a Document Triage Agent with Reducto: Classify, Extract, and More
1:15 PM·Track 7 · Room 2024

Ingest a mixed corpus (think insurance claims, legal filings, or medical intake forms), classify each doc, extract relevant fields per type, and route to downstream handlers. We'll cover the full agentic document workflow end to end, and show you how to use Reducto Studio to do it. Learn how to build Reducto pipelines from scratch that can handle a corpus of mixed documents.

Track 7intermediate
Neo4J L&L
1:15 PM·Track 2 · Room 2006

Track 2intermediatesponsor
Runway AI Film Festival
6:15 PM·Expo Stage 3

Runway's annual AI Festival — a celebration of creatives experimenting at the forefront of art and technology across film, design, new media, fashion, advertising, and gaming, with a screening of finalist AI films. https://aif.runwayml.com/

Expo Stage 3intermediatetalk
Exa Expo Session
10:45 AM·Expo Stage 4

Expo Stage 4intermediatetalk
Kimchi Expo Session
11:10 AM·Expo Stage 3

Expo Stage 3intermediatetalk
Telnyx Expo Session
11:40 AM·Expo Stage 1

Expo Stage 1intermediatetalk
TBA
11:40 AM·Track 9 · Room 2016

Data Qualityintermediatetalk
TBA
12:05 PM·Track 5 · Room 2005

Securityintermediatesponsor
TBA
12:05 PM·Track 8 · Room 2020

Forward Deployed Engineeringintermediatetalk
Lunch
12:30 PM·Track 1 · Room 2010

Claws & Personal Agentsbeginnerplenary
Lunch
12:30 PM·Track 2 · Room 2006

Vision & OCRbeginnerplenary
Lunch
12:30 PM·Track 3 · Room 2003

Search & Retrievalbeginnerplenary
Lunch
12:30 PM·Track 4 · Room 2009

Workshops Day 2beginnerplenary
Lunch & Learn
12:30 PM·Track 5 · Room 2005

Securitybeginnerplenary
Lunch
12:30 PM·Track 7 · Room 2024

LLM Recsysbeginnerplenary
Lunch
12:30 PM·Track 6 · Room 2014

Voice & Realtime AIbeginnerplenary
Lunch
12:30 PM·Expo Stage 4

Expo Stage 4beginnerplenary
Lunch
12:30 PM·Expo Stage 3

Expo Stage 3beginnerplenary
Lunch
12:30 PM·Expo Stage 2

Expo Stage 2beginnerplenary
Lunch
12:30 PM·Expo Stage 1

Expo Stage 1beginnerplenary
Lunch
12:30 PM·Leadership 2 · Room 3020

AI Architects: Show my Workflowbeginnerplenary
Lunch
12:30 PM·Leadership 1 · Room 3016

AI-Native Enterprisesbeginnerplenary
Lunch
12:30 PM·Track M · Room 2001

Track Mbeginnerplenary
Lunch
12:30 PM·Track 9 · Room 2016

Data Qualitybeginnerplenary
Lunch
12:30 PM·Track 8 · Room 2020

Forward Deployed Engineeringbeginnerplenary
Deepmind Expo Session 1
1:30 PM·Expo Stage 2

Expo Stage 2intermediatetalk
TBA
1:30 PM·Track 7 · Room 2024

LLM Recsysintermediatetalk
TBA
1:55 PM·Track 7 · Room 2024

LLM Recsysintermediatetalk
Greptile Expo Session
1:55 PM·Expo Stage 3

Expo Stage 3intermediatetalk
Continuous Engineering: Software Development for the Age of Agents
2:25 PM·Expo Stage 4

AI has changed everything about how we write code. But the hard parts of building software have gotten even harder: aligning your team, maintaining architectural integrity, and worst of all, reviewing the oceans of agent-driven code. The tools and processes we rely on git pull requests; code review were built for emailing patch files. We need a new paradigm. In this talk, we're going to explore Continuous Engineering, a new approach to software development that treats the agent thread as the core unit of collaboration. Branches should be as cheap as ideas, code should carry the context of the conversation that generated it, and the work should be available to your colleagues (and their agents) as it happens. We'll walk through what this looks like in practice, and what we're building to make it possible.

Expo Stage 4intermediate
TBA
2:25 PM·Track 7 · Room 2024

LLM Recsysintermediatetalk
Buildkite Expo 1
2:25 PM·Expo Stage 3

Expo Stage 3intermediatetalk
TBA
2:50 PM·Track 7 · Room 2024

LLM Recsysintermediatetalk
Self-Driving Production: AI Wrote your Code. AI Should Fix It, Too
2:50 PM·Expo Stage 4

Self-driving production is the next frontier of autonomous software development but getting there is a journey. In this session, we ll show how enterprises are progressing from manual operations and AI copilots toward closed-loop, autonomous production systems with Traversal.

Expo Stage 4intermediatetalk
Baseten Expo Session
2:50 PM·Expo Stage 3

Expo Stage 3intermediatetalk
Video Discovery for Agentic World-Model Training
2:50 PM·Expo Stage 2

Physical AI had its “Attention Is All You Need” moment with the rise of Vision-Language-Action models. The next bottleneck is data: not just more video, but the ability to find the exact real-world moments that teach models how the world works: gravity, motion, causality, human behavior, and object interactions. This session explores a new approach: discovering specific scenes from the vastness of the web. We’ll show how teams can search for moments like objects falling, people interacting with environments, or actions unfolding over time, then collect and structure only the relevant clips for training and evaluation. Attendees will learn how scene-level discovery changes multimodal data pipelines, reducing wasted collection, processing, storage, and review, while making it easier to build targeted datasets for VLA systems, robotics, physical AI, and agentic world models.

Expo Stage 2intermediate
6 Pillars of an Agentic Harness That Fixes Production Incidents
2:50 PM·Expo Stage 1

A model delights us when any plausible answer works, but a production incident has one right answer, and the model alone can't reliably reach it. Getting there depends less on the model and more on the orchestration, context, and judgment built around it. That work is harness engineering, and it is the new frontier. This session breaks down the six pillars of an agentic harness required to fix production incidents: model orchestration, context, reasoning, actions, learning, and evals. Join Resolve AI to walk through what each one does, why a better model doesn't make any of them go away, and how they compose to find the root cause of a live incident across massive context, under a clock, with real revenue on the line.

Expo Stage 1intermediate
While You Were Generating: The Verification Gap Nobody Talked About
3:20 PM·Expo Stage 1

Every enterprise is asking the same question: how do we move fast with AI without breaking things? While the market chased generation — better models, faster agents, more output — a different problem was compounding quietly: nobody built the verification layer to match. The team built Gitar because they saw firsthand what happens when development velocity outpaces code quality, and AI has made that problem an order of magnitude bigger. In this session, Ali-Reza Adl-Tabatabai, formerly of Uber, Google, and Meta, now leading Gitar development inside Sonar, makes the case for why AI-native code review is the missing layer in every enterprise's agentic stack. Gitar uses agentic reasoning to review code, generate fixes, validate them against your CI, and commit to the branch. It automatically analyzes and de-duplicates CI failures, detects flaky tests, and fixes remaining build, lint, and test failures — keeping reviews moving across time zones without the back-and-forth that kills engineering throughput. As a critical layer in Sonar's multilayered, zero-trust verification platform, Gitar enables organizations to analyze syntax, data flows, logic flows, architectures, and dependencies; set and enforce standards in a consistent, auditable manner; and agentically fix issues both as agents write code and in CI workflows. Sonar intelligently sequences analysis so deterministic verification handles simpler issues first, while AI tackles the nuanced ones, reducing token costs and keeping the pipeline lean. In an agentic world, zero trust is an engineering principle: assume every line an agent writes needs to be verified, every time, at every layer.

TBA
3:45 PM·Track 7 · Room 2024

LLM Recsysintermediatetalk
Circle Expo Session
10:45 AM·Expo Stage 1

Expo Stage 1intermediatetalk
TBA
11:10 AM·Track 9 · Room 2016

Posttraining & Midtrainingintermediatetalk
TBA
11:10 AM·Track 1 · Room 2010

Sandbox & Platform Engineeringintermediatetalk
Fault-Tolerant Training at Scale: Making Hardware Failures a Non-Event
11:40 AM·Expo Stage 1

Hardware failures in large-scale distributed training are inevitable when you're running thousands of GPUs, they happen multiple times a day. The standard response is manual intervention: an engineer gets paged, SSHs into the cluster, and spends an hour fixing something the infrastructure should have handled automatically. That lost time compounds directly into wasted compute and delayed research. This session walks through the self-healing platform Crusoe built to eliminate that manual loop entirely a managed Slurm environment running on Kubernetes, with automated node failure remediation and real-time cluster observability and how these components work together so hardware failures become a non-event. We'll cover this architecture end-to-end: how running Slurm on Kubernetes unlocks infrastructure resilience that traditional GPU clusters don't have, how automated hardware monitoring and node remediation can eliminate manual intervention entirely, and how full observability into every remediation event keeps engineering teams informed without keeping them on-call. For teams that want deeper control, we'll also discuss open-loop remediation, which gives teams full control over the node replacement process for application-specific workflows.

Expo Stage 1
AI agents don't read your policy docs. They hit your APIs.
11:40 AM·Expo Stage 4

Every organisation has a policy for what AI should and shouldn't do. But in the era of autonomous agents, who is that document actually for? Odds are no agent has ever read it. It opens a connection and makes a call, and whatever happens at that millisecond is your real policy. So put the control there. This talk is about the gateway as the runtime where AI governance actually executes: per-agent identity and scoped, short-lived credentials instead of a shared god-key. PII and secrets stripped from prompts in flight. Token-aware rate limits so one looping agent can't torch your quota. Semantic caching that cuts spend and latency on requests you've already answered. I'll share the architectural patterns behind each control, what they look like in practice, and what breaks the moment you take them away. Policy states intent. Infrastructure enforces it.

Expo Stage 4intermediate
TBA
12:05 PM·Track 9 · Room 2016

Posttraining & Midtrainingintermediatetalk
TBA
12:05 PM·Track 5 · Room 2005

Evalsintermediatesponsor
Lunch
12:30 PM·Track 6 · Room 2014

AI Designers/Design Engineersbeginnerplenary
Lunch
12:30 PM·Leadership 2 · Room 3020

AI Architects: Tokenmaxxingbeginnerplenary
Lunch
12:30 PM·Expo Stage 1

Expo Stage 1beginnerplenary
Lunch
12:30 PM·Expo Stage 4

Expo Stage 4beginnerplenary
Lunch
12:30 PM·Expo Stage 3

Expo Stage 3beginnerplenary
Lunch
12:30 PM·Expo Stage 2

Expo Stage 2beginnerplenary
Lunch
12:30 PM·Track 1 · Room 2010

Sandbox & Platform Engineeringbeginnerplenary
Lunch & Learn
12:30 PM·Track 2 · Room 2006

Robotics & World Modelsbeginnerplenary
Lunch
12:30 PM·Track 3 · Room 2003

Memory & Continual Learningbeginnerplenary
Lunch
12:30 PM·Track 4 · Room 2009

Workshops Day 3beginnerplenary
Lunch & Learn
12:30 PM·Track 5 · Room 2005

Evalsbeginnerplenary
Lunch
12:30 PM·Track 7 · Room 2024

Computer Usebeginnerplenary
Lunch
12:30 PM·Track 8 · Room 2020

Context Engineeringbeginnerplenary
Lunch
12:30 PM·Track 9 · Room 2016

Posttraining & Midtrainingbeginnerplenary
Lunch
12:30 PM·Track M · Room 2001

Track Mbeginnerplenary
Lunch
12:30 PM·Leadership 1 · Room 3016

AI-Native Enterprisesbeginnerplenary
TBA
1:30 PM·Track 2 · Room 2006

Robotics & World Modelsintermediatesponsor
Daytona Expo 1
1:30 PM·Expo Stage 4

Expo Stage 4intermediatetalk
Deepmind Expo Session 3
1:30 PM·Expo Stage 3

Expo Stage 3intermediatetalk
Natively Multimodal from Step Zero
1:55 PM·Expo Stage 4

Most AI models start as text systems and have vision, audio, and other modalities added later. That ordering shows up in the work: handoffs between modalities, brittle understanding of mixed inputs, and gaps that surface exactly when real tasks demand reading a chart, a document, and code together. This session looks at a different approach — models trained as multimodal from step zero, where text, image, audio, and video share the same foundation rather than being stitched together. We'll look at why that matters for the kind of work organizations actually want from AI: understanding messy, mixed real-world inputs, holding context across them, and carrying complex tasks end to end. The throughline is what this unlocks for teams deciding where AI can take real work today — and how MiniMax is building toward that frontier.

Expo Stage 4intermediate
Warp Expo Session
1:55 PM·Expo Stage 3

Expo Stage 3intermediatetalk
Latency Is a Budget. Humanlike Is the Goal.
2:25 PM·Expo Stage 3

Most agents do their work in the background. They write code, automate tasks, and run research. But the moment an agent has to interact with a human in real time, everything you know about building and evaluating it changes. This session is about designing humanlike agents that can hear, see, and speak. It starts with the question nobody can answer today. With hundreds of models to choose from, how do you pick a stack that holds up in a live conversation? We'll show why public leaderboards fail for realtime agents, and why the latency on your dashboard isn't what your users experience. Then we'll flip the process around. Define the outcomes you want as human-equivalent behaviors, and work backwards from there to your evaluations, your models, and a production iteration loop. You'll leave with a concrete decision framework and an open benchmark you can run yourself.

Expo Stage 3intermediate
Lightrun Expo Session
2:25 PM·Expo Stage 4

Expo Stage 4intermediatetalk
TBA
2:25 PM·Main Stage

Autoresearchintermediatetalk
The Rise of CaaS: Context-as-a-Service for Agentic AI
2:25 PM·Track 7 · Room 2024

Agentic workflows have commoditized. The new bottleneck is context. As models improve, AI agents are increasingly limited not by reasoning ability, but by the quality, freshness, and specificity of the information they can access. This session introduces Context as a Service, or CaaS, an emerging category for builders creating web-native context layers for AI agents. These tools collect, structure, enrich, index, and analyze live web data, making it available as agent-ready knowledge for specific use cases and vertical downstream applications. We ll explore how builders are turning hard-to-access web domains into agent-ready context layers: fragmented public data, dynamic sources, multimodal content, and fast-changing signals that generic models cannot reliably process within their token limits. Attendees will learn how to think about CaaS as both a technical architecture and a market opportunity: what to build, where context creates defensibility, and how raw web data can become the foundation for reliable agentic products.

Computer Use
TBA
2:50 PM·Track 6 · Room 2014

AI Designers/Design Engineersintermediatetalk
Vibe Code Safely: Introducing Gadgets
3:20 PM·Expo Stage 4

We ve all heard that the future belongs to custom, AI-generated micro-apps, but how do we actually make them secure? Hear more from Cloudflare on the debut of Gadgets, an AI productivity suite that makes personal app creation scalable and safe for everyone.

Expo Stage 4intermediatetalk
The Self-Improving OSS Agent Stack
3:20 PM·Expo Stage 1

Agents are starting to debug and improve themselves: production traces become evals, evals propose PRs, and PRs are tested against datasets before they ship. Langfuse co-founder, Marc, will live-demo this loop in Langfuse. He'll make the case that the infrastructure underlying this powerful loop should be open-source.

Expo Stage 1intermediatetalk
Runpod Expo Session
3:20 PM·Expo Stage 2

Expo Stage 2intermediatetalk
TBA
4:50 PM·Main Stage

Autoresearchbeginnerkeynote
TBA
10:45 AM·Track 4 · Room 2009

Local AIintermediatetalk
Elastic Expo Session
10:45 AM·Expo Stage 4

Expo Stage 4intermediatetalk
TBA
10:45 AM·Track 1 · Room 2010

Generative Mediaintermediatetalk
Your AI Agent Has No Nervous System
11:10 AM·Expo Stage 4

Expo Session 18 minutes Expo floor stage Expo Sessions are dedicated, 18-minute technical presentations delivered by sponsors in designated Expo Session rooms during conference expo hours. These sessions are designed to allow sponsors to engage directly with attendees through a structured, technical presentation format. To give you an overview of what happens in an expo session and how its being collected please see key details below: Duration: Sessions run for approximately 18 minutes each. Placement: They take place in dedicated Expo Session rooms during scheduled expo hours and are listed in the official conference agenda and event schedule. Content Focus: Sessions should be technical and informative, focusing on thought leadership, deep technical insights, architecture discussions, or engineering case studies. They are intended to drive interest to your product/service/booth exhibit by showing how your team is solving technical problems and are explicitly discouraged from being overly promotional or a "vendor pitch". Sponsors commonly use these for technical deep dives, product demonstrations, implementation walkthroughs, or customer case studies. Lead Capture: Opt-in lead data is provided for attendees who scan into the session. This lead data includes Name, Email, Job Title, Company, City, Country, and Company Size.Session Title and Session Lead: Once you have access in your Accel Events sponsor portal, you'll be able to add your session title and session lead directly. Please note that you can update/edit this until June 1, 2026.

Expo Stage 4
TBA
11:10 AM·Track 4 · Room 2009

Local AIintermediatetalk
Keycard AI Expo 1
11:10 AM·Expo Stage 2

Expo Stage 2intermediatetalk
[Unblocked Expo Session 3]
11:40 AM·Expo Stage 4

Expo Stage 4intermediatetalk
TBA
11:40 AM·Track 4 · Room 2009

Local AIintermediatetalk
TBA
12:05 PM·Track 3 · Room 2003

AI in Financeintermediatetalk
PLANETSCALE Expo 1
12:05 PM·Expo Stage 3

Expo Stage 3intermediatetalk
Replicated Expo Session
12:05 PM·Expo Stage 2

Expo Stage 2intermediatetalk
TBA
12:05 PM·Track 4 · Room 2009

Local AIintermediatetalk
Lunch
12:30 PM·Track 9 · Room 2016

Inferencebeginnerplenary
Lunch
12:30 PM·Track 8 · Room 2020

Agentic Engineeringbeginnerplenary
Lunch
12:30 PM·Track 7 · Room 2024

AI in Healthcarebeginnerplenary
Lunch
12:30 PM·Track 6 · Room 2014

AI in GTMbeginnerplenary
Lunch & Learn
12:30 PM·Track 5 · Room 2005

Graphsbeginnerplenary
Lunch
12:30 PM·Track 4 · Room 2009

Local AIbeginnerplenary
Lunch
12:30 PM·Track 3 · Room 2003

AI in Financebeginnerplenary
Lunch & Learn
12:30 PM·Track 2 · Room 2006

Agentic Commercebeginnerplenary
Lunch
12:30 PM·Track 1 · Room 2010

Generative Mediabeginnerplenary
Lunch
12:30 PM·Main Stage

Harness Engineeringbeginnerplenary
Lunch
12:30 PM·Expo Stage 4

Expo Stage 4beginnerplenary
Lunch
12:30 PM·Expo Stage 2

Expo Stage 2beginnerplenary
Lunch
12:30 PM·Expo Stage 1

Expo Stage 1beginnerplenary
Lunch
12:30 PM·Leadership 2 · Room 3020

AI Architects: AI Factoriesbeginnerplenary
Lunch
12:30 PM·Leadership 1 · Room 3016

AI-Native Enterprisesbeginnerplenary
Lunch
12:30 PM·Track M · Room 2001

Track Mbeginnerplenary
Lunch
12:30 PM·Expo Stage 3

Expo Stage 3beginnerplenary
Video Discovery for Agentic World-Model Training
1:30 PM·Expo Stage 4

Physical AI had its “Attention Is All You Need” moment with the rise of Vision-Language-Action models. The next bottleneck is data: not just more video, but the ability to find the exact real-world moments that teach models how the world works: gravity, motion, causality, human behavior, and object interactions. This session explores a new approach: discovering specific scenes from the vastness of the web. We’ll show how teams can search for moments like objects falling, people interacting with environments, or actions unfolding over time, then collect and structure only the relevant clips for training and evaluation. Attendees will learn how scene-level discovery changes multimodal data pipelines, reducing wasted collection, processing, storage, and review, while making it easier to build targeted datasets for VLA systems, robotics, physical AI, and agentic world models.

Expo Stage 4intermediate
TBA
1:30 PM·Track 4 · Room 2009

Local AIintermediatetalk
Dash0 Add-On Expo Session
1:55 PM·Expo Stage 1

Expo Stage 1intermediatetalk
LlamaIndex Expo Session
1:55 PM·Expo Stage 4

Expo Stage 4intermediatetalk
TBA
1:55 PM·Track 4 · Room 2009

Local AIintermediatetalk
How Reducto parsed the Epstein Files for the Viral JMail Project: The Secret Complexities of Documents
2:25 PM·Expo Stage 2

When the Epstein files dropped, a team of indie hackers built JMail: a duplicate of Gmail that was logged in as Jeffrey himself. It went viral. But the parsing problem underneath it was brutal. Court documents are some of the nastiest inputs a parser can face. Scanned exhibits with varying resolution, redactions sitting directly over key text, inconsistent formatting across decades of filings, handwritten annotations mixed into typed pages, documents photocopied from a photocopy of a photocopy. But legal is just one flavor of hard. In finance, you're dealing with tables nested inside tables, footnotes that span pages, and numbers that mean different things depending on which section of the filing you're in. In healthcare, it's mixed handwritten and typed content, inconsistent date formats, and forms that were designed in 1987 and never updated. In government records, it's degraded scans, stamps overlapping text, and documents where a key field is missing on half the corpus. Every industry has its own specific ways documents break parsers. This session walks through the failure modes we've actually hit across these corpora, what causes them, and how to build pipelines that hold up when the documents stop cooperating.

Expo Stage 2
TBA
2:25 PM·Track 4 · Room 2009

Local AIintermediatetalk
Nebius Expo Session
2:25 PM·Expo Stage 1

Expo Stage 1intermediatetalk
[Braintrust Expo Session 3]
2:25 PM·Expo Stage 3

Expo Stage 3intermediatetalk
TBA
2:50 PM·Track 4 · Room 2009

Local AIintermediatetalk
The Software Factory
2:50 PM·Expo Stage 4

In the leading engineering organizations, a single engineer now supervises teams of agents, migrations scoped for years close in weeks, and code review has become the tightest constraint in the system. The teams pulling ahead are operating a software factory: an integrated system of agents that share context across the entire SDLC. This session is a field guide to that operating model and how it runs at scale: what each stage looks like in practice, what shifts for engineers as they move from writing code to stewarding the system, and the hard truths that decide whether a factory compounds, starting with why the infrastructure you built for humans sets the ceiling on what agents can do.

Expo Stage 4intermediate
TBA
3:20 PM·Track 4 · Room 2009

Local AIintermediatetalk
Deepmind Expo Session 2
3:20 PM·Expo Stage 2

Expo Stage 2intermediatetalk
Composio Expo Session
3:45 PM·Expo Stage 2

Expo Stage 2intermediatetalk
TBA
3:45 PM·Track 4 · Room 2009

Local AIintermediatetalk
sponsor
talk
talk
talk
Expo Stage 1
intermediate
talk
intermediate
talk
talk
talk
talk
intermediate
talk
intermediate
talk
talk
intermediate
talk
talk