The four golden signals (Latency, Errors, Traffic, Saturation) have been the foundation of application monitoring for years, and it still matters, but for GenAI applications, these signals alone leave significant blind spots. A request can return 200 OK with low latency while the response hallucinates, leaks PII, or costs much more than expected. This talk will walk you through what changes when you're monitoring non-deterministic, token-priced, prompt-injectable systems. We'll cover three additional monitoring dimensions: Cost (token attribution, model-mix tracking, wasted spend on failed requests), Safety (prompt injection detection, PII scanning, jailbreak attempts), and Quality (hallucination rate, relevance scoring, user satisfaction) and show why each one is necessary alongside your existing instrumentation.
Expo Stage 1 sessions at AI Engineer World's Fair 2026 in San Francisco.
Tuesday, June 30, 2026
2:25 PM - 2:45 PM·20m
Expo Stage 1
Capacity: 250 attendees
Sign in to add this talk to your schedule.
Marina Petzel
Marina Petzel is speaking at AI Engineer World's Fair 2026.