Legora’s foundational engineering challenge is connecting frontier LLMs to billions of legal documents so the models can efficiently solve end-to-end legal workflows without burning extra tokens.

We’ll share the retrieval architecture we built with turbopuffer that achieves:
1. Strict data isolation across millions of legal cases in a very security-conscious domain
2. Predictable search performance (<100ms p90 latency) on large contexts
3. High retrieval quality (95%+ recall@10) with fewer agent loops

We’ll look back at two earlier architectures that failed to achieve all three (and why), and at the key design factors that make the current solution work at our scale.

Practical takeaways include:
- How to evaluate per-tenant vs shared-index retrieval under strict data isolation
- How to efficiently index and retrieve context to maximize relevance per input token
- How to build a highly intelligent AI application when your inference budget is constrained

Speakers: Simon Eskildsen (turbopuffer); Jacob Lauritzen (Legora).
Search & Retrieval sessions at AI Engineer World's Fair 2026 in San Francisco.
Tuesday, June 30, 2026
1:30 PM - 1:50 PM · 20m
Track 3 · Room 2003
Capacity: 250 attendees

Simon Eskildsen
Co-Founder & CEO
turbopuffer
@Sirupsen
Simon Eskildsen is the co-founder and CEO of turbopuffer. He was formerly a Principal Engineer at Shopify, where he helped scale infrastructure from 1K to 1M RPS.

Jacob Lauritzen
CTO
Legora
CTO at Legora.