GPT-5 is shockingly good at search, and that changes the "BM25 as a baseline" story. Using GPT-5 search trajectories from BrowseComp-Plus, I'll show how default BM25 parameters and evaluation harnesses can make lexical retrieval look weak, while real agent queries often play directly to BM25's strengths. Much like grep became a core retrieval primitive for coding agents, BM25 is re-emerging as a powerful primitive for agentic search.
Search & Retrieval sessions at AI Engineer World's Fair 2026 in San Francisco.
Tuesday, June 30, 2026
11:10 AM - 11:30 AM · 20m
Track 3 · Room 2003
Capacity: 250 attendees

Jo Kristian Bergum
CEO & co-founder
Hornet.dev
@jobergum
CEO of Hornet.dev, building the retrieval engine for agents