2:25 PM · Track 9 · Room 2016
What does it take to scale language models to year-long tasks? In this talk we'll cover the algorithmic, environment, and compute considerations for scaling language models to long horizons. We'll discuss the latest reinforcement learning approaches, how to build hard, high-fidelity long-horizon environments, and how to build scalable infrastructure for these tasks.
Speakers: Ross Taylor (General Reasoning); Chengxi Taylor (General Reasoning Inc.)
Data Quality · intermediate · talk