“METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman
by LessWrong (Curated & Popular)
Release date: 2025-04-07 20:15:39
Length: 11:09