[Linkpost] “METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman
by LessWrong (Curated & Popular)
Release date: 2025-03-19 23:30:41
Length: 01:19