“Alignment Faking Revisited: Improved Classifiers and Open Source Extensions” by John Hughes, abhayesian, Akbir Khan, Fabien Roger
by LessWrong (Curated & Popular)
Release date: 2025-04-09 10:15:40
Length: 41:04