#18 Nathan Labenz on reinforcement learning, reasoning models, emergent misalignment & more

/ Consistently Candid

  • 2025-03-02 21:00:00リリースの日付
  • 106:17継続時間