#18 Nathan Labenz on reinforcement learning, reasoning models, emergent misalignment & more

by Consistently Candid

  • Release date: 2025-03-02 21:00:00
  • Length: 106:17