#18 Nathan Labenz on reinforcement learning, reasoning models, emergent misalignment & more

di Consistently Candid

  • 2025-03-02 21:00:00Data di uscita
  • 106:17Durata