Listen to LessWrong (Curated & Popular) podcast

“A Pragmatic Vision for Interpretability” by Neel Nanda	LessWrong (Curated & Popular)	63:58
“AI in 2025: gestalt” by technicalities	LessWrong (Curated & Popular)	41:59
“Eliezer’s Unteachable Methods of Sanity” by Eliezer Yudkowsky	LessWrong (Curated & Popular)	16:13
“An Ambitious Vision for Interpretability” by leogao	LessWrong (Curated & Popular)	08:49
“6 reasons why ‘alignment-is-hard’ discourse seems alien to human intuitions, and vice-versa” by Steven Byrnes	LessWrong (Curated & Popular)	32:39
“Three things that surprised me about technical grantmaking at Coefficient Giving (fka Open Phil)” by null	LessWrong (Curated & Popular)	09:45
“MIRI’s 2025 Fundraiser” by alexvermeer	LessWrong (Curated & Popular)	15:37
“The Best Lack All Conviction: A Confusing Day in the AI Village” by null	LessWrong (Curated & Popular)	12:03
“The Boring Part of Bell Labs” by Elizabeth	LessWrong (Curated & Popular)	25:57
[Linkpost] “The Missing Genre: Heroic Parenthood - You can have kids and still punch the sun” by null	LessWrong (Curated & Popular)	04:18
“Writing advice: Why people like your quick bullshit takes better than your high-effort posts” by null	LessWrong (Curated & Popular)	09:21
“Claude 4.5 Opus’ Soul Document” by null	LessWrong (Curated & Popular)	79:57
“Unless its governance changes, Anthropic is untrustworthy” by null	LessWrong (Curated & Popular)	53:22
“Alignment remains a hard, unsolved problem” by null	LessWrong (Curated & Popular)	23:23
“Video games are philosophy’s playground” by Rachel Shu	LessWrong (Curated & Popular)	31:50
“Stop Applying And Get To Work” by plex	LessWrong (Curated & Popular)	02:52
“Gemini 3 is Evaluation-Paranoid and Contaminated” by null	LessWrong (Curated & Popular)	14:59
“Natural emergent misalignment from reward hacking in production RL” by evhub, Monte M, Benjamin Wright, Jonathan Uesato	LessWrong (Curated & Popular)	18:45
“Anthropic is (probably) not meeting its RSP security commitments” by habryka	LessWrong (Curated & Popular)	08:57
“Varieties Of Doom” by jdp	LessWrong (Curated & Popular)	98:48
“How Colds Spread” by RobertM	LessWrong (Curated & Popular)	20:31
“New Report: An International Agreement to Prevent the Premature Creation of Artificial Superintelligence” by Aaron_Scher, David Abecassis, Brian Abeyta, peterbarnett	LessWrong (Curated & Popular)	06:52
“Where is the Capital? An Overview” by johnswentworth	LessWrong (Curated & Popular)	18:06
“Problems I’ve Tried to Legibilize” by Wei Dai	LessWrong (Curated & Popular)	04:17
“Do not hand off what you cannot pick up” by habryka	LessWrong (Curated & Popular)	06:39
“7 Vicious Vices of Rationalists” by Ben Pace	LessWrong (Curated & Popular)	09:47
“Tell people as early as possible it’s not going to work out” by habryka	LessWrong (Curated & Popular)	03:19
“Everyone has a plan until they get lied to the face” by Screwtape	LessWrong (Curated & Popular)	12:48
“Please, Don’t Roll Your Own Metaethics” by Wei Dai	LessWrong (Curated & Popular)	04:11
“Paranoia rules everything around me” by habryka	LessWrong (Curated & Popular)	22:32
“Human Values ≠ Goodness” by johnswentworth	LessWrong (Curated & Popular)	11:31
“Condensation” by abramdemski	LessWrong (Curated & Popular)	30:29
“Mourning a life without AI” by Nikola Jurkovic	LessWrong (Curated & Popular)	11:17
“Unexpected Things that are People” by Ben Goldhaber	LessWrong (Curated & Popular)	08:13
“Sonnet 4.5’s eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals” by Alexa Pan, ryan_greenblatt	LessWrong (Curated & Popular)	35:57
“Publishing academic papers on transformative AI is a nightmare” by Jakub Growiec	LessWrong (Curated & Popular)	07:23
“The Unreasonable Effectiveness of Fiction” by Raelifin	LessWrong (Curated & Popular)	15:03
“Legible vs. Illegible AI Safety Problems” by Wei Dai	LessWrong (Curated & Popular)	03:29
“Lack of Social Grace is a Lack of Skill” by Screwtape	LessWrong (Curated & Popular)	11:08
[Linkpost] “I ate bear fat with honey and salt flakes, to prove a point” by aggliu	LessWrong (Curated & Popular)	01:07

“A Pragmatic Vision for Interpretability” by Neel Nanda

LessWrong (Curated & Popular)

63:58

“AI in 2025: gestalt” by technicalities

LessWrong (Curated & Popular)

41:59

“Eliezer’s Unteachable Methods of Sanity” by Eliezer Yudkowsky

LessWrong (Curated & Popular)

16:13

“An Ambitious Vision for Interpretability” by leogao

LessWrong (Curated & Popular)

08:49