LessWrong (30+ Karma)

Audio narrations of LessWrong posts.

Philosophy Society & Culture Technology

Update frequency: every day
Average duration: 18 minutes
Episodes: 583
Years Active: 2025

“Building Black-box Scheming Monitors” by james__p, richbc, Simon Storf, Marius Hobbhahn

This is a five-week interim report for a MATS 8.0 project supervised by Marius Hobbhahn. Produced as part of the ML Alignment & Theory Scholars Program - Summer 2025 Cohort.

Executive Summary

Our g…

00:23:30 | Thu 31 Jul 2025

“Follow-up to ‘My Empathy Is Rarely Kind’” by johnswentworth

There's now over 100 comments on “My Empathy Is Rarely Kind”, and the consensus response is roughly “John, that's not what empathy is”.

So, first things first: I agree. I was using the wrong words f…

00:03:35 | Thu 31 Jul 2025

“I am worried about near-term non-LLM AI developments” by testingthewaters

TL;DR

I believe that:

Almost all LLM-centric safety research will not provide any significant safety value with regards to existential or civilisation-scale risks.
The capabilities-related forecast…

00:10:55 | Thu 31 Jul 2025

“Childhood and Education: College Admissions” by Zvi

Table of Contents

College Applications.
The College Application Essay (Is) From Hell.
Don’t Guess The Teacher's Password, Ask For It Explicitly.
A Dime a Dozen.
Treat Admissions Essays Like…

00:33:33 | Thu 31 Jul 2025

“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout

Produced as part of MATS 8.0 under the mentorship of Alex Turner and Alex Cloud. This research note overviews some early results which we are looking for feedback on.

TL;DR: We train language model…

00:11:31 | Wed 30 Jul 2025

“China proposes new global AI cooperation organisation” by Matrice Jacobine

SHANGHAI, July 26 (Reuters) - China said on Saturday it wanted to create an organisation to foster global cooperation on artificial intelligence, positioning itself as an alternative to the U.S. as …

00:02:11 | Wed 30 Jul 2025

“My Empathy Is Rarely Kind” by johnswentworth

There's a narrative I hear a lot: if I empathize more, put myself in other peoples’ shoes, try to feel what they’re feeling, see things from their perspective, etc, then I’ll feel kinder toward them…

00:06:29 | Wed 30 Jul 2025

“The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)” by GideonF

I’d like to thank my mentors David Duvenaud, Raymond Douglas, David Krueger and Jan Kulveit for providing helpful ideas, comments and discussions. The views expressed in this, and any mistakes, are …

00:18:22 | Wed 30 Jul 2025

“Spilling the Tea” by Zvi

The Tea app is or at least was on fire, rapidly gaining lots of users. This opens up two discussions, one on the game theory and dynamics of Tea, one on its abysmal security.

It's a little too on t…

00:22:46 | Tue 29 Jul 2025

“I wrote a song parody” by CronoDAS

Since Tom Lehrer passed away recently, I thought I'd honor him by adapting one of his songs to be about a more recent existential risk. Presenting...

"We Will All Go Together When We Go (ASI versio…

00:02:50 | Tue 29 Jul 2025

“Low P(x-risk) as the Bailey for Low P(doom)” by Vladimir_Nesov

Nick Bostrom defines existential risk as

Existential risk – One where an adverse outcome would either annihilate Earth-originating intelligent life or permanently and drastically curtail its poten…

00:04:38 | Tue 29 Jul 2025

“Low P(x-risk) as the Bailey for Low P(doom)” by Vladimir_Nesov

Nick Bostrom defines existential risk as

Existential risk – One where an adverse outcome would either annihilate Earth-originating intelligent life or permanently and drastically curtail its poten…

00:04:39 | Tue 29 Jul 2025

“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska

FutureHouse is a company that builds literature research agents. They tested it on the bio + chem subset of HLE questions, then noticed errors in them.

The post's first paragraph:

Humanity's Last Ex…

00:06:41 | Tue 29 Jul 2025

“Procrastination Drill” by silentbob

Have you ever been endlessly procrastinating on some task, but then once you eventually, finally do it, you realize that it is not half as bad as you thought?

Somehow, many of us keep overestimating…

00:05:02 | Tue 29 Jul 2025

“Teaching kids to swim” by Steven Byrnes

Both my kids can swim! Yay! 🥂🍾 Some notes about the process:

The options were group lessons, individual lessons, and parent-is-the-teacher. We have tried all three. Individual lessons were logistic…

00:04:59 | Tue 29 Jul 2025

“Recursions on LessOnline 2025” by Error

Meta: Last year I wrote a retrospective of the first LessOnline. This year, several people told me that they’d read it and found it helpful/interesting/entertaining, so I figure I’ll do it again. It…

00:33:02 | Tue 29 Jul 2025

“Simplex Progress Report - July 2025” by Adam Shai, Paul Riechers, hrbigelow, Eric Alt, mntss

Thanks to Jasmina Urdshals, Xavier Poncini, and Justis Mills for comments.

Introduction

At Simplex our mission is to develop a principled science of the representations and emergent behaviors of AI…

00:32:58 | Tue 29 Jul 2025

“Optimally Combining Probe Monitors and Black Box Monitors” by Tim Hua, jamesbaskerville, BionicD0LPH1N, Mia Hopman, Aryan Bhatt, Tyler Tracy

Link to our arXiv paper: https://arxiv.org/abs/2507.15886

TL, DR;: We study how to efficiently combine multiple monitors with different performance and cost profiles into a single protocol. When t…

00:13:21 | Mon 28 Jul 2025

AI companions, other forms of personalized AI content and persuasion and related issues continue to be a hot topic. What do people use companions for? Are we headed for a goonpocalypse? Mostly no, co…

00:28:42 | Mon 28 Jul 2025

“This Is Not Life” by samhealy

A science-fiction short story exploring how far AI capitalists might go in their quest for (the illusion of) success. Wildly speculative, of course.

Thank you, but if you don’t mind, I’d prefer …

00:44:23 | Mon 28 Jul 2025

Disclaimer: The podcast and artwork embedded on this page are the property of LessWrong ([email protected]). This content is not affiliated with or endorsed by eachpod.com.

EachPod

EachPod

LessWrong (30+ Karma)

“Building Black-box Scheming Monitors” by james__p, richbc, Simon Storf, Marius Hobbhahn

“Follow-up to ‘My Empathy Is Rarely Kind’” by johnswentworth

“I am worried about near-term non-LLM AI developments” by testingthewaters

“Childhood and Education: College Admissions” by Zvi

“Optimizing The Final Output Can Obfuscate CoT (Research Note)” by lukemarks, jacob_drori, cloud, TurnTrout

“China proposes new global AI cooperation organisation” by Matrice Jacobine

“My Empathy Is Rarely Kind” by johnswentworth

“The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)” by GideonF

“Spilling the Tea” by Zvi

“I wrote a song parody” by CronoDAS

“Low P(x-risk) as the Bailey for Low P(doom)” by Vladimir_Nesov

“Low P(x-risk) as the Bailey for Low P(doom)” by Vladimir_Nesov

“About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong” by bohaska

“Procrastination Drill” by silentbob

“Teaching kids to swim” by Steven Byrnes

“Recursions on LessOnline 2025” by Error

“Simplex Progress Report - July 2025” by Adam Shai, Paul Riechers, hrbigelow, Eric Alt, mntss

“Optimally Combining Probe Monitors and Black Box Monitors” by Tim Hua, jamesbaskerville, BionicD0LPH1N, Mia Hopman, Aryan Bhatt, Tyler Tracy

“AI Companion Piece” by Zvi

“This Is Not Life” by samhealy