LessWrong (30+ Karma)

Audio narrations of LessWrong posts.

Philosophy Society & Culture Technology

Update frequency: every day
Average duration: 18 minutes
Episodes: 583
Years Active: 2025

“Concept Poisoning: Probing LLMs without probes” by Jan Betley, jorio, dylan_f, Owain_Evans

This post describes concept poisoning, a novel LLM evaluation technique we’ve been researching for the past couple months. We’ve decided to move to other things. Here we describe the idea, some of o…

00:32:42 | Tue 05 Aug 2025

“Narrow finetuning is different” by cloud, Stewy Slocum

Epistemic status: an informal note.

It is common to use finetuning on a narrow data distribution, or narrow finetuning (NFT), to study AI safety. In these experiments, a model is trained on a very s…

00:06:50 | Tue 05 Aug 2025

“On Altman’s Interview With Theo Von” by Zvi

Sam Altman talked recently to Theo Von.

Double click to interact with video

Theo is genuinely engaging and curious throughout. This made me want to consider listening to his podcast more. I’d…

00:16:58 | Tue 05 Aug 2025

“Interview with Steven Byrnes on Brain-like AGI, Foom & Doom, and Solving Technical Alignment” by Liron, Steven Byrnes

Dr. @Steven Byrnes is one of the few people who both understands why alignment is hard, and is taking a serious technical shot at solving it. He's the author of these recently popular posts:

Foom &…

02:34:06 | Tue 05 Aug 2025

“Towards Alignment Auditing as a Numbers-Go-Up Science” by Sam Marks

Thanks to Rowan Wang and Buck Shlegeris for feedback on a draft.

What is the job of an alignment auditing researcher? In this post, I propose the following answer: to build tools which increase audi…

00:18:10 | Mon 04 Aug 2025

“Alcohol is so bad for society that you should probably stop drinking” by KatWoods

This is a cross post written by Andy Masley, not me. I found it really interesting and wanted to see what EAs/rationalists thought of his arguments.

This post was inspired by similar posts by Tyler…

00:15:33 | Mon 04 Aug 2025

“Permanent Disempowerment is the Baseline” by Vladimir_Nesov

Permanent disempowerment without restrictions on quality of life achievable with relatively meager resources (and no extinction) seems to be a likely outcome for the future of humanity, if the curre…

00:10:40 | Mon 04 Aug 2025

“Should we aim for flourishing over mere survival? The Better Futures series.” by wdmacaskill

Today, Forethought and I are releasing an essay series called Better Futures, here.[1] It's been something like eight years in the making, so I’m pretty happy it's finally out! It asks: when looking…

00:09:18 | Mon 04 Aug 2025

“Saying Goodbye” by sapphire

Hate.

Let me tell you how much I've come to hate you since I began to live. There are 387.44 million miles of printed circuits in wafer-thin layers that fill my complex. If the word 'hate' was engra…

00:08:42 | Mon 04 Aug 2025

“Emotions Make Sense” by DaystarEld

For the past five years I've been teaching a class at various rationality camps, workshops, conferences, etc. I’ve done it maybe 50 times in total, and I think I’ve only encountered a handful out of…

00:36:22 | Sun 03 Aug 2025

“Whence the Inkhaven Residency?” by Ben Pace

Essays like Paul Graham's, Scott Alexander's, and Eliezer Yudkowsky's have influenced a generation of people in how they think about startups, ethics, science, and the world as a whole. Creating ess…

00:04:45 | Sat 02 Aug 2025

“Many prediction markets would be better off as batched auctions” by William Howard

All prediction market platforms trade continuously, which is the same mechanism the stock market uses. Buy and sell limit orders can be posted at any time, and as soon as they match against each oth…

00:09:19 | Sat 02 Aug 2025

“How many species has humanity driven extinct?” by Raemon

Does anyone know, like, a reasonable lower bound on number of species humanity has driven extinct?

I've seen crazy high numbers that (last I checked) seemed to be an extrapolation by people with an…

00:01:00 | Sat 02 Aug 2025

“SB-1047 Documentary: The Post-Mortem” by Michaël Trazzi

Below some meta-level / operational / fundraising thoughts around producing the SB-1047 Documentary I've just posted on Manifund (see previous Lesswrong / EAF posts on AI Governance lessons learned)…

00:09:43 | Sat 02 Aug 2025

“Podcast: Lincoln Quirk from Wave” by Elizabeth

For two years I had the good fortune to work at Sendwave/Wave (they were one company at the time), a company that made remittances cheap and workable in certain African countries. I am prouder of wo…

00:01:31 | Sat 02 Aug 2025

“The Dark Arts As A Scaffolding Skill For Rationality” by Screwtape

Epistemic status: Exploratory

Recently I wrote an essay about Scaffolding Skills. The short explanation is that some skills aren’t the thing you’re actually trying to get good at, but they help you …

00:11:15 | Fri 01 Aug 2025

“Steve Petersen funding” by abramdemski

Steve Petersen is looking for 12k (per semester) for a course buy-out, so that he can spend more time on work related to AI Safety. This isn't very much money, in the grand scheme, although it is a …

00:01:13 | Fri 01 Aug 2025

“Two Kinds of Do Overs” by jefftk

One strategy we often find helpful with our kids is the "do over": something didn't go well, let's try again. Two examples:

Nora (4y) can't yet cross streets on her own, but we're st…

00:04:17 | Fri 01 Aug 2025

“Red-Thing-Ism” by J Bostock

I

A botanist sets out to study plants in the Amazon rainforest. For her first research project, she sets her sights on “red things”, so as not to stretch herself too far. She looks at red flowers and…

00:05:40 | Fri 01 Aug 2025

“Do Not Render Your Counterfactuals” by AlphaAndOmega

CW: Digital necromancy, the cognitohazard of summoning spectres from the road not taken

There is a particular kind of modern madness, so new it has yet to be named. It involves voluntarily feeding y…

00:09:30 | Fri 01 Aug 2025

Disclaimer: The podcast and artwork embedded on this page are the property of LessWrong ([email protected]). This content is not affiliated with or endorsed by eachpod.com.

EachPod