Audio narrations of LessWrong posts.
As I think about "what to do about AI x-risk?", here are some principles that seem useful to me:
This is a personal post and does not necessarily reflect the opinion of other members of Apollo Research. I think I could have written a better version of this post with more time. However, my main …
A common claim is that concern about [X] ‘distracts’ from concern about [Y]. This is often used as an attack to cause people to discard [X] concerns, on pain of being enemies of [Y] concerns, as att…
Enjoy it while it lasts. The Claude 4 era, or the o4 era, or both, are coming soon. Also, welcome to 2025: we measure eras in weeks or at most months. For now, the central thing going on continues …
What in retrospect seem like serious moral crimes were often widely accepted while they were happening. This means that moral progress can require intellectual progress.[1] Intellectual progress oft…
Something's changed about reward hacking in recent systems. In the past, reward hacks were usually accidents, found by non-general, RL-trained systems. Models would randomly explore different behavio…
We've published an essay series on what we call the intelligence curse. Most content is brand new, and all previous writing has been heavily reworked.
Visit intelligence-curse.ai for the full series…
In this post, we study whether we can modify an LLM's beliefs and investigate whether doing so could decrease risk from advanced AI systems.
We describe a pipeline for modifying …
Every now and then, some AI luminaries …
I’ve read at least a few hundred blog posts, maybe upwards of a thousand. Agreeing with Gavin Leech, I believe I’ve gained from essays more than any other medium. I’m an intellec…
Converting to a for-profit model would undermine the company's founding mission to ensure AGI "benefits all of humanity," argues new letter
This is the full text of a post from Obsolete, a Substack …