Audio narrations of LessWrong posts.
So the situation as it stands is that the fraction of the light cone expected to be filled with satisfied cats is not zero. This is already remarkable. What's more remarkable is …
In 2021, a circle of researchers left OpenAI, after a bitter dispute with their executives. They started a competing company, Anthropic, stating that they wanted to put safety first. The safety commu…
While I'm intrigued by the idea of acausal trading, I confess that so far I fail to see how it makes sense in practice. Here I share my (unpolished) musings, in the hopes that someone can point me …
(Cross-posted from speaker's notes of my talk at DeepMind today.)
Good local time, everyone. I am Audrey Tang, 🇹🇼 Taiwan's Cyber Ambassador and first Digital Minister (2016-2024). It is an honor to …
Epistemic status: Philosophical argument. I'm critiquing Hinton's maternal instinct metaphor and proposing relationship-building as a better framework for thinking about alignment. This is about shi…
Epistemic status: I think you should interpret this as roughly something like “GenAI is not so powerful that it shows up in the most obvious way of analyzing the data, but maybe if someone did a mor…
(Crossposted from my Substack: https://taylorgordonlunt.substack.com/p/my-ai-predictions-for-2027)
I think a lot of blogging is reactive. You read other people's blogs and you're like, no, that's …
Audio note: this article contains 42 uses of LaTeX notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.
This post is an attempt to…
In 1787, Catherine the Great sailed down the Dnieper to inspect its banks. Her trusted advisor, Governor Potemkin, set out to present those war-torn lands to her in the best poss…
[CW: Partial nudity]
I'm a straight man. If you're a straight man who befriends other straight men, you will occasionally have conversations that sound like this:
I often see people treating defensiveness as proof of guilt. The thought seems to go that if someone is defensive, it's because they know they’ve done something wrong. There are even proverbs around…
I got this crazy idea; I wonder if anyone could try it. Let's make an online encyclopedia, similar to Wikipedia, with one difference: all articles would be edited by AIs.
Why? (I mean, other than "b…
PauseAI organised an open letter from UK lawmakers and civil society organisations to Demis Hassabis, CEO of Google DeepMind. PauseAI UK members emailed their MPs asking them to …
Last month we held a workshop on Post-AGI outcomes. This post is a list of all the talks, with short summaries, as well as my personal takeaways.
The first keynote was @Joe Carlsmith on “Can Goodne…
Introduction
I’m excited by deception probes. When I mention this, I’m sometimes asked “Do deception probes work?”
But I think there are many applications of deception probes, and each application w…
Once again we’ve reached the point where the weekly update needs to be split in two. Thus, the alignment and policy coverage will happen tomorrow. Today covers the rest.
The secret big announcement…
I am a knowledge worker. Over the course of my life I've felt insecure about not knowing more than I already do. I took a general cognitive ability test that placed me in the 98th percentile of the p…
We released our first Safety Report with AI misbehaviour in the wild.
I think Andon Labs' AI vending machines provide a unique opportunity to study AI safety on real-life data, …
This is my first post, so forgive me for it being a bit of a carelessly referenced, informal ramble. Feedback is appreciated.
As I understand it, Yudkowsky contends that there exists an id…