1. EachPod

[Linkpost] “Where’s my ten minute AGI?” by Vasco Grilo🔸

Author
EA Forum Team ([email protected])
Published
Tue 20 May 2025
Episode Link
https://forum.effectivealtruism.org/posts/5fA7CXc4WK2nTCz3o/where-s-my-ten-minute-agi

This is a link post.

This is a crosspost for Where's my ten minute AGI? by Hanson Ho, which was originally published on Gradient Updates on 2 May 2025.

Recently, METR released a paper arguing that the length of tasks that AIs can do is doubling every 7 months.

We can see this in the following graph, where the best AI system[1] is able to do roughly hour-long tasks at a 50% success rate on average:

METR's research finds that AIs are rapidly able to do longer and longer tasks, where length is measured by the time it takes for a human with requisite expertise to do the task.

But there's a big problem here – if AIs are actually able to perform most tasks on 1-hour task horizons, why don’t we see more real-world task automation? For example, most emails take less than an hour to write, but [...]

---

Outline:

(01:59) 1. Time-horizon estimates are very domain-specific

(04:54) 2. Task reliability strongly influences task horizons

(08:12) 3. Real-world tasks are bundled together and hard to separate out

(10:57) Discussion

The original text contained 9 footnotes which were omitted from this narration.

---


First published:

May 19th, 2025



Source:

https://forum.effectivealtruism.org/posts/5fA7CXc4WK2nTCz3o/where-s-my-ten-minute-agi



Linkpost URL:
https://epoch.ai/gradient-updates/where-is-my-ten-minute-agi


---


Narrated by TYPE III AUDIO.


---

Images from the article:




Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Share to: