Dreamer V3: A General-Purpose World-Model for Reinforcement Learning

Author: Mike Breault
Published: Sun 06 Apr 2025
Episode Link: None

In this Science Corner episode of The Deep Dive, we unpack Dreamer V3—the single, fixed-hyperparameter agent designed to excel across diverse tasks. We break down the world-model (RSSM), the actor-critic trio, and key innovations like simlog, KL balancing with free bits, return normalization, and the C-MEXP two-hot critic. Join us as we explore why these ideas matter for generality, stability, and the path toward more capable AI.

Note: This podcast was AI-generated, and sometimes AI can make mistakes. Please double-check any critical information.

EachPod

EachPod

Dreamer V3: A General-Purpose World-Model for Reinforcement Learning