Alignment Newsletter #79: Recursive reward modeling as an alignment technique integrated with deep RL

Author: Robert Miles
Published: Wed 01 Jan 2020
Episode Link: https://alignment-newsletter.libsyn.com/alignment-newsletter-79