Anthropic researchers find that AI models can be trained to deceive
Author
TechCrunch
Published
Tue 16 Jan 2024
Episode Link
None
Most humans learn the skill of deceiving other humans. So can AI models learn the same? Yes, the answer seems — and terrifyingly, they’re exceptionally good at it. Learn more about your ad choices. Visit podcastchoices.com/adchoices