The biggest event of the week was the Sleeper Agents paper from Anthropic. I expect that to inform our thoughts for a while to come, and to lay foundation for additional work. We also had the first third of the IMO solved at almost gold metal level by DeepMind, discovering that math competition geometry is actually mostly composed of One Weird Trick. I knew that at the time I was doing it, though, and it was still really hard.
As usual, there was also a bunch of other stuff.
Tomorrow the 19th, I am going to be off to San Francisco for the weekend to attend a workshop. That leaves a lot of time for other events and seeing other people, a lot of which remains unfilled. So if you are interested in meeting up or want to invite me to a gathering, especially on Sunday the [...]
---
Outline:
(00:53) Language Models Offer Mundane Utility
(02:01) Language Models Don’t Offer Mundane Utility
(05:49) GPT-4 Real This Time
(07:05) Fun with Image Generation
(08:02) Copyright Confrontation
(09:43) Deepfaketown and Botpocalypse Soon
(11:31) They Took Our Jobs
(12:16) Get Involved
(13:11) Introducing
(17:36) In Other AI News
(22:48) Quiet Speculations
(25:30) The Quest for Sane Regulations
(31:50) The Week in Audio with Sam Altman
(35:03) David Brin Podcast
(45:46) Rhetorical Innovation
(48:59) Anthropic Paper on Sleeper Agents
(50:45) Anthropic Introduces Impossible Mission Force
(53:57) Aligning a Smarter Than Human Intelligence is Difficult
(58:57) The Belrose Model Continued
(01:23:46) Open Model Weights Are Unsafe And Nothing Can Fix This
(01:28:26) People Are Worried About AI Killing Everyone
(01:28:42) Other People Are Not As Worried About AI Killing Everyone
(01:31:15) The Lighter Side
---
First published:
January 18th, 2024
Source:
https://www.lesswrong.com/posts/WRGmBE3h4WjA5EC5a/ai-48-exponentials-in-geometry
Narrated by TYPE III AUDIO.