There is much talk about so-called Responsible Scaling Policies, meaning what we will do so that what we are doing can be considered responsible. Would adopting them also result in actually responsible scaling? They would help, but by themselves, in their current versions, no. The good scenario is that these policies are good starts that lay groundwork and build momentum to get where we need to go. The bad scenario is that this becomes safetywashing: a justification for rapid and dangerous scaling of frontier models, and a label that avoids any actual action or responsibility.
Others think it would be better if we flat out stopped. So they say so. And they protest. And they point out that the public is mostly with them, even as those trying to play the part of Very Serious People say such talk is irresponsible.
Future persuasion will be better. Sam [...]
---
Outline:
(01:31) Language Models Offer Mundane Utility
(01:46) Language Models Don’t Offer Mundane Utility
(02:32) GPT-4 Real This Time
(03:51) A Proposed Bet
(04:38) Fun with Image Generation
(05:54) Deepfaketown and Botpocalypse Soon
(08:26) They Took Our Jobs
(10:38) Get Involved
(11:16) Introducing
(16:19) In Other AI News
(17:57) Quiet Speculations
(21:13) The Quest for Sane Regulations
(26:29) The Week in Audio
(27:08) Rhetorical Innovation
(42:45) Friendship is Optimal
(45:12) Honesty As the Best Policy
(52:43) Aligning a Smarter Than Human Intelligence is Difficult
(54:17) Aligning a Dumber Than Human Intelligence Is Also Difficult
(01:02:49) Humans Do Not Expect to Be Persuaded by Superhuman Persuasion
(01:07:36) DeepMind's Evaluation Paper
(01:17:23) Bengio Offers Letter and Proposes a Synthesis
(01:20:54) Matt Yglesias Responds To Marc Andreessen's Manifesto
(01:25:33) People Are Worried About AI Killing Everyone
(01:31:09) Someone Is Worried AI Alignment Is Going Too Fast
(01:36:14) Please Speak Directly Into This Microphone
(01:37:46) The Lighter Side
---
First published: October 26th, 2023
Source: https://www.lesswrong.com/posts/aQ6LDhc2zxrYXFjEF/ai-35-responsible-scaling-policies
Narrated by TYPE III AUDIO.