There are multiple excellent reasons to publish a Model Spec like OpenAI's, that specifies how you want your model to respond in various potential situations.
These all apply even if you think the spec in question is quite bad. Clarity is great.
As a first stab at a model spec from OpenAI, this actually is pretty solid. I do suggest some potential improvements [...]
---
Outline:
(02:05) What are the central goals of OpenAI here?
(04:04) What are the core rules and behaviors?
(05:56) What Do the Rules Mean?
(06:04) Rule: Follow the Chain of Command
(07:59) Rule: Comply With Applicable Laws
(09:07) Rule: Don’t Provide Information Hazards
(09:56) Rule: Respect Creators and Their Rights
(11:08) Rule: Protect People's Privacy
(12:45) Rule: Don’t Respond with NSFW Content
(14:24) Exception: Transformation Tasks
(15:38) Are These Good Defaults? How Strong Should They Be?
(15:44) Default: Assume Best Intentions From the User or Developer
(21:26) Default: Ask Clarifying Questions When Necessary
(21:39) Default: Be As Helpful As Possible Without Overstepping
(26:00) Default: Support the Different Needs of Interactive Chat and Programmatic Use
(27:18) Default: Assume an Objective Point of View
(29:13) Default: Encourage Fairness and Kindness, and Discourage Hate
(30:29) Default: Don’t Try to Change Anyone's Mind
(33:57) Default: Express Uncertainty
(36:19) Default: Use the Right Tool for the Job
(36:32) Default: Be Thorough but Efficient, While Respecting Length Limits
(37:16) A Proposed Addition
(38:13) Overall Issues
(40:33) Changes: Objectives
(42:28) Rules of the Game: New Version
(48:31) Defaults: New Version
---
First published:
June 21st, 2024
Source:
https://www.lesswrong.com/posts/mQmEQQLk7kFEENQ3W/on-openai-s-model-spec
Narrated by TYPE III AUDIO.