AI Safety Fundamentals: Governance

147 episodes • Length: 25 min • Weekly: Saturday

About the podcast

Listen to resources from the AI Safety Fundamentals: Governance course! https://aisafetyfundamentals.com/governance

The podcast AI Safety Fundamentals: Governance is created by BlueDot Impact. The podcast and its artwork are embedded on this page using the public podcast feed (RSS).

Episodes

Deceptively Aligned Mesa-Optimizers: It’s Not Funny if I Have to Explain It

4 January 2025 | 27 min

Learning From Human Preferences

4 January 2025 | 7 min

Where I Agree and Disagree with Eliezer

4 January 2025 | 43 min

Thought Experiments Provide a Third Anchor

4 January 2025 | 8 min

Future ML Systems Will Be Qualitatively Different

4 January 2025 | 13 min

Why AI Alignment Could Be Hard With Modern Deep Learning

4 January 2025 | 29 min

Acquisition of Chess Knowledge in AlphaZero

4 January 2025 | 22 min

Four Background Claims

4 January 2025 | 15 min

Understanding Intermediate Layers Using Linear Classifier Probes

4 January 2025 | 17 min

Feature Visualization

4 January 2025 | 32 min

Embedded Agents

4 January 2025 | 18 min

Logical Induction (Blog Post)

4 January 2025 | 12 min

Cooperation, Conflict, and Transformative Artificial Intelligence: Sections 1 & 2 — Introduction, Strategy and Governance

4 January 2025 | 28 min

Superintelligence: Instrumental Convergence

4 January 2025 | 18 min

Takeaways From Our Robust Injury Classifier Project [Redwood Research]

4 January 2025 | 12 min

The Alignment Problem From a Deep Learning Perspective

4 January 2025 | 34 min

High-Stakes Alignment via Adversarial Training [Redwood Research Report]

4 January 2025 | 19 min

A Short Introduction to Machine Learning

4 January 2025 | 18 min

Introduction to Logical Decision Theory for Computer Scientists

4 January 2025 | 14 min

Yudkowsky Contra Christiano on AI Takeoff Speeds

4 January 2025 | 62 min