Introduction to Mechanistic Interpretability

12 min • 4 January 2025

This introduction covers common mechanistic interpretability (mech interp) concepts to prepare you for the rest of this session's resources.

Original text: https://aisafetyfundamentals.com/blog/introduction-to-mechanistic-interpretability/

Author(s): Sarah Hastings-Woodhouse

A podcast by BlueDot Impact.

Learn more on the AI Safety Fundamentals website.
