It was all quiet. Then it wasn’t.
Note the timestamps on both of these.
Dwarkesh Patel did a podcast with Mark Zuckerberg on the 18th. It was timed to coincide with the release of much of Llama-3, very much in keeping with the approach of telling your story directly. Dwarkesh is now the true tech media. A meteoric rise, and well earned.
This is two related posts in one. First I cover the podcast, then I cover Llama-3 itself.
My notes are edited to incorporate context from later explorations of Llama-3, as I judged that the readability benefits exceeded the purity costs.
Podcast Notes: Llama-3 Capabilities
---
Outline:
(00:51) Podcast Notes: Llama-3 Capabilities
(03:09) The Need for Inference
(07:08) Great Expectations
(11:29) Open Source and Existential and Other Risks
(30:50) Interview Overview
(33:22) A Few Reactions
(47:53) Safety First
(54:15) Core Capability Claims
(56:11) How Good are the 8B and 70B Models in Practice?
(01:02:31) Architecture and Data
(01:05:08) Training Day
(01:09:17) What Happens Next With Meta's Products?
(01:12:24) What Happens Next With AI Thanks To These Two Models?
(01:14:04) The Bigger One: It's Coming
(01:14:59) Who Wins?
(01:17:21) Who Loses?
(01:21:49) How Unsafe Will It Be to Release Llama-3 400B?
(01:24:12) The Efficient Market Hypothesis is False
(01:27:09) What Next?
---
First published: April 22nd, 2024
Narrated by TYPE III AUDIO.