“On DeepSeek’s r1” by Zvi

68 min • 22 January 2025

r1 from DeepSeek is here, the first serious challenge to OpenAI's o1.

r1 is an open model, and it comes in dramatically cheaper than o1.

People are very excited. Normally cost is not a big deal, but o1 and its inference-time compute strategy are the exception. Here, cheaper really can mean better, even if the answers aren’t quite as good.

You can get DeepSeek-r1 on HuggingFace here, and they link to the paper.

The question is how to think about r1 as it compares to o1, and also to o1 Pro and to the future o3-mini that we’ll get in a few weeks, and then to o3 which we’ll likely get in a month or two.

Taking into account everything I’ve seen, r1 is still a notch below o1 in terms of quality of output, and further behind o1 Pro and the future o3-mini [...]

---

Outline:

(01:43) Part 1: RTFP: Read the Paper

(03:38) How Did They Do It

(06:19) The Aha Moment

(08:27) Benchmarks

(09:46) Reports of Failure

(11:11) Part 2: Capabilities Analysis

(11:16) Our Price Cheap

(15:44) Other People's Benchmarks

(18:20) r1 Makes Traditional Silly Mistakes

(23:11) The Overall Vibes