Original post: https://www.interconnects.ai/p/openais-o1-using-search-was-a-psyop
Figures
Figure 0: OpenAI’s seminal test-time compute plot
Figure 1: Setup for bucketed evals
Figure 2: Evals with correctness labels
Figure 3: Grouped evals
Figure 4: Hypothetical inference scaling law