How Claude's computer use works, and where OpenAI, Anthropic, and Google each have a lead on one another.
Original post: https://www.interconnects.ai/p/claudes-agency
Chapters
00:00 Claude's agentic future and the current state of the frontier models
04:43 The state of the frontier models
04:49 1. Anthropic has the best model we are accustomed to using
05:27 2. Google has the best small & cheap model for building automation and basic AI engineering
08:07 3. OpenAI has the best model for reasoning, but we don’t know how to use it
09:12 4. All of the laboratories have much larger models they’re figuring out how to release (and use)
10:42 Who wins?
Figures
Fig 1, Sonnet New Benchmarks: https://substackcdn.com/image/fetch/w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2e63ff-ac9f-4f8e-9749-9ef2b9b25b6c_1290x1290.png
Fig 2, Sonnet Old Benchmarks: https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bccbd4d-f1c8-4a38-a474-69a3df8a4448_2048x1763.png
Get Interconnects (https://www.interconnects.ai/)...
... on YouTube: https://www.youtube.com/@interconnects
... on Twitter: https://x.com/interconnectsai
... on LinkedIn: https://www.linkedin.com/company/interconnects-ai
... on Spotify: https://open.spotify.com/show/2UE6s7wZC4kiXYOnWRuxGv
... on Apple Podcasts: https://podcasts.apple.com/us/podcast/interconnects/id1719552353