Sveriges mest populära poddar

LessWrong posts by zvi

“AI #112: Release the Everything” by Zvi

83 min • 17 april 2025
OpenAI has upgraded its entire suite of models. By all reports, they are back in the game for more than images. GPT-4.1 and especially GPT-4.1-mini are their new API non-reasoning models. All reports are that GPT-4.1-mini especially is very good. o3 is the new top of the line ChatGPT reasoning model, with o3-pro coming in a few weeks. Reports are that it too looks very good, even without us yet taking much advantage of its tool usage. If you have access, check it out. Full coverage is coming soon. There's also o4-mini and o4-mini-high. Oh, they also made ChatGPT memory cover all your conversations, if you opt in, and gave us a version of Claude Code called Codex. And an update to their preparedness framework that I haven’t had time to examine yet. Anthropic gave us (read-only for now) Google integration (as in GMail and Calendar to complement Drive), and [...]

---

Outline:

(01:32) Language Models Offer Mundane Utility

(03:08) Language Models Don't Offer Mundane Utility

(06:07) Huh, Upgrades

(09:57) On Your Marks

(12:42) Research Quickly, There's No Time

(15:16) Choose Your Fighter

(16:07) Deepfaketown and Botpocalypse Soon

(16:30) The Art of the Jailbreak

(16:44) Get Involved

(18:46) Introducing

(21:21) In Other AI News

(23:16) Come on OpenAI, Again?

(25:58) Show Me the Money

(26:41) In Memory Of

(28:28) Quiet Speculations

(32:17) America Restricts H20 Sales

(39:04) House Select Committee Report on DeepSeek

(48:46) Tariff Policy Continues To Be How America Loses

(58:53) The Quest for Sane Regulations

(01:01:03) The Week in Audio

(01:06:13) Rhetorical Innovation

(01:10:39) Aligning a Smarter Than Human Intelligence is Difficult

(01:14:40) AI 2027

(01:15:13) People Are Worried About AI Killing Everyone

(01:19:54) The Lighter Side

---

First published:
April 17th, 2025

Source:
https://www.lesswrong.com/posts/nycc4QxQAMkzmXmfz/ai-112-release-the-everything

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Single dandelion growing through concrete under gloomy sky, with text.
Google Cloud logo and partner company logos contributing to Agent2Agent protocol.
XKCD comic strip about experts overestimating public knowledge of epistemology.
Correlation matrix showing relationships between TQA, DQA, RepEng, and AmongUs metrics.
LinkedIn profile experience section showing OpenAI and Meta AI positions.
Flow diagram showing three steps: Retrieval, Prompt Augmentation, and Generation process.
Advertisement concept comparing Netflix's Black Mirror and White Mirror series, featuring glowing circular devices.
News program
Graph showing
Cartoon tabby cat sitting on laptop with bright green eyes
Scatter plot comparing Detection vs Deception ELO scores for AI language models.
Large wave with
Scatter plot comparing win rates and ELO scores of AI reasoning models.

The plot shows multiple language models plotted by their performance metrics in the game
Two graphs comparing GPU export controls across different bandwidth metrics (2022/2025).

This shows scatter plots measuring computational performance (FLOP/s) against interconnect bandwidth, highlighting changes in restrictions for various GPU categories (Gaming GPU, H800, H100, A800, A100, H20) between October 2022 and January 2025 export controls.
News article screenshot. The headline reads:
Markets & Mayhem tweets:
Peter Wildeford tweets:
La Main de la Mort tweets:
Cartoon comparing laptop manufacturing: U.S. with 145% tariffs versus China with 0%

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

00:00 -00:00