The Information has published two new articles with insider information on OpenAI's next models and moves.
They are paywalled, but here are the new bits of information:
- Strawberry is more expensive and slower at inference time, but can solve complex problems on the first try without hallucinations. It seems to be an application or extension of process supervision.
- Its main purpose is to produce synthetic data for Orion, their next big LLM
- But now they are also pushing to get a distillation of Strawberry into ChatGPT as soon as this fall
- They showed it to feds
Some excerpts about these:
Plus this summer, his team demonstrated the technology [Strawberry] to American national security officials, said a person with direct knowledge of those meetings, which haven't previously been reported.
One of the most important applications of Strawberry is to generate high-quality training data for Orion, OpenAI's next flagship large [...]
---
First published: August 27th, 2024
Source: https://www.lesswrong.com/posts/8oX4FTRa8MJodArhj/the-information-openai-shows-strawberry-to-feds-races-to
---
Narrated by TYPE III AUDIO.