📜
GPT-4o System CardThis technical document is the System Card for OpenAI's GPT-4o, a multimodal, autoregressive language model that can process and generate text, audio, images, and video. The card provides a detailed overview of the model's capabilities, limitations, and safety evaluations across various categories, with a particular focus on its speech-to-speech (voice) capabilities. The card details the model's training data, including web data, code and math, and multimodal data. It also covers OpenAI's risk identification, assessment, and mitigation strategies, including red teaming, evaluation methodologies, and observed safety challenges. The document examines the potential societal impacts of the model, including anthropomorphization and emotional reliance, health applications, and scientific capabilities. Finally, the card concludes with a discussion of the next steps for research and development in omni models.
📎
Link to paper