Sveriges mest populära poddar

Agentic Horizons

The Art of Storytelling: Dynamic Multimodal Narratives

7 min • 28 november 2024

This episode explores the use of AI for children's storytelling, featuring a system that generates multimodal stories with text, audio, and video. The episode discusses the multi-agent architecture behind the system, where AI models like large language models, text-to-speech, and text-to-video work together. Key roles include the Writer, Reviewer, Narrator, Film Director, and Animator.


The episode highlights how storytelling frameworks guide the AI’s creative process, evaluates the quality of the generated content, and addresses ethical concerns, especially around content moderation. It concludes with a look at future possibilities, like user interaction and incorporating user-drawn images. This episode is ideal for parents, educators, and AI enthusiasts.


https://arxiv.org/pdf/2409.11261

Kategorier
Förekommer på
00:00 -00:00