🤖Dev and Doc👨🏻⚕️ introduces large multimodal models. ✨ The potential of LMMs combining text and images seems limitless, but what's the catch?
Dev and Doc is a podcast where developers and doctors join forces to dive deep into AI in healthcare. Together, we can build models that matter.
👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/
🤖Dev - Zeljko Kraljevic - https://twitter.com/zeljkokr
00:00 Start
00:32 Intro
02:20 What is multimodality, and what is its potential?
09:43 Large multimodal models paper deep dive 1 (radiology)
18:43 Paper deep dive 2 (pathology)
20:40 Large multimodal models technical overview and exploration of other LMMs
31:40 Foundation models explained
35:18 The model transparency index
36:20 Google PaLI-3: lightweight models vs. large foundation models
43:04 Summary
44:15 Problems and work to be done for LMMs: hallucinations, inconsistencies, biases, security
49:20 A call for better evidence generation and trials with LMMs
53:00 Final points: improving visuospatial recognition, thoughts for the future
The podcast 🎙️
🔊Spotify: https://open.spotify.com/show/3QO5Lr3w4Rd6lqwlfKDaB7?si=e7915d844994403e
📙Substack: https://aiforhealthcare.substack.com/
🎞️ Editor -
Dragan Kraljević https://www.instagram.com/dragan_kraljevic/
🎨Brand design and art direction -
Ana Grigorovici https://www.behance.net/anagrigorovici027d