Podcast: The Gradient: Perspectives on AI

Jacob Andreas: Language, Grounding, and World Models

10 October 2024 | 113 min

Episode 140

I spoke with Professor Jacob Andreas about:

* Language and the world

* World models

* How he’s developed as a scientist

Enjoy!

Jacob is an associate professor at MIT in the Department of Electrical Engineering and Computer Science as well as the Computer Science and Artificial Intelligence Laboratory. His research aims to understand the computational foundations of language learning, and to build intelligent systems that can learn from human guidance. Jacob earned his Ph.D. from UC Berkeley, his M.Phil. from Cambridge (where he studied as a Churchill scholar) and his B.S. from Columbia. He has received a Sloan fellowship, an NSF CAREER award, MIT's Junior Bose and Kolokotrones teaching awards, and paper awards at ACL, ICML and NAACL.

Find me on Twitter for updates on new episodes, and reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:40) Jacob’s relationship with grounding fundamentalism

* (05:21) Jacob’s reaction to LLMs

* (11:24) Grounding language — is there a philosophical problem?

* (15:54) Grounding and language modeling

* (24:00) Analogies between humans and LMs

* (30:46) Grounding language with points and paths in continuous spaces

* (32:00) Neo-Davidsonian formal semantics

* (36:27) Evolving assumptions about structure prediction

* (40:14) Segmentation and event structure

* (42:33) How much do word embeddings encode about syntax?

* (43:10) Jacob’s process for studying scientific questions

* (45:38) Experiments and hypotheses

* (53:01) Calibrating assumptions as a researcher

* (54:08) Flexibility in research

* (56:09) Measuring Compositionality in Representation Learning

* (56:50) Developing an independent research agenda and developing a lab culture

* (1:03:25) Language Models as Agent Models

* (1:04:30) Background

* (1:08:33) Toy experiments and interpretability research

* (1:13:30) Developing effective toy experiments

* (1:15:25) Language Models, World Models, and Human Model-Building

* (1:15:56) OthelloGPT’s bag of heuristics and multiple “world models”

* (1:21:32) What is a world model?

* (1:23:45) The Big Question — from meaning to world models

* (1:28:21) From “meaning” to precise questions about LMs

* (1:32:01) Mechanistic interpretability and reading tea leaves

* (1:35:38) Language and the world

* (1:38:07) Towards better language models

* (1:43:45) Model editing

* (1:45:50) On academia’s role in NLP research

* (1:49:13) On good science

* (1:52:36) Outro

Links:

* Jacob’s homepage and Twitter

* Language Models, World Models, and Human Model-Building

* Papers

* Semantic Parsing as Machine Translation (2013)

* Grounding language with points and paths in continuous spaces (2014)

* How much do word embeddings encode about syntax? (2014)

* Translating neuralese (2017)

* Analogs of linguistic structure in deep representations (2017)

* Learning with latent language (2018)

* Learning from Language (2018)

* Measuring Compositionality in Representation Learning (2019)

* Experience grounds language (2020)

* Language Models as Agent Models (2022)

Get full access to The Gradient at thegradientpub.substack.com/subscribe

Evan Ratliff: Our Future with Voice Agents

26 September 2024 | 80 min

Episode 139

I spoke with Evan Ratliff about:

* Shell Game, Evan’s new podcast, where he creates an AI voice clone of himself and sets it loose.

* The end of the Longform Podcast and his thoughts on the state of journalism.

Enjoy!

Evan is an award-winning investigative journalist, bestselling author, podcast host, and entrepreneur. He’s the author of The Mastermind: A True Story of Murder, Empire, and a New Kind of Crime Lord; the writer and host of the hit podcasts Shell Game and Persona: The French Deception; and the cofounder of The Atavist Magazine, Pop-Up Magazine, and the Longform Podcast. As a writer, he’s a two-time National Magazine Award finalist. As an editor and producer, he’s a two-time Emmy nominee and National Magazine Award winner.

Outline:

* (00:00) Intro

* (01:05) Evan’s ambitious and risky projects

* (04:45) Wearing different personas as a journalist

* (08:31) Boundaries and acceptability in using voice agents

* (11:42) Impacts on other people

* (13:12) “The kids these days” — how will new technologies impact younger people?

* (17:12) Evan’s approach to children’s technology use

* (20:05) Techno-solutionism and improvements in medicine, childcare

* (24:15) Evan’s perspective on simulations of people

* (27:05) On motivations for building tech startups

* (30:42) Evan’s outlook for Shell Game’s impact and motivations for his work

* (36:05) How Evan decided to write for a career

* (40:02) How voice agents might impact our conversations

* (43:52) Evan’s experience with Longform and podcasting

* (47:15) Perspectives on doing good interviews

* (52:11) Mimicking and inspiration, developing style

* (57:15) Writers and their motivations, the state of longform journalism

* (1:06:15) The internet and writing

* (1:09:41) On the ending of Longform

* (1:19:48) Outro

Links:

* Evan’s homepage and Twitter

* Shell Game, Evan’s new podcast

* Longform Podcast

Meredith Ringel Morris: Generative AI's HCI Moment

12 September 2024 | 98 min

Episode 138

I spoke with Meredith Morris about:

* The intersection of AI and HCI and why we need more cross-pollination between AI and adjacent fields

* Disability studies and AI

* Generative ghosts and technological determinism

* Developing a useful definition of AGI

I didn’t get to record an intro for this episode since I’ve been sick.

Enjoy!

Meredith is Director for Human-AI Interaction Research for Google DeepMind and an Affiliate Professor in The Paul G. Allen School of Computer Science & Engineering and in The Information School at the University of Washington, where she participates in the dub research consortium. Her work spans the areas of human-computer interaction (HCI), human-centered AI, human-AI interaction, computer-supported cooperative work (CSCW), social computing, and accessibility. She has been recognized as an ACM Fellow and ACM SIGCHI Academy member for her contributions to HCI.

Outline:

* (00:00) Meredith’s influences and earlier work

* (03:00) Distinctions between AI and HCI

* (05:56) Maturity of fields and cross-disciplinary work

* (09:03) Technology and ends

* (10:37) Unique aspects of Meredith’s research direction

* (12:55) Forms of knowledge production in interdisciplinary work

* (14:08) Disability, Bias, and AI

* (18:32) LaMPost and using LMs for writing

* (20:12) Accessibility approaches for dyslexia

* (22:15) Awareness of AI and perceptions of autonomy

* (24:43) The software model of personhood

* (28:07) Notions of intelligence, normative visions and disability studies

* (32:41) Disability categories and learning systems

* (37:24) Bringing more perspectives into CS research and re-defining what counts as CS research

* (39:36) Training interdisciplinary researchers, blurring boundaries in academia and industry

* (43:25) Generative Agents and public imagination

* (45:13) The state of ML conferences, the need for more cross-pollination

* (46:42) Prestige in conferences, the move towards more cross-disciplinary work

* (48:52) Joon Park Appreciation

* (49:51) Training interdisciplinary researchers

* (53:20) Generative Ghosts and technological determinism

* (57:06) Examples of generative ghosts and clones, relationships to agentic systems

* (1:00:39) Reasons for wanting generative ghosts

* (1:02:25) Questions of consent for generative clones and ghosts

* (1:05:01) Labor involved in maintaining generative ghosts, psychological tolls

* (1:06:25) Potential religious and spiritual significance of generative systems

* (1:10:19) Anthropomorphization

* (1:12:14) User experience and cognitive biases

* (1:15:24) Levels of AGI

* (1:16:13) Defining AGI

* (1:23:20) World models and AGI

* (1:26:16) Metacognitive abilities in AGI

* (1:30:06) Towards Bidirectional Human-AI Alignment

* (1:30:55) Pluralistic value alignment

* (1:32:43) Meredith’s perspective on deploying AI systems

* (1:36:09) Meredith’s advice for younger interdisciplinary researchers

Links:

* Meredith’s homepage, Twitter, and Google Scholar

* Papers

* Mediating Group Dynamics through Tabletop Interface Design

* SearchTogether: An Interface for Collaborative Web Search

* AI and Accessibility: A Discussion of Ethical Considerations

* Disability, Bias, and AI

* LaMPost: Design and Evaluation of an AI-assisted Email Writing Prototype for Adults with Dyslexia

* Generative Ghosts

* Levels of AGI

Davidad Dalrymple: Towards Provably Safe AI

5 September 2024 | 81 min

Episode 137

I spoke with Davidad Dalrymple about:

* His perspectives on AI risk

* ARIA (the UK’s Advanced Research and Invention Agency) and its Safeguarded AI Programme

Enjoy—and let me know what you think!

Davidad is a Programme Director at ARIA. He was most recently a Research Fellow in technical AI safety at Oxford. He co-invented the top-40 cryptocurrency Filecoin, led an international neuroscience collaboration, and was a senior software engineer at Twitter and multiple startups.

Outline:

* (00:00) Intro

* (00:36) Calibration and optimism about breakthroughs

* (03:35) Calibration and AGI timelines, effects of AGI on humanity

* (07:10) Davidad’s thoughts on the Orthogonality Thesis

* (10:30) Understanding how our current direction relates to AGI and breakthroughs

* (13:33) What Davidad thinks is needed for AGI

* (17:00) Extracting knowledge

* (19:01) Cyber-physical systems and modeling frameworks

* (20:00) Continuities between Davidad’s earlier work and ARIA

* (22:56) Path dependence in technology, race dynamics

* (26:40) More on Davidad’s perspective on what might go wrong with AGI

* (28:57) Vulnerable world, interconnectedness of computers and control

* (34:52) Formal verification and world modeling, Open Agency Architecture

* (35:25) The Semantic Sufficiency Hypothesis

* (39:31) Challenges for modeling

* (43:44) The Deontic Sufficiency Hypothesis and mathematical formalization

* (49:25) Oversimplification and quantitative knowledge

* (53:42) Collective deliberation in expressing values for AI

* (55:56) ARIA’s Safeguarded AI Programme

* (59:40) Anthropic’s ASL levels

* (1:03:12) Guaranteed Safe AI

* (1:03:38) AI risk and (in)accurate world models

* (1:09:59) Levels of safety specifications for world models and verifiers — steps to achieve high safety

* (1:12:00) Davidad’s portfolio research approach and funding at ARIA

* (1:15:46) Earlier concerns about ARIA — Davidad’s perspective

* (1:19:26) Where to find more information on ARIA and the Safeguarded AI Programme

* (1:20:44) Outro

Links:

* Davidad’s Twitter

* ARIA homepage

* Safeguarded AI Programme

* Papers

* Guaranteed Safe AI

* Davidad’s Open Agency Architecture for Safe Transformative AI

* Dioptics: a Common Generalization of Open Games and Gradient-Based Learners (2019)

* Asynchronous Logic Automata (2008)

Clive Thompson: Tales of Technology

29 August 2024 | 148 min

Episode 136

I spoke with Clive Thompson about:

* How he writes

* Writing about the climate and biking across the US

* Technology culture and persistent debates in AI

* Poetry

Enjoy—and let me know what you think!

Clive is a journalist who writes about science and technology. He is a contributing writer for Wired magazine, and is currently writing his next book about micromobility and cycling across the US.

Outline:

* (00:00) Intro

* (01:07) Clive’s life as a Tarantino movie

* (03:07) Boring life and interesting art, life as material for art

* (10:25) Cycling across the US — Clive’s new book on mobility and decarbonization

* (15:07) Turning inward in writing

* (27:21) Including personal experience in writing

* (31:53) Personal and less personal writing

* (36:08) Conveying uncertainty and the “voice from nowhere” in traditional journalism

* (41:10) Finding the natural end of a piece

* (1:02:10) Writing routine

* (1:05:08) Theories of change in Clive’s writing

* (1:12:33) How Clive saw things before the rest of us

* (1:27:00) Automation in software engineering

* (1:31:40) The anthropology of coders, poetry as a framework

* (1:43:50) Proust discourse

* (1:45:00) Technology culture in NYC + interaction between the tech world and other worlds

* (1:50:30) Technological developments Clive wants to see happen (free ideas)

* (2:01:11) Clive’s argument for memorizing poetry

* (2:09:24) How Clive finds poetry

* (2:18:03) Clive’s pursuit of freelance writing and making compromises

* (2:27:25) Outro

Links:

* Clive’s Twitter and website

* Selected writing

* The Attack of the Incredible Grading Machine (Lingua Franca, 1999)

* The Know-It-All Machine (Lingua Franca, 2001)

* How to teach AI some common sense (Wired, 2018)

* Blogs to Riches (NY Mag, 2006)

* Clive vs. Jonathan Franzen on whether the internet is good for writing (The Chronicle of Higher Education, 2013)

* The Minecraft Generation (New York Times, 2016)

* What AI College Exam Proctors are Really Teaching Our Kids (Wired, 2020)

* Companies Don’t Need to Be Creepy to Make Money (Wired, 2021)

* Is Sucking Carbon Out of the Air the Solution to Our Climate Crisis? (Mother Jones, 2021)

* AI Shouldn’t Compete with Workers—It Should Supercharge Them (Wired, 2022)

* Back to BASIC—the Most Consequential Programming Language in the History of Computing (Wired, 2024)

Judy Fan: Reverse Engineering the Human Cognitive Toolkit

22 August 2024 | 93 min

L.M. Sacasas: The Questions Concerning Technology

15 August 2024 | 107 min

Episode 135

I spoke with L. M. Sacasas about:

* His writing and intellectual influences

* The value of asking hard questions about technology and our relationship to it

* What happens when we decide to outsource skills and competency

* Evolving notions of what it means to be human and questions about how to live a good life

Enjoy—and let me know what you think!

Michael is Executive Director of the Christian Study Center of Gainesville, Florida and author of The Convivial Society, a newsletter about technology and society.

He does some of the best writing on technology I’ve had the pleasure to read, and I highly recommend his newsletter.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :) You can also support upkeep for the full Gradient team/project through a paid subscription on Substack!

Outline:

* (00:00) Intro

* (01:12) On podcasts as a medium

* (06:12) Michael’s writing

* (12:38) Michael’s intellectual influences, contingency

* (18:48) Moral seriousness

* (22:00) Michael’s ambitions for his work

* (26:17) The value of asking the right questions (about technology)

* (34:18) Technology use and the “natural” pace of human life

* (46:40) Outsourcing of skills and competency, engagement with others

* (55:33) Inevitability narratives and technological determinism, the “Borg Complex”

* (1:05:10) Notions of what it is to be human, embodiment

* (1:12:37) Higher cognition vs. the body, dichotomies

* (1:22:10) The body as a starting point for philosophy, questions about the adoption of new technologies

* (1:30:01) Enthusiasm about technology and the cultural milieu

* (1:35:30) Projectivism, desire for knowledge about and control of the world

* (1:41:22) Positive visions for the future

* (1:47:11) Outro

Links:

* Michael’s Substack: The Convivial Society and his book, The Frailest Thing: Ten Years of Thinking about the Meaning of Technology

* Michael’s Twitter

* Essays

* Humanist Technology Criticism

* What Does the Critic Love?

* The Ambling Mind

* Waste Your Time, Your Life May Depend On It

* The Work of Art

* The Stuff of (a Well-Lived) Life

Pete Wolfendale: The Revenge of Reason

8 August 2024 | 173 min

Episode 134

I spoke with Pete Wolfendale about:

* The flaws in longtermist thinking

* Selections from his new book, The Revenge of Reason

* Metaphysics

* What philosophy has to say about reason and AI

Enjoy—and let me know what you think!

Pete is an independent philosopher based in Newcastle. Dr. Wolfendale got both his undergraduate degree and his Ph.D. in Philosophy at the University of Warwick. His Ph.D. thesis offered a re-examination of the Heideggerian Seinsfrage, arguing that Heideggerian scholarship has failed to fully do justice to its philosophical significance, and supplementing the shortcomings in Heidegger’s thought about Being with an alternative formulation of the question. He is the author of Object-Oriented Philosophy: The Noumenon's New Clothes and The Revenge of Reason. His blog is Deontologistics.

Outline:

* (00:00) Intro

* (01:30) Pete’s experience with (para-)academia, incentive structures

* (10:00) Progress in philosophy and the analytic tradition

* (17:57) Thinking through metaphysical questions

* (26:46) Philosophy of science, uncovering categorical properties vs. dispositions

* (31:55) Structure of thought and the world, epistemological excess

* (49:31) What reason is, relation to language models, semantic fragmentation of AGI

* (1:00:55) Neural net interpretability and intervention

* (1:08:16) World models, architecture and behavior of AI systems

* (1:12:35) Language acquisition in humans and LMs

* (1:15:30) Pretraining vs. evolution

* (1:16:50) Technological determinism

* (1:18:19) Pete’s thinking on e/acc

* (1:27:45) Prometheanism vs. e/acc

* (1:29:39) The Weight of Forever — Pete’s critique of What We Owe the Future

* (1:30:15) Our rich deontological language and longtermism’s limits

* (1:43:33) Longtermism and the opacity of desire

* (1:44:41) Longtermism’s historical narrative and technological determinism, theories of power

* (1:48:10) The “posthuman” condition, language and techno-linguistic infrastructure

* (2:00:15) Type-checking and universal infrastructure

* (2:09:23) Multitudes and selfhood

* (2:21:12) Definitions of the self and (non-)circularity

* (2:32:55) Freedom and aesthetics, aesthetic exploration and selfhood

* (2:52:46) Outro

Links:

* Pete’s blog and Twitter

* Book: The Revenge of Reason

* Writings / References

* The Weight of Forever

* On Neorationalism

* So, Accelerationism, what’s that all about?

Peter Lee: Computing Theory and Practice, and GPT-4's Impact

1 August 2024 | 62 min

Episode 133

I spoke with Peter Lee about:

* His early work on compiler generation, metacircularity, and type theory

* Paradoxical problems

* GPT-4’s impact, Microsoft’s “Sparks of AGI” paper, and responses and criticism

Enjoy—and let me know what you think!

Peter is President of Microsoft Research. He leads Microsoft Research and incubates new research-powered products and lines of business in areas such as artificial intelligence, computing foundations, health, and life sciences. Before joining Microsoft in 2010, he was at DARPA, where he established a new technology office that created operational capabilities in machine learning, data science, and computational social science. Prior to that, he was a professor and the head of the computer science department at Carnegie Mellon University. Peter is a member of the National Academy of Medicine and serves on the boards of the Allen Institute for Artificial Intelligence, the Brotman Baty Institute for Precision Medicine, and the Kaiser Permanente Bernard J. Tyson School of Medicine. He served on President Obama’s Commission on Enhancing National Cybersecurity. He has testified before both the US House Science and Technology Committee and the US Senate Commerce Committee. With Carey Goldberg and Dr. Isaac Kohane, he is the coauthor of the best-selling book, “The AI Revolution in Medicine: GPT-4 and Beyond.” In 2024, Peter was named by Time magazine as one of the 100 most influential people in health and life sciences.

Outline:

* (00:00) Intro

* (00:50) Basic vs. applied research

* (05:20) Theory and practice in computing

* (10:28) Traditional denotational semantics and semantics engineering in modern-day systems

* (16:47) Beauty and practicality

* (20:40) Metacircularity in the polymorphic lambda calculus: research directions

* (24:31) Understanding the nature of difficulties with metacircularity

* (26:30) Difficulties with reflection, classic paradoxes

* (31:02) Sparks of AGI

* (31:41) Reproducibility

* (38:04) Confirming and disconfirming theories, foundational work

* (42:00) Back and forth between commitments and experimentation

* (51:01) Dealing with responsibility

* (56:30) Peter’s picture of AGI

* (1:01:38) Outro

Links:

* Peter’s Twitter, LinkedIn, and Microsoft Research pages

* Papers and references

* The automatic generation of realistic compilers from high-level semantic descriptions

* Metacircularity in the polymorphic lambda calculus

* A Fresh Look at Combinator Graph Reduction

* Sparks of AGI

* Re-envisioning DARPA

* Fundamental Research in Engineering

Manuel & Lenore Blum: The Conscious Turing Machine

25 July 2024 | 143 min

Episode 132

I spoke with Manuel and Lenore Blum about:

* Their early influences and mentors

* The Conscious Turing Machine and what theoretical computer science can tell us about consciousness

Enjoy—and let me know what you think!

Manuel is a pioneer in the field of theoretical computer science and the winner of the 1995 Turing Award in recognition of his contributions to the foundations of computational complexity theory and its applications to cryptography and program checking, a mathematical approach to writing programs that check their work. He worked as a professor of computer science at the University of California, Berkeley until 2001. From 2001 to 2018, he was the Bruce Nelson Professor of Computer Science at Carnegie Mellon University.

Lenore is a Distinguished Career Professor of Computer Science, Emeritus at Carnegie Mellon University and former Professor-in-Residence in EECS at UC Berkeley. She is president of the Association for Mathematical Consciousness Science and newly elected member of the American Academy of Arts and Sciences. Lenore is internationally recognized for her work in increasing the participation of girls and women in Science, Technology, Engineering, and Math (STEM) fields. She was a founder of the Association for Women in Mathematics, and founding Co-Director (with Nancy Kreinberg) of the Math/Science Network and its Expanding Your Horizons conferences for middle- and high-school girls.

Outline:

* (00:00) Intro

* (03:09) Manuel’s interest in consciousness

* (05:55) More of the story — from memorization to derivation

* (11:15) Warren McCulloch’s mentorship

* (14:00) McCulloch’s anti-Freudianism

* (15:57) More on McCulloch’s influence

* (27:10) On McCulloch and telling stories

* (32:35) The Conscious Turing Machine (CTM)

* (33:55) A last word on McCulloch

* (35:20) Components of the CTM

* (39:55) Advantages of the CTM model

* (50:20) The problem of free will

* (52:20) On pain

* (1:01:10) Brainish / CTM’s multimodal inner language, language and thinking

* (1:13:55) The CTM’s lack of a “central executive”

* (1:18:10) Empiricism and a self, tournaments in the CTM

* (1:26:30) Mental causation

* (1:36:20) Expertise and the CTM model, role of TCS

* (1:46:30) Dreams and dream experience

* (1:50:15) Disentangling components of experience from multimodal language

* (1:56:10) CTM Robot, meaning and symbols, embodiment and consciousness

* (2:00:35) AGI, CTM and AI processors, capabilities

* (2:09:30) CTM implications, potential worries

* (2:17:15) Advice for younger (computer) scientists

* (2:22:57) Outro

Links:

* Manuel’s homepage

* Lenore’s homepage; find Lenore on Twitter (https://x.com/blumlenore) and LinkedIn (https://www.linkedin.com/in/lenore-blum-1a47224)

* Articles

* “The ‘Accidental Activist’ Who Changed the Face of Mathematics” — Ben Brubaker’s Q&A with Lenore

* “How this Turing-Award-winning researcher became a legendary academic advisor” — Sheon Han’s profile of Manuel

* Papers (Manuel and Lenore)

* AI Consciousness is Inevitable: A Theoretical Computer Science Perspective

* A Theory of Consciousness from a Theoretical Computer Science Perspective: Insights from the Conscious Turing Machine

* A Theoretical Computer Science Perspective on Consciousness and Artificial General Intelligence

* References (McCulloch)

* Embodiments of Mind

* Rebel Genius

Kevin Dorst: Against Irrationalist Narratives

18 July 2024 | 135 min

Episode 131

I spoke with Professor Kevin Dorst about:

* Subjective Bayesianism and epistemology foundations

* What happens when you’re uncertain about your evidence

* Why it’s rational for people to polarize on political matters

Enjoy—and let me know what you think!

Kevin is an Associate Professor in the Department of Linguistics and Philosophy at MIT. He works at the border between philosophy and social science, focusing on rationality.

Outline:

* (00:00) Intro

* (01:15) When do Bayesians need theorems?

* (05:52) Foundations of epistemology, metaethics, formal models, error theory

* (09:35) Extreme views and error theory, arguing for/against opposing positions

* (13:35) Changing focuses in philosophy — pragmatic pressures

* (19:00) Kevin’s goals through his research and work

* (25:10) Structural factors in coming to certain (political) beliefs

* (30:30) Acknowledging limited resources, heuristics, imperfect rationality

* (32:51) Hindsight Bias is Not a Bias

* (33:30) The argument

* (35:15) On eating cereal and symmetric properties of evidence

* (39:45) Colloquial notions of hindsight bias, time and evidential support

* (42:45) An example

* (48:02) Higher-order uncertainty

* (48:30) Explicitly modeling higher-order uncertainty

* (52:50) Another example (spoons)

* (54:55) Game theory, iterated knowledge, even higher order uncertainty

* (58:00) Uncertainty and philosophy of mind

* (1:01:20) Higher-order evidence about reliability and rationality

* (1:06:45) Being Rational and Being Wrong

* (1:09:00) Setup on calibration and overconfidence

* (1:12:30) The need for average rational credence — normative judgments about confidence and realism/anti-realism

* (1:15:25) Quasi-realism about average rational credence?

* (1:19:00) Classic epistemological paradoxes/problems — lottery paradox, epistemic luck

* (1:25:05) Deference in rational belief formation, uniqueness and permissivism

* (1:39:50) Rational Polarization

* (1:40:00) Setup

* (1:37:05) Epistemic nihilism, expanded confidence akrasia

* (1:40:55) Ambiguous evidence and confidence akrasia

* (1:46:25) Ambiguity in understanding and notions of rational belief

* (1:50:00) Claims about rational sensitivity — what stories we can tell given evidence

* (1:54:00) Evidence vs presentation of evidence

* (2:01:20) ChatGPT and the case for human irrationality

* (2:02:00) Is ChatGPT replicating human biases?

* (2:05:15) Simple instruction tuning and an alternate story

* (2:10:22) Kevin’s aspirations with his work

* (2:15:13) Outro

Links:

* Professor Dorst’s homepage and Twitter

* Papers

* Modest Epistemology

* Hedden: Hindsight bias is not a bias

* Higher-order evidence + (Almost) all evidence is higher-order evidence

* Being Rational and Being Wrong

* Rational Polarization

* ChatGPT and human irrationality

David Pfau: Manifold Factorization and AI for Science

11 July 2024 | 121 min

Episode 130

I spoke with David Pfau about:

* Spectral learning and ML

* Learning to disentangle manifolds and (projective) representation theory

* Deep learning for computational quantum mechanics

* Picking and pursuing research problems and directions

David’s work is really (times k for some very large value of k) interesting—I’ve been inspired to descend a number of rabbit holes because of it.

(if you listen to this episode, you might become as cool as this guy)

While I’m at it — I’m still hovering around 40 ratings on Apple Podcasts. It’d mean a lot if you’d consider helping me bump that up!

Enjoy—and let me know what you think!

David is a staff research scientist at Google DeepMind. He is also a visiting professor at Imperial College London in the Department of Physics, where he supervises work on applications of deep learning to computational quantum mechanics. His research interests span artificial intelligence, machine learning and scientific computing.

Find me on Twitter for updates on new episodes, and reach me at [email protected] for feedback, ideas, guest suggestions.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :) You can also support upkeep for the full Gradient team/project through a paid subscription on Substack!

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:52) David Pfau the “critic”

* (02:05) Scientific applications of deep learning — David’s interests

* (04:57) Brain / neural network analogies

* (09:40) Modern ML systems and theories of the brain

* (14:19) Desirable properties of theories

* (18:07) Spectral Inference Networks

* (19:15) Connections to FermiNet / computational physics, a series of papers

* (33:52) Deep slow feature analysis — interpretability and findings on eigenfunctions

* (39:07) Following up on eigenfunctions (there are indeed only so many hours in a day; I have been asking the Substack people if they can ship 40-hour days, but I don’t think they’ve gotten to it yet)

* (42:17) Power iteration and intuitions

* (45:23) Projective representation theory

* (46:00) ???

* (46:54) Geomancer and learning to decompose a manifold from data

* (47:45) we consider the question of whether you will spend 90 more minutes of this podcast episode (there are not 90 more minutes left in this podcast episode, but there could have been)

* (1:08:47) Learning embeddings

* (1:11:12) The “unexpected emergent property” of Geomancer

* (1:14:43) Learned embeddings and disentangling and preservation of topology

* n/b I still haven’t managed to do this in colab because I keep crashing my instance when I use s3o4d :(

* (1:21:07) What’s missing from the ~ current (deep learning) paradigm ~

* (1:29:04) LLMs as swiss-army knives

* (1:32:05) RL and human learning — TD learning in the brain

* (1:37:43) Models that cover the Pareto Front (image below)

* (1:46:54) AI accelerators and doubling down on transformers

* (1:48:27) On Slow Research — chasing big questions and what makes problems attractive

* (1:53:50) Future work on Geomancer

* (1:55:35) Finding balance in pursuing interesting and lucrative work

* (2:00:40) Outro

Links:

* Papers

* Natural Quantum Monte Carlo Computation of Excited States (2023)

* Making sense of raw input (2021)

* Integrable Nonparametric Flows (2020)

* Disentangling by Subspace Diffusion (2020)

* Ab initio solution of the many-electron Schrödinger equation with deep neural networks (2020)

* Spectral Inference Networks (2018)

* Connecting GANs and Actor-Critic Methods (2016)

* Learning Structure in Time Series for Neuroscience and Beyond (2015, dissertation)

* Robust learning of low-dimensional dynamics from large neural ensembles (2013)

* Probabilistic Deterministic Infinite Automata (2010)

* Other

* On Slow Research

* “I just want to put this out here so that no one ever says ‘we can just get around the data limitations of LLMs with self-play’ ever again.”

Dan Hart and Michelle Michael: Bringing AI to Students in New South Wales

4 July 2024 | 74 min

Episode 129

I spoke with Dan Hart and Michelle Michael about:

* Developing NSWEduChat, an AI-powered chatbot designed and delivered by the NSW Department of Education for students and teachers.

* The challenges in effectively teaching students as technology develops

* Understanding and defining the importance of the classroom

Enjoy—and let me know what you think!

Dan Hart is Head of AI, and Michelle Michael is Director of Educational Support and Rural Initiatives at the New South Wales (NSW) Department of Education.

Find me on Twitter for updates on new episodes, and reach me at [email protected] for feedback, ideas, guest suggestions.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :) You can also support upkeep for the full Gradient team/project through a paid subscription on Substack!

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:48) How NSWEduChat came to be, educational principles for AI use

* (02:37) Educational environment in New South Wales

* (04:41) How educators have adapted to new challenges for teaching and assessment

* (07:47) Considering technology advancement while teaching and assessing students

* (12:14) Educating teachers and students about how to use AI tools

* (15:03) AI in the classroom and enabling teachers

* (19:44) Product-first thinking for educational AI

* (22:15) Red teaming and testing

* (24:02) Benchmarking, chatbots as an assistant

* (26:35) The importance of the classroom

* (28:10) Media coverage and hype

* (30:35) Measurement and the benchmarking process/methodology

* (34:50) Principles for how chatbots should interact with students

* (44:29) Producing good educational outcomes at scale

* (46:41) Operating with speed and effectiveness while implementing governance

* (49:03) How the experience of building technologies evolves

* (51:45) Identifying good technologists and educators for development and use

* (55:07) Teaching standards and how AI impacts teachers

* (57:01) How technologists incorporate teaching standards and expertise in their work

* (1:00:03) NSWEduChat model details

* (1:02:55) Value alignment for NSWEduChat

* (1:05:40) Practicing caution in filtering chatbot responses

* (1:07:35) Equity and personalized instruction — how NSWEduChat can help

* (1:10:19) Helping students become “the students they could be”

* (1:13:39) Outro

Links:

* NSWEduChat

* Guardian article on NSWEduChat

Kristin Lauter: Private AI, Homomorphic Encryption, and AI for Cryptography

27 June 2024 | 77 min

Episode 129

I spoke with Kristin Lauter about:

* Elliptic curve cryptography and homomorphic encryption

* Standardizing cryptographic protocols

* Machine Learning on encrypted data

* Attacking post-quantum cryptography with AI

Enjoy—and let me know what you think!

Kristin is Senior Director of FAIR Labs North America (2022—present), based in Seattle. Her current research areas are AI4Crypto and Private AI. She joined FAIR (Facebook AI Research) in 2021, after 22 years at Microsoft Research (MSR). At MSR she was Partner Research Manager on the senior leadership team of MSR Redmond. Before joining Microsoft in 1999, she was Hildebrandt Assistant Professor of Mathematics at the University of Michigan (1996-1999). She is an Affiliate Professor of Mathematics at the University of Washington (2008—present). She received all her advanced degrees from the University of Chicago, BA (1990), MS (1991), PhD (1996) in Mathematics. She is best known for her work on Elliptic Curve Cryptography, Supersingular Isogeny Graphs in Cryptography, Homomorphic Encryption (SEALcrypto.org), Private AI, and AI4Crypto. She served as President of the Association for Women in Mathematics from 2015-2017 and on the Council of the American Mathematical Society from 2014-2017.

Find me on Twitter for updates on new episodes, and reach me at [email protected] for feedback, ideas, guest suggestions.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :) You can also support upkeep for the full Gradient team/project through a paid subscription on Substack!

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:10) Llama 3 and encrypted data — where do we want to be?

* (04:20) Tradeoffs: individual privacy vs. aggregated value in e.g. social media forums

* (07:48) Kristin’s shift in views on privacy

* (09:40) Earlier work on elliptic curve cryptography — applications and theory

* (10:50) Inspirations from algebra, number theory, and algebraic geometry

* (15:40) On algebra vs. analysis and on clear thinking

* (18:38) Elliptic curve cryptography and security, algorithms and concrete running time

* (21:31) Cryptographic protocols and setting standards

* (26:36) Supersingular isogeny graphs (and higher-dimensional supersingular isogeny graphs)

* (32:26) Hard problems for cryptography and finding new problems

* (36:42) Guaranteeing security for cryptographic protocols and mathematical foundations

* (40:15) Private AI: Crypto-Nets / running neural nets on homomorphically encrypted data

* (42:10) Polynomial approximations, activation functions, and expressivity

* (44:32) Scaling up, Llama 2 inference on encrypted data

* (46:10) Transitioning between MSR and FAIR, industry research

* (52:45) An efficient algorithm for integer lattice reduction (AI4Crypto)

* (56:23) Local minima, convergence and limit guarantees, scaling

* (58:27) SALSA: Attacking Lattice Cryptography with Transformers

* (58:38) Learning With Errors (LWE) vs. standard ML assumptions

* (1:02:25) Powers of small primes and faster learning

* (1:04:35) LWE and linear regression on a torus

* (1:07:30) Secret recovery algorithms and transformer accuracy

* (1:09:10) Interpretability / encoding information about secrets

* (1:09:45) Future work / scaling up

* (1:12:08) Reflections on working as a mathematician among technologists

Links:

* Kristin’s Meta, Wikipedia, Google Scholar, and Twitter pages

* Papers and sources mentioned/referenced:

* The Advantages of Elliptic Curve Cryptography for Wireless Security (2004)

* Cryptographic Hash Functions from Expander Graphs (2007, introducing Supersingular Isogeny Graphs)

* Families of Ramanujan Graphs and Quaternion Algebras (2008 — the higher-dimensional analogues of Supersingular Isogeny Graphs)

* Cryptographic Cloud Storage (2010)

* Can homomorphic encryption be practical? (2011)

* ML Confidential: Machine Learning on Encrypted Data (2012)

* CryptoNets: Applying neural networks to encrypted data with high throughput and accuracy (2016)

* A community effort to protect genomic data sharing, collaboration and outsourcing (2017)

* The Homomorphic Encryption Standard (2022)

* Private AI: Machine Learning on Encrypted Data (2022)

* SALSA: Attacking Lattice Cryptography with Transformers (2022)

* SalsaPicante: A Machine Learning Attack on LWE with Binary Secrets

* SALSA VERDE: a machine learning attack on LWE with sparse small secrets

* Salsa Fresca: Angular Embeddings and Pre-Training for ML Attacks on Learning With Errors

* The cool and the cruel: separating hard parts of LWE secrets

* An efficient algorithm for integer lattice reduction (2023)

Sergiy Nesterenko: Automating Circuit Board Design

20 June 2024 | 64 min

Episode 128

I spoke with Sergiy Nesterenko about:

* Developing an automated system for designing PCBs

* Difficulties in human and automated PCB design

* Building a startup at the intersection of different areas of expertise

By the way — I hit 40 ratings on Apple Podcasts (and am at 66 on Spotify). It’d mean a lot (really, a lot) if you’d consider leaving a rating or a review. I read everything, and it’s very heartening and helpful to hear what you think.

Enjoy, and let me know what you think!

Sergiy is founder and CEO of Quilter. Sergiy spent 5 years at SpaceX developing radiation-hardened avionics for SpaceX's Falcon 9 and Falcon Heavy's second stage rockets, before discovering a big problem: designing printed circuit boards for all the electronics in these rockets was tedious, manual and error prone. So in 2019, he founded Quilter to build the next generation of AI-powered tooling for electrical engineers.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :)

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:45) Quilter origins and difficulties in designing PCBs

* (04:12) PCBs and schematic implementations

* (06:40) Iteration cycles and simulations

* (08:35) Octilinear traces and first-principles design for PCBs

* (12:38) The design space of PCBs

* (15:27) Benchmarks for PCB design

* (20:05) RL and PCB design

* (22:48) PCB details, track widths

* (25:09) Board functionality and aesthetics

* (27:53) PCB designers and automation

* (30:24) Quilter as a compiler

* (33:56) Gluing social worlds and bringing together expertise

* (36:00) Process knowledge vs. first-principles thinking

* (42:05) Example boards

* (44:45) Auto-routers for PCBs

* (48:43) Difficulties for scaling to larger boards

* (50:42) Customers and skepticism

* (53:42) On experiencing negative feedback

* (56:42) Maintaining stamina while building Quilter

* (1:00:00) Endgame for Quilter and future directions

* (1:03:24) Outro

Links:

* Quilter homepage

* Other pages/features mentioned:

* Thin-to-thick traces

* Octilinear trace routing

* Comment from Tom Fleet

C. Thi Nguyen: Values, Legibility, and Gamification

13 June 2024 | 90 min

Episode 127

I spoke with Christopher Thi Nguyen about:

* How we lose control of our values

* The tradeoffs of legibility, aggregation, and simplification

* Gamification and its risks

Enjoy—and let me know what you think!

As of July 2020, C. Thi Nguyen is Associate Professor of Philosophy at the University of Utah. His research focuses on how social structures and technology can shape our rationality and our agency. He has published on trust, expertise, group agency, community art, cultural appropriation, aesthetic value, echo chambers, moral outrage porn, and games. He received his PhD from UCLA. Once, he was a food writer for the Los Angeles Times.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :)

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:10) The ubiquity of James C. Scott

* (06:03) Legibility and measurement

* (12:50) Value capture, classes and measurement

* (17:30) Political value choice in ML

* (23:30) Why value collapse happens

* (33:00) Blackburn, “Hume and Thick Connexions” — projectivism and legibility

* (36:20) Heuristics and decision-making

* (40:08) Institutional classification systems

* (46:55) Back to Hume

* (48:27) Epistemic arms races, stepping outside our conceptual architectures

* (56:40) The “what to do” question

* (1:04:00) Gamification, aesthetic engagement

* (1:14:51) Echo chambers and defining utility

* (1:22:10) Progress, AGI millenarianism

* (disclaimer: I don’t know what’s going to happen with the world, either.)

* (1:26:04) Parting visions

* (1:30:02) Outro

Links:

* Christopher’s Twitter and homepage

* Games: Agency as Art

* Papers referenced

* Transparency is Surveillance

* Games and the art of agency

* Autonomy and Aesthetic Engagement

* Art as a Shelter from Science

* Value Capture

* Hostile Epistemology

* Hume and Thick Connexions (Simon Blackburn)

Vivek Natarajan: Towards Biomedical AI

6 June 2024 | 115 min

Episode 126

I spoke with Vivek Natarajan about:

* Improving access to medical knowledge with AI

* How an LLM for medicine should behave

* Aspects of training Med-PaLM and AMIE

* How to facilitate appropriate amounts of trust in users of medical AI systems

Vivek Natarajan is a Research Scientist at Google Health AI advancing biomedical AI to help scale world-class healthcare to everyone. Vivek is particularly interested in building large language models and multimodal foundation models for biomedical applications and leads the Google Brain moonshot behind Med-PaLM, Google's flagship medical large language model. Med-PaLM has been featured in Scientific American, The Economist, STAT News, CNBC, Forbes, and New Scientist, among others.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :)

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:35) The concept of an “AI doctor”

* (06:54) Accessibility to medical expertise

* (10:31) Enabling doctors to do better/different work

* (14:35) Med-PaLM

* (15:30) Instruction tuning, desirable traits in LLMs for medicine

* (23:41) Axes for evaluation of medical QA systems

* (30:03) Medical LLMs and scientific consensus

* (35:32) Demographic data and patient interventions

* (40:14) Data contamination in Med-PaLM

* (42:45) Grounded claims about capabilities

* (45:48) Building trust

* (50:54) Genetic Discovery enabled by a LLM

* (51:33) Novel hypotheses in genetic discovery

* (57:10) Levels of abstraction for hypotheses

* (1:01:10) Directions for continued progress

* (1:03:05) Conversational Diagnostic AI

* (1:03:30) Objective Structured Clinical Examination as an evaluative framework

* (1:09:08) Relative importance of different types of data

* (1:13:52) Self-play — conversational dispositions and handling patients

* (1:16:41) Chain of reasoning and information retention

* (1:20:00) Performance in different areas of medical expertise

* (1:22:35) Towards accurate differential diagnosis

* (1:31:40) Feedback mechanisms and expertise, disagreement among clinicians

* (1:35:26) Studying trust, user interfaces

* (1:38:08) Self-trust in using medical AI models

* (1:41:39) UI for medical AI systems

* (1:43:50) Model reasoning in complex scenarios

* (1:46:33) Prompting

* (1:48:41) Future outlooks

* (1:54:53) Outro

Links:

* Vivek’s Twitter and homepage

* Papers

* Towards Expert-Level Medical Question Answering with LLMs (2023)

* LLMs encode clinical knowledge (2023)

* Towards Generalist Biomedical AI (2024)

* AMIE

* Genetic Discovery enabled by a LLM (2023)

Thomas Mullaney: A Global History of the Information Age

30 May 2024 | 104 min

Episode 125

False universalism freaks me out. It doesn’t freak me out as a first principle because of epistemic violence; it freaks me out because it works.

I spoke with Professor Thomas Mullaney about:

* Telling stories about your work and balancing what feels meaningful with practical realities

* Destabilizing our understandings of the technologies we feel familiar with, and the work of researching the history of the Chinese typewriter

* The personal nature of research

The Chinese Typewriter and The Chinese Computer are two of the best books I’ve read in a very long time. And they’re not just good and interesting, but important to read, for the history they tell and the ideas and arguments they present—I can’t recommend them and Professor Mullaney’s other work enough.

Tom is Professor of History and Professor of East Asian Languages and Cultures, by courtesy. He is also the Kluge Chair in Technology and Society at the Library of Congress, and a Guggenheim Fellow. He is the author or lead editor of 8 books, including The Chinese Computer, The Chinese Typewriter (winner of the Fairbank prize), Your Computer is on Fire, and Coming to Terms with the Nation: Ethnic Classification in Modern China.

I spend a lot of time on this podcast—if you like my work, you can support me on Patreon :)

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:00) “In Their Own Words” interview: on telling stories about your work

* (07:42) Clashing narratives and authenticity/inauthenticity in pursuing your work

* (15:48) Why Professor Mullaney pursued studying the Chinese typewriter

* (18:20) Worldmaking, transforming the physical world to fit our descriptive models

* (30:07) Internal and illegible continuities/coherence in work

* (31:45) The role of a “self”

* (43:06) The 2008 Beijing Olympics and false (alphabetical) universalism, projectivism

* (1:04:23) “Kicking the ladder” and the personal nature of research

* (1:18:07) The “Technolinguistic Chinese Exclusion Act” — the situatedness of historians in their work

* (1:33:00) Is the Chinese typewriter project finished? / on the resolution of problems

* (1:43:35) Outro

Links:

* Professor Mullaney’s homepage and Twitter

* In Their Own Words: Thomas Mullaney

* Books

* The Chinese Computer: A Global History of the Information Age

* The Chinese Typewriter: A History

* Coming to Terms with the Nation: Ethnic Classification in Modern China

Seth Lazar: Normative Philosophy of Computing

23 May 2024 | 110 min

Episode 124

You may think you’re doing a priori reasoning, but actually you’re just over-generalizing from your current experience of technology.

I spoke with Professor Seth Lazar about:

* Why managing near-term and long-term risks isn’t always zero-sum

* How to think through axioms and systems in political philosophy

* Coordination problems, economic incentives, and other difficulties in developing publicly beneficial AI

Seth is Professor of Philosophy at the Australian National University, an Australian Research Council (ARC) Future Fellow, and a Distinguished Research Fellow of the University of Oxford Institute for Ethics in AI. He has worked on the ethics of war, self-defense, and risk, and now leads the Machine Intelligence and Normative Theory (MINT) Lab, where he directs research projects on the moral and political philosophy of AI.

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:54) Ad read — MLOps conference

* (01:32) The allocation of attention — attention, moral skill, and algorithmic recommendation

* (03:53) Attention allocation as an independent good (or bad)

* (08:22) Axioms in political philosophy

* (11:55) Explaining judgments, multiplying entities, parsimony, intuitive disgust

* (15:05) AI safety / catastrophic risk concerns

* (22:10) Superintelligence arguments, reasoning about technology

* (28:42) Attacking current and future harms from AI systems — does one draw resources from the other?

* (35:55) GPT-2, model weights, related debates

* (39:11) Power and economics—coordination problems, company incentives

* (50:42) Morality tales, relationship between safety and capabilities

* (55:44) Feasibility horizons, prediction uncertainty, and doing moral philosophy

* (1:02:28) What is a feasibility horizon?

* (1:08:36) Safety guarantees, speed of improvements, the “Pause AI” letter

* (1:14:25) Sociotechnical lenses, narrowly technical solutions

* (1:19:47) Experiments for responsibly integrating AI systems into society

* (1:26:53) Helpful/honest/harmless and antagonistic AI systems

* (1:33:35) Managing incentives conducive to developing technology in the public interest

* (1:40:27) Interdisciplinary academic work, disciplinary purity, power in academia

* (1:46:54) How we can help legitimize and support interdisciplinary work

* (1:50:07) Outro

Links:

* Seth’s Linktree and Twitter

* Resources

* Attention, moral skill, and algorithmic recommendation

* Catastrophic AI Risk slides

Suhail Doshi: The Future of Computer Vision

16 May 2024 | 68 min

Azeem Azhar: The Exponential View

9 May 2024 | 106 min

David Thorstad: Bounded Rationality and the Case Against Longtermism

2 May 2024 | 139 min

Episode 122

I spoke with Professor David Thorstad about:

* The practical difficulties of doing interdisciplinary work

* Why theories of human rationality should account for boundedness, heuristics, and other cognitive limitations

* Why EA epistemics suck (ok, it’s a little more nuanced than that)

Professor Thorstad is an Assistant Professor of Philosophy at Vanderbilt University, a Senior Research Affiliate at the Global Priorities Institute at Oxford, and a Research Affiliate at the MINT Lab at Australian National University. One strand of his research asks how cognitively limited agents should decide what to do and believe. A second strand asks how altruists should use limited funds to do good effectively.

Reach me at [email protected] for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:15) David’s interest in rationality

* (02:45) David’s crisis of confidence, models abstracted from psychology

* (05:00) Blending formal models with studies of the mind

* (06:25) Interaction between academic communities

* (08:24) Recognition of and incentives for interdisciplinary work

* (09:40) Movement towards interdisciplinary work

* (12:10) The Standard Picture of rationality

* (14:11) Why the Standard Picture was attractive

* (16:30) Violations of and rebellion against the Standard Picture

* (19:32) Mistakes made by critics of the Standard Picture

* (22:35) Other competing programs vs Standard Picture

* (26:27) Characterizing Bounded Rationality

* (27:00) A worry: faculties criticizing themselves

* (29:28) Self-improving critique and longtermism

* (30:25) Central claims in bounded rationality and controversies

* (32:33) Heuristics and formal theorizing

* (35:02) Violations of Standard Picture, vindicatory epistemology

* (37:03) The Reason Responsive Consequentialist View (RRCV)

* (38:30) Objective and subjective pictures

* (41:35) Reason responsiveness

* (43:37) There are no epistemic norms for inquiry

* (44:00) Norms vs reasons

* (45:15) Arguments against epistemic nihilism for belief

* (47:30) Norms and self-delusion

* (49:55) Difficulty of holding beliefs for pragmatic reasons

* (50:50) The Gibbardian picture, inquiry as an action

* (52:15) Thinking how to act and thinking how to live — the power of inquiry

* (53:55) Overthinking and conducting inquiry

* (56:30) Is thinking how to inquire an all-things-considered matter?

* (58:00) Arguments for the RRCV

* (1:00:40) Deciding on minimal criteria for the view, stereotyping

* (1:02:15) Eliminating stereotypes from the theory

* (1:04:20) Theory construction in epistemology and moral intuition

* (1:08:20) Refusing theories for moral reasons and disciplinary boundaries

* (1:10:30) The argument from minimal criteria, evaluating against competing views

* (1:13:45) Comparing to other theories

* (1:15:00) The explanatory argument

* (1:17:53) Parfit and Railton, norms of friendship vs utility

* (1:20:00) Should you call out your friend for being a womanizer?

* (1:22:00) Vindicatory Epistemology

* (1:23:05) Panglossianism and meliorative epistemology

* (1:24:42) Heuristics and recognition-driven investigation

* (1:26:33) Rational inquiry leading to irrational beliefs — metacognitive processing

* (1:29:08) Stakes of inquiry and costs of metacognitive processing

* (1:30:00) When agents are incoherent, a focus on inquiry

* (1:32:05) Indirect normative assessment and its consequences

* (1:37:47) Against the Singularity Hypothesis

* (1:39:00) Superintelligence and the ontological argument

* (1:41:50) Hardware growth and general intelligence growth, AGI definitions

* (1:43:55) Difficulties in arguing for hyperbolic growth

* (1:46:07) Chalmers and the proportionality argument

* (1:47:53) Arguments for/against diminishing growth, research productivity, Moore’s Law

* (1:50:08) On progress studies

* (1:52:40) Improving research productivity and technology growth

* (1:54:00) Mistakes in the moral mathematics of existential risk, longtermist epistemics

* (1:55:30) Cumulative and per-unit risk

* (1:57:37) Back and forth with longtermists, time of perils

* (1:59:05) Background risk — risks we can and can’t intervene on, total existential risk

* (2:00:56) The case for longtermism is inflated

* (2:01:40) Epistemic humility and longtermism

* (2:03:15) Knowledge production — reliable sources, blog posts vs peer review

* (2:04:50) Compounding potential errors in knowledge

* (2:06:38) Group deliberation dynamics, academic consensus

* (2:08:30) The scope of longtermism

* (2:08:30) Money in effective altruism and processes of inquiry

* (2:10:15) Swamping longtermist options

* (2:12:00) Washing out arguments and justified belief

* (2:13:50) The difficulty of long-term forecasting and interventions

* (2:15:50) Theory of change in the bounded rationality program

* (2:18:45) Outro

Links:

* David’s homepage and Twitter and blog

* Papers mentioned/read

* Bounded rationality and inquiry

* Why bounded rationality (in epistemology)?

* Against the newer evidentialists

* The accuracy-coherence tradeoff in cognition

* There are no epistemic norms of inquiry

* Permissive metaepistemology

* Global priorities and effective altruism

* What David likes about EA

* Against the singularity hypothesis (+ blog posts)

* Three mistakes in the moral mathematics of existential risk (+ blog posts)

* The scope of longtermism

* Epistemics

Ryan Tibshirani: Statistics, Nonparametric Regression, Conformal Prediction

25 April 2024 | 106 min

Episode 121

I spoke with Professor Ryan Tibshirani about:

* Differences between the ML and statistics communities in scholarship, terminology, and other areas.

* Trend filtering

* Why you can’t just use garbage prediction functions when doing conformal prediction

Ryan is a Professor in the Department of Statistics at UC Berkeley. He is also a Principal Investigator in the Delphi group. From 2011-2022, he was a faculty member in Statistics and Machine Learning at Carnegie Mellon University. From 2007-2011, he did his Ph.D. in Statistics at Stanford University.

Reach me at [email protected] for feedback, ideas, guest suggestions.

The Gradient Podcast on: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:10) Ryan’s background and path into statistics

* (07:00) Cultivating taste as a researcher

* (11:00) Conversations within the statistics community

* (18:30) Use of terms, disagreements over stability and definitions

* (23:05) Nonparametric Regression

* (23:55) Background on trend filtering

* (33:48) Analysis and synthesis frameworks in problem formulation

* (39:45) Neural networks as a specific take on synthesis

* (40:55) Divided differences, falling factorials, and discrete splines

* (41:55) Motivations and background

* (48:07) Divided differences vs. derivatives, approximation and efficiency

* (51:40) Conformal prediction

* (52:40) Motivations

* (1:10:20) Probabilistic guarantees in conformal prediction, choice of predictors

* (1:14:25) Assumptions: i.i.d. and exchangeability — conformal prediction beyond exchangeability

* (1:25:00) Next directions

* (1:28:12) Epidemic forecasting — COVID-19 impact and trends survey

* (1:29:10) Survey methodology

* (1:38:20) Data defect correlation and its limitations for characterizing datasets

* (1:46:14) Outro

Links:

* Ryan’s homepage

* Works read/mentioned

* Nonparametric Regression

* Adaptive Piecewise Polynomial Estimation via Trend Filtering (2014)

* Divided Differences, Falling Factorials, and Discrete Splines: Another Look at Trend Filtering and Related Problems (2020)

* Distribution-free Inference

* Distribution-Free Predictive Inference for Regression (2017)

* Conformal Prediction Under Covariate Shift (2019)

* Conformal Prediction Beyond Exchangeability (2023)

* Delphi and COVID-19 research

* Flexible Modeling of Epidemics

* Real-Time Estimation of COVID-19 Infections

* The US COVID-19 Trends and Impact Survey and Big data, big problems: Responding to “Are we there yet?”

Get full access to The Gradient at thegradientpub.substack.com/subscribe

Sasha Luccioni: Connecting the Dots Between AI's Environmental and Social Impacts

18 April 2024 | 63 min

In episode 120 of The Gradient Podcast, Daniel Bashir speaks to Sasha Luccioni.

Sasha is the AI and Climate Lead at HuggingFace, where she spearheads research, consulting, and capacity-building to elevate the sustainability of AI systems. A founding member of Climate Change AI (CCAI) and a board member of Women in Machine Learning (WiML), Sasha is passionate about catalyzing impactful change, organizing events and serving as a mentor to under-represented minorities within the AI community.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach Daniel at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (00:43) Sasha’s background

* (01:52) How Sasha became interested in sociotechnical work

* (03:08) Larger models and theory of change for AI/climate work

* (07:18) Quantifying emissions for ML systems

* (09:40) Aggregate inference vs training costs

* (10:22) Hardware and data center locations

* (15:10) More efficient hardware vs. bigger models — Jevons paradox

* (17:55) Uninformative experiments, takeaways for individual scientists, knowledge sharing, failure reports

* (27:10) Power Hungry Processing: systematic comparisons of ongoing inference costs

* (28:22) General vs. task-specific models

* (31:20) Architectures and efficiency

* (33:45) Sequence-to-sequence architectures vs. decoder-only

* (36:35) Hardware efficiency/utilization

* (37:52) Estimating the carbon footprint of Bloom and lifecycle assessment

* (40:50) Stable Bias

* (46:45) Understanding model biases and representations

* (52:07) Future work

* (53:45) Metaethical perspectives on benchmarking for AI ethics

* (54:30) “Moral benchmarks”

* (56:50) Reflecting on “ethicality” of systems

* (59:00) Transparency and ethics

* (1:00:05) Advice for picking research directions

* (1:02:58) Outro

Links:

* Sasha’s homepage and Twitter

* Papers read/discussed

* Climate Change / Carbon Emissions of AI Models

* Quantifying the Carbon Emissions of Machine Learning

* Power Hungry Processing: Watts Driving the Cost of AI Deployment?

* Tackling Climate Change with Machine Learning

* CodeCarbon

* Responsible AI

* Stable Bias: Analyzing Societal Representations in Diffusion Models

* Metaethical Perspectives on ‘Benchmarking’ AI Ethics

* Measuring Data

* Mind your Language (Model): Fact-Checking LLMs and their Role in NLP Research and Practice

Michael Sipser: Problems in the Theory of Computation

11 April 2024 | 88 min

In episode 119 of The Gradient Podcast, Daniel Bashir speaks to Professor Michael Sipser.

Professor Sipser is the Donner Professor of Mathematics and member of the Computer Science and Artificial Intelligence Laboratory at MIT.

He received his PhD from UC Berkeley in 1980 and joined the MIT faculty that same year. He was Chairman of Applied Mathematics from 1998 to 2000 and served as Head of the Mathematics Department from 2004 to 2014. He served as interim Dean of Science from 2013 to 2014 and then as Dean of Science from 2014 to 2020.

He was a research staff member at IBM Research in 1980, spent the 1985-86 academic year on the faculty of the EECS department at Berkeley and at MSRI, and was a Lady Davis Fellow at Hebrew University in 1988. His research areas are in algorithms and complexity theory, specifically efficient error-correcting codes, interactive proof systems, randomness, quantum computation, and establishing the inherent computational difficulty of problems. He is the author of the widely used textbook, Introduction to the Theory of Computation (Third Edition, Cengage, 2012).

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach Daniel at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:40) Professor Sipser’s background

* (04:35) On interesting questions

* (09:00) Different kinds of research problems

* (13:00) What makes certain problems difficult

* (18:48) Nature of the P vs NP problem

* (24:42) Identifying interesting problems

* (28:50) Lower bounds on the size of sweeping automata

* (29:50) Why sweeping automata + headway to P vs. NP

* (36:40) Insights from sweeping automata, infinite analogues to finite automata problems

* (40:45) Parity circuits

* (43:20) Probabilistic restriction method

* (47:20) Relativization and the polynomial time hierarchy

* (55:10) P vs. NP

* (57:23) The non-connection between GO’s polynomial space hardness and AlphaGo

* (1:00:40) On handicapping Turing Machines vs. oracle strategies

* (1:04:25) The Natural Proofs Barrier and approaches to P vs. NP

* (1:11:05) Debates on methods for P vs. NP

* (1:15:04) On the possibility of solving P vs. NP

* (1:18:20) On academia and its role

* (1:27:51) Outro

Links:

* Professor Sipser’s homepage

* Papers discussed/read

* Halting space-bounded computations (1978)

* Lower bounds on the size of sweeping automata (1979)

* GO is Polynomial-Space Hard (1980)

* A complexity theoretic approach to randomness (1983)

* Parity, circuits, and the polynomial-time hierarchy (1984)

* A follow-up to Furst-Saxe-Sipser

* The Complexity of Finite Functions (1991)

Andrew Lee: How AI will Shape the Future of Email

4 April 2024 | 64 min

Joss Fong: Videomaking, AI, and Science Communication

28 March 2024 | 84 min

Episode 117

“You get more of what you engage with. Everyone who complains about coverage should understand that every click, every quote tweet, every argument is registered by these publications as engagement. If what you want is really meaty, dispassionate, balanced, and fair explainers, you need to click on that, you need to read the whole thing, you need to share it, talk about it, comment on it. We get the media that we deserve.”

I spoke with Joss Fong.

Joss is a producer focused on science and technology, and was a founding member of the Vox video team. Her work has been recognized by the AAAS Kavli Science Journalism Awards, the Online Journalism Awards, and the News & Documentary Emmys. She holds a master's degree in science, health, and environmental reporting from NYU.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:32) Joss’s path into videomaking, J-school

* (07:45) Consumption and creation in explainer journalism

* (10:45) Finding clarity in information

* (13:15) Communication of ML research

* (15:55) Video journalism and science communication as separate and overlapping disciplines

* (19:41) Evolution of videos and videomaking

* (26:33) Explaining AI and communicating mental models

* (30:47) Meeting viewers in the middle, competing for attention

* (34:07) Explanatory techniques in Glad You Asked

* (37:10) Storytelling and communicating scientific information

* (40:57) “Is Beauty Culture Hurting Us?” and participating in video narratives

* (46:37) AI beauty filters

* (52:59) Obvious bias in generative AI

* (59:31) Definitions and ideas of progress, humanities and technology

* (1:05:08) “Iterative development” and outsourcing quality control to the public

* (1:07:10) Disagreement about (tech) journalism’s purpose

* (1:08:51) Incentives in newsrooms and journalistic organizations

* (1:12:04) AI for video generation and implications, limits of creativity

* (1:17:20) Skill and creativity

* (1:22:35) Joss’s new YouTube channel!

* (1:23:29) Outro

Links:

* Joss’s website and playlist of selected work

* AI-focused videos

* AI Art, Explained (2022)

* AI can do your homework. Now what? (2023)

* Computers just got a lot better at writing (2020)

* Facebook showed this ad to 95% women. Is that a problem? (2020)

* What facial recognition steals from us (2019)

* The big debate about the future of work (2017)

* AI and Creativity short film for Runway’s AIFF (2023)

* Others

* Is Beauty Culture Hurting Us? from Glad You Asked (2020)

* Joss’s Scientific American videos :)

Kate Park: Data Engines for Vision and Language

21 March 2024 | 42 min

Ben Wellington: ML for Finance and Storytelling through Data

14 March 2024 | 68 min

In episode 115 of The Gradient Podcast, Daniel Bashir speaks to Ben Wellington.

Ben is the Deputy Head of Feature Forecasting at Two Sigma, a financial sciences company. Ben has been at Two Sigma for more than 15 years, and currently leads efforts focused on natural language processing and feature forecasting. He is also the author of the data science blog I Quant NY, which has influenced local government policy, including changes in NYC street infrastructure and the design of NYC subway vending machines. Ben is a Visiting Assistant Professor in the Urban and Community Planning program at the Pratt Institute in Brooklyn, where he teaches statistics using urban open data. He holds a Ph.D. in Computer Science from New York University.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:30) Ben’s background

* (04:30) Why Ben was interested in NLP

* (05:48) Ben’s work on translational equivalence, dominant techniques

* (10:14) Scaling, large datasets at Two Sigma

* (12:50) Applying ML techniques to quantitative finance, features in financial ML systems

* (17:27) Baselines and time-dependence in constructing features, human knowledge

* (19:23) Black box models in finance

* (24:00) Two Sigma’s presence in the AI research community

* (26:55) Short- and long-term research initiatives at Two Sigma

* (30:42) How ML fits into Two Sigma’s investment strategy

* (34:05) Alpha and competition in investing

* (36:13) Temporality in data

* (40:38) Challenges for finance/AI and beating the market

* (44:36) Reproducibility

* (49:47) I Quant NY and storytelling with data

* (56:43) Descriptive statistics and stories

* (1:01:05) Benefits of simple methods

* (1:07:11) Outro

Links:

* Ben’s work on translational equivalence and scalable discriminative learning

* Two Sigma Insights

* Storytelling with data and I Quant NY

Venkatesh Rao: Protocols, Intelligence, and Scaling

7 March 2024 | 139 min

“There is this move from generality in a relative sense of ‘we are not as specialized as insects’ to generality in the sense of omnipotent, omniscient, godlike capabilities. And I think there's something very dangerous that happens there, which is you start thinking of the word ‘general’ in completely unhinged ways.”

In episode 114 of The Gradient Podcast, Daniel Bashir speaks to Venkatesh Rao.

Venkatesh is a writer and consultant. He has been writing the widely read Ribbonfarm blog since 2007, and more recently, the popular Ribbonfarm Studio Substack newsletter. He is the author of Tempo, a book on timing and decision-making, and is currently working on his second book, on the foundations of temporality. He has been an independent consultant since 2011, supporting senior executives in the technology industry. His work in recent years has focused on AI, semiconductor, sustainability, and protocol technology sectors. He holds a PhD in control theory (2003) from the University of Michigan. He is currently based in the Seattle area, and enjoys dabbling in robotics in his spare time. You can learn more about his work at venkateshrao.com

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:38) Origins of Ribbonfarm and Venkat’s academic background

* (04:23) Voice and recurring themes in Venkat’s work

* (11:45) Patch models and multi-agent systems: integrating philosophy of language, balancing realism with tractability

* (21:00) More on abstractions vs. tractability in Venkat’s work

* (29:07) Scaling of industrial value systems, characterizing AI as a discipline

* (39:25) Emergent science, intelligence and abstractions, presuppositions in science, generality and universality, cameras and engines

* (55:05) Psychometric terms

* (1:09:07) Inductive biases (yes I mentioned the No Free Lunch Theorem and then just talked about the definition of inductive bias and not the actual theorem 🤡)

* (1:18:13) LLM training and efficiency, comparing LLMs to humans

* (1:23:35) Experiential age, analogies for knowledge transfer

* (1:30:50) More clarification on the analogy

* (1:37:20) Massed Muddler Intelligence and protocols

* (1:38:40) Introducing protocols and the Summer of protocols

* (1:49:15) Evolution of protocols, hardness

* (1:54:20) LLMs, protocols, time, future visions, and progress

* (2:01:33) Protocols, drifting from value systems, friction, compiling explicit knowledge

* (2:14:23) Directions for ML people in protocols research

* (2:18:05) Outro

Links:

* Venkat’s Twitter and homepage

* Mediocre Computing

* Summer of Protocols and 2024 Call for Applications (apply!)

* Essays discussed

* Patch models and their applications to multivehicle command and control

* From Mediocre Computing

* Text is All You Need

* Magic, Mundanity, and Deep Protocolization

* A Camera, Not an Engine

* Massed Muddler Intelligence

* On protocols

* The Unreasonable Sufficiency of Protocols

* Protocols Don’t Build Pyramids

* Protocols in (Emergency) Time

* Atoms, Institutions, Blockchains

Sasha Rush: Building Better NLP Systems

29 February 2024 | 54 min

In episode 113 of The Gradient Podcast, Daniel Bashir speaks to Professor Sasha Rush.

Professor Rush is an Associate Professor at Cornell University and a Researcher at HuggingFace. His research aims to develop natural language processing systems that are safe, fast, and controllable. His group is interested primarily in tasks that involve text generation, and they study data-driven probabilistic methods that combine deep-learning-based models with probabilistic controls. He is also interested in open-source NLP and deep learning, and develops projects to make deep learning systems safer, clearer, and easier to use.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:47) Professor Rush’s background

* (03:23) Professor Rush’s reflections on prior work—importance of learning and inference

* (04:58) How much engineering matters in deep learning, the Rush vs. Frankle Bet

* (07:12) On encouraging and incubating good research

* (10:50) Features of good research environments

* (12:36) 5% bets in Professor Rush’s research: State-Space Models (SSMs) as an alternative to Transformers

* (15:58) SSMs vs. Transformers

* (18:53) Probabilistic Context-Free Grammars—are (P)CFGs worth paying attention to?

* (20:53) Sequence-level knowledge distillation: approximating sequence-level distributions

* (25:08) Pruning and knowledge distillation — orthogonality of efficiency techniques

* (26:33) Broader thoughts on efficiency

* (28:31) Works on prompting

* (28:58) Prompting and In-Context Learning

* (30:05) Thoughts on mechanistic interpretability

* (31:25) Multitask prompted training enables zero-shot task generalization

* (33:48) How many data points is a prompt worth?

* (35:13) Directions for controllability in LLMs

* (39:11) Controllability and safety

* (41:23) Open-source work, deep learning libraries

* (42:08) A story about Professor Rush’s post-doc at FAIR

* (43:51) The impact of PyTorch

* (46:08) More thoughts on deep learning libraries

* (48:48) Levels of abstraction, PyTorch as an interface to motivate research

* (50:23) Empiricism and research commitments

* (53:32) Outro

Links:

* Research

* Early work / PhD

* Dual Decomposition and LP Relaxations

* Vine Pruning for Efficient Multi-Pass Dependency Parsing

* Improved Parsing and POS Tagging Using Inter-Sentence Dependency Constraints

* Research — interpretable and controllable natural language generation

* Compound Probabilistic Context-Free Grammars for Grammar Induction

* Multitask prompted training enables zero-shot task generalization

* Research — deep generative models

* A Neural Attention Model for Abstractive Sentence Summarization

* Learning Neural Templates for Text Generation

* How many data points is a prompt worth?

* Research — efficient algorithms and hardware for speech, translation, dialogue

* Sequence-Level Knowledge Distillation

* Open-source work

* NamedTensor

* Torch Struct

Cameron Jones & Sean Trott: Understanding, Grounding, and Reference in LLMs

22 February 2024 | 119 min

In episode 112 of The Gradient Podcast, Daniel Bashir speaks to Cameron Jones and Sean Trott.

Cameron is a PhD candidate in the Cognitive Science Department at the University of California, San Diego. His research compares how humans and large language models process language about world knowledge, situation models, and theory of mind.

Sean is an Assistant Teaching Professor in the Cognitive Science Department at the University of California, San Diego. His research interests include probing large language models, ambiguity in languages, how ambiguous words are represented, and pragmatic inference. He previously completed his PhD at UCSD.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:55) Cameron’s background

* (06:00) Sean’s background

* (08:15) Unexpected capabilities of language models and the need for embodiment to understand meaning

* (11:05) Interpreting results of Turing tests, separating what humans and LLMs do when behaving as though they “understand”

* (14:27) Internal mechanisms, interpretability, how we test theories

* (16:40) Languages are efficient, but for whom?

* (17:30) Initial motivations: lexical ambiguity

* (19:20) The balance of meanings across wordforms

* (22:35) Tension between speaker- and comprehender-oriented pressures in lexical ambiguity

* (25:05) Context and potential vs. realized ambiguity

* (27:15) LLM-ology

* (28:30) Studying LLMs as models of human cognition and as interesting objects of study in their own right

* (30:03) Example of explaining away effects

* (33:54) The internalist account of belief sensitivity—behavior and internal representations

* (37:43) LLMs and the False Belief Task

* (42:05) Hypothetical on observed behavior and inferences about internal representations

* (48:05) Distributional Semantics Still Can’t Account for Affordances

* (50:25) Tests of embodied theories and limitations of distributional cues

* (53:54) Multimodal models and object affordances

* (58:30) Language and grounding, other buzzwords

* (59:45) How could we know if LLMs understand language?

* (1:04:50) Reference: as a thing words do vs. ontological notion

* (1:11:38) The Role of Physical Inference in Pronoun Resolution

* (1:16:40) World models and world knowledge

* (1:19:45) EPITOME

* (1:20:20) The different tasks

* (1:26:43) Confounders / “attending” in LM performance on tasks

* (1:30:30) Another hypothetical, on theory of mind

* (1:32:26) How much information can language provide in service of mentalizing?

* (1:35:14) Convergent validity and coherence/validity of theory of mind

* (1:39:30) Interpretive questions about behavior w/r/t theory of mind

* (1:43:35) Does GPT-4 Pass the Turing Test?

* (1:44:00) History of the Turing Test

* (1:47:05) Interrogator strategies and the strength of the Turing Test

* (1:52:15) “Internal life” and personality

* (1:53:30) How should this research impact how we assess / think about LLM abilities?

* (1:58:56) Outro

Links:

* Cameron’s homepage and Twitter

* Sean’s homepage and Twitter

* Research — Language and NLP

* Languages are efficient, but for whom?

* Research — LLM-ology

* Do LLMs know what humans know?

* Distributional Semantics Still Can’t Account for Affordances

* In Cautious Defense of LLM-ology

* Should Psycholinguists use LLMs as “model organisms”?

* (Re)construing Meaning in NLP

* Research — language and grounding, theory of mind, reference [insert other buzzwords here]

* Do LLMs have a “theory of mind”?

* How could we know if LLMs understand language?

* Does GPT-4 Pass the Turing Test?

* Could LMs change language?

* The extended mind and why it matters for cognitive science research

* EPITOME

* The Role of Physical Inference in Pronoun Resolution

Nicholas Thompson: AI and Journalism

15 February 2024 | 60 min

In episode 111 of The Gradient Podcast, Daniel Bashir speaks to Nicholas Thompson.

Nicholas is the CEO of The Atlantic. Previously, he served as editor-in-chief of Wired and editor of Newyorker.com. Nick also co-founded Atavist, which sold to Automattic in 2018. Publications under Nick’s leadership have won numerous National Magazine Awards and Pulitzer Prizes, and one WIRED story he edited was the basis for the movie Argo. Nick is also the co-founder of Speakeasy AI, a software platform designed to foster constructive online conversations about the world’s most pressing problems.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:12) Nick’s path into journalism

* (03:25) The Washington Monthly — a turning point

* (05:09) Perspectives from different positions in the journalism industry

* (08:16) What is great journalism?

* (09:42) Example from The Atlantic

* (11:00) Other examples/pieces of good journalism

* (12:20) Pieces on aging

* (12:56) Mortality and life-force associated with running — Nick’s piece in WIRED

* (15:30) On urgency

* (18:20) The job of an editor

* (22:23) AI in journalism — benefits and limitations

* (26:45) How AI can help writers, experimentation

* (28:40) Examples of AI in journalism and issues: CNET, Sports Illustrated, Nick’s thoughts on how AI should be used in journalism

* (32:20) Speakeasy AI and creating healthy conversation spaces

* (34:00) Details about Speakeasy

* (35:12) Business pivots and business model trouble

* (35:37) Remaining gaps in fixing conversational spaces

* (38:27) Lessons learned

* (40:00) Nick’s optimism about Speakeasy-like projects

* (43:14) Social simulacra, a “Troll WestWorld,” algorithmic adjustments in social media

* (46:11) Lessons and wisdom from journalism about engagement, more on engagement in social media

* (50:27) Successful and unsuccessful futures for AI in journalism

* (54:17) Previous warnings about synthetic media, Nick’s perspective on risks from synthetic media in journalism

* (57:00) Stop trying to build AGI

* (59:13) Outro

Links:

* Nicholas’s Twitter and website

* Speakeasy AI

* Writing

* “To Run My Best Marathon at Age 44, I Had to Outrun My Past” in WIRED

* “The year AI actually changes the media business” in NiemanLab’s Predictions for Journalism 2023

Subbarao Kambhampati: Planning, Reasoning, and Interpretability in the Age of LLMs

8 February 2024 | 119 min

In episode 110 of The Gradient Podcast, Daniel Bashir speaks to Professor Subbarao Kambhampati.

Professor Kambhampati is a professor of computer science at Arizona State University. He studies fundamental problems in planning and decision making, motivated by the challenges of human-aware AI systems. He is a fellow of the Association for the Advancement of Artificial Intelligence, American Association for the Advancement of Science, and Association for Computing Machinery, and was an NSF Young Investigator. He was the president of the Association for the Advancement of Artificial Intelligence, trustee of the International Joint Conference on Artificial Intelligence, and a founding board member of Partnership on AI.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:11) Professor Kambhampati’s background

* (06:07) Explanation in AI

* (18:08) What people want from explanations—vocabulary and symbolic explanations

* (21:23) The realization of new concepts in explanation—analogy and grounding

* (30:36) Thinking and language

* (31:48) Conscious and subconscious mental activity

* (36:58) Tacit and explicit knowledge

* (42:09) The development of planning as a research area

* (46:12) RL and planning

* (47:47) What makes a planning problem hard?

* (51:23) Scalability in planning

* (54:48) LLMs do not perform reasoning

* (56:51) How to show LLMs aren’t reasoning

* (59:38) External verifiers and backprompting LLMs

* (1:07:51) LLMs as cognitive orthotics, language and representations

* (1:16:45) Finding out what kinds of representations an AI system uses

* (1:31:08) “Compiling” system 2 knowledge into system 1 knowledge in LLMs

* (1:39:53) The Generative AI Paradox, reasoning and retrieval

* (1:43:48) AI as an ersatz natural science

* (1:44:03) Why AI is straying away from its engineering roots, and what constitutes engineering

* (1:58:33) Outro

Links:

* Professor Kambhampati’s Twitter and homepage

* Research and Writing — Planning and Human-Aware AI Systems

* A Validation-structure-based theory of plan modification and reuse (1990)

* Challenges of Human-Aware AI Systems (2020)

* Polanyi vs. Planning (2021)

* LLMs and Planning

* Can LLMs Really Reason and Plan? (2023)

* On the Planning Abilities of LLMs (2023)

* Other

* Changing the nature of AI research

Russ Maschmeyer: Spatial Commerce and AI in Retail

1 February 2024 | 56 min

Benjamin Breen: The Intersecting Histories of Psychedelics and AI Research

25 January 2024 | 68 min

Ted Gibson: The Structure and Purpose of Language

18 January 2024 | 133 min

Harvey Lederman: Propositional Attitudes and Reference in Language Models

11 January 2024 | 131 min

In episode 106 of The Gradient Podcast, Daniel Bashir speaks to Professor Harvey Lederman.

Professor Lederman is a professor of philosophy at UT Austin. He has broad interests in contemporary philosophy and in the history of philosophy: his areas of specialty include philosophical logic, the Ming dynasty philosopher Wang Yangming, epistemology, and philosophy of language. He has recently been working on incomplete preferences, on trying in the philosophy of language, and on Wang Yangming’s moral metaphysics.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:15) Harvey’s background

* (05:30) Higher-order metaphysics and propositional attitudes

* (06:25) Motivations

* (12:25) Setup: syntactic types and ontological categories

* (25:11) What makes higher-order languages meaningful and not vague?

* (25:57) Higher-order languages corresponding to the world

* (30:52) Extreme vagueness

* (35:32) Desirable features of languages and important questions in philosophy

* (36:42) Higher-order identity

* (40:32) Intuitions about mental content, language, context-sensitivity

* (50:42) Perspectivism

* (51:32) Co-referring names, identity statements

* (55:42) The paper’s approach, “know” as context-sensitive

* (57:24) Propositional attitude psychology and mentalese generalizations

* (59:57) The “good standing” of theorizing about propositional attitudes

* (1:02:22) Mentalese

* (1:03:32) “Does knowledge imply belief?” — when a question does not have good standing

* (1:06:17) Sense, Reference, and Substitution

* (1:07:07) Fregeans and the principle of Substitution

* (1:12:12) Follow-up work to this paper

* (1:13:39) Do Language Models Produce Reference Like Libraries or Like Librarians?

* (1:15:02) Bibliotechnism

* (1:19:08) Inscriptions and reference, what it takes for something to refer

* (1:22:37) Derivative and basic reference

* (1:24:47) Intuition: n-gram models and reference

* (1:28:22) Meaningfulness in sentences produced by n-gram models

* (1:30:40) Bibliotechnism and LLMs, disanalogies to n-grams

* (1:33:17) On other recent work (vector grounding, do LMs refer?, etc.)

* (1:40:12) Causal connections and reference, how bibliotechnism makes good on the meanings of sentences

* (1:45:46) RLHF, sensitivity to truth and meaningfulness

* (1:48:47) Intelligibility

* (1:50:52) When LLMs produce novel reference

* (1:53:37) Novel reference vs. find-replace

* (1:56:00) Directionality example

* (1:58:22) Human intentions and derivative reference

* (2:00:47) Between bibliotechnism and agency

* (2:05:32) Where do invented names / novel reference come from?

* (2:07:17) Further questions

* (2:10:04) Outro

Links:

* Harvey’s homepage and Twitter

* Papers discussed

* Higher-order metaphysics and propositional attitudes

* Perspectivism

* Sense, Reference, and Substitution

* Are Language Models More Like Libraries or Like Librarians? Bibliotechnism, the Novel Reference Problem, and the Attitudes of LLMs

Eric Jang: AI is Good For You

4 January 2024 | 90 min

2023 in AI, with Nathan Benaich

28 December 2023 | 96 min

In episode 104 of The Gradient Podcast, Daniel Bashir speaks to Nathan Benaich.

Nathan is Founder and General Partner at Air Street Capital, a VC firm focused on investing in AI-first technology and life sciences companies. Nathan runs a number of communities focused on AI including the Research and Applied AI Summit and leads Spinout.fyi to improve the creation of university spinouts. Nathan co-authors the State of AI Report.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:00) Updates in Nathan World: Air Street’s second fund, spinouts

* (07:30) Events: Research and Applied AI Summit, State of AI Report launches

* (09:50) The State of AI: main messages, the increasing role of subject matter experts

* Research

* (14:13) Open and closed-source

* (17:55) Benchmarking and evaluation, small/large models and industry verticals

* (21:10) “Vibes” in LLM evaluation

* (24:00) Codegen models, personalized AI, curriculum learning

* (26:20) The exhaustion of human-generated data, lukewarm content, synthetic data

* (29:50) Opportunities for AI applications in the natural sciences

* (35:15) Reinforcement Learning from Human Feedback and alternatives

* (38:30) Industry

* (39:00) ChatGPT and productivity

* (42:37) General app wars, ChatGPT competitors

* (45:50) Compute—demand, supply, competition

* (50:55) Export controls and geopolitics

* (54:45) Startup funding and compute spend

* (59:15) Politics

* (59:40) Calls for regulation, regulatory divergence

* (1:04:40) AI safety

* (1:07:30) Nathan’s perspective on regulatory approaches

* (1:12:30) The UK’s early access to frontier models, standards setting, regulation difficulties

* (1:17:20) Jailbreaking, constitutional AI, robustness

* (1:20:50) Predictions!

* (1:25:00) Generative AI misuse in elections and politics (and this prediction coming true in Bangladesh)

* (1:26:50) Progress on AI governance

* (1:30:30) European dynamism

* (1:35:08) Outro

Links:

* Nathan’s homepage and Twitter

* The 2023 State of AI Report

* Bringing Dynamism to European Defense

* A prediction coming true: How AI is disrupting Bangladesh’s election

* Air Street Capital is hiring a full-time Community Lead!

Kathleen Fisher: DARPA and AI for National Security

21 December 2023 | 46 min

In episode 103 of The Gradient Podcast, Daniel Bashir speaks to Dr. Kathleen Fisher.

As the director of DARPA’s Information Innovation Office (I2O), Dr. Kathleen Fisher oversees a portfolio that includes most of the agency’s AI-related research and development efforts, including the recent AI Forward initiative. AI Forward explores new directions for AI research that will result in trustworthy systems for national security missions. This summer, roughly 200 participants from the commercial sector, academia, and the U.S. government attended workshops that generated ideas to inform DARPA’s next phase of AI exploratory projects. Dr. Fisher previously served as a program manager in I2O from 2011 to 2014. As a program manager, she conceptualized, created, and executed programs in high-assurance computing and machine learning, including Probabilistic Programming for Advancing Machine Learning (PPAML), which aimed to make building ML applications easier. She was also a co-author of a recent paper about the threats posed by large language models.

Since 2018, DARPA has dedicated over $2 billion in R&D funding to AI research. The agency has been generating groundbreaking research and development for 65 years, leading to game-changing military capabilities and icons of modern society: it initiated the research field that led to self-driving cars and developed the technology that led to Apple’s Siri.

Outline:

* (00:00) Intro

* (01:30) Kathleen’s background

* (05:05) Intersections between programming languages and AI

* (07:15) Neuro-symbolic AI, trade-offs between flexibility and guarantees

* (09:45) History of DARPA and the Information Innovation Office (I2O)

* (13:55) DARPA’s perspective on research

* (17:10) Galvanizing a research community

* (20:06) DARPA’s recent investments in AI and AI Forward

* (26:35) Dual-use nature of generative AI, identifying and mitigating security risks, Kathleen’s perspective on short-term and long-term risk (note: the “Gradient podcast” Kathleen mentions is from Last Week in AI)

* (30:10) Concerns about deployment and interaction

* (32:20) Outcomes from AI Forward workshops and themes

* (36:10) Incentives in building and using AI technologies, friction

* (38:40) Interactions between DARPA and other government agencies

* (40:09) Future research directions

* (44:04) Ways to stay up to date on DARPA’s work

* (45:40) Outro

Links:

* DARPA I2O website

* Probabilistic Programming for Advancing Machine Learning (PPAML) (Archived)

* Assured Neuro Symbolic Learning and Reasoning (ANSR)

* AI Cyber Challenge

* AI Forward

* Identifying and Mitigating the Security Risks of Generative AI Paper

* FoundSci Solicitation

* FACT Solicitation

* Semantic Forensics (SemaFor)

* GARD Open Source Resources

* I2O Newsletter signup

Peter Tse: The Neuroscience of Consciousness and Free Will

14 December 2023 | 144 min

In episode 102 of The Gradient Podcast, Daniel Bashir speaks to Peter Tse.

Professor Tse is a Professor of Cognitive Neuroscience and chair of the department of Psychological and Brain Sciences at Dartmouth College. His research focuses on using brain and behavioral data to constrain models of the neural bases of attention and consciousness, unconscious processing that precedes and constructs consciousness, mental causation, and human capacities for imagination and creativity. He is especially interested in the processing that goes into the construction of conscious experience between retinal activation at time 0 and seeing an event about a third of a second later.

Outline:

* (00:00) Intro

* (01:45) Prof. Tse’s background

* (03:25) Early experiences in physics/math and philosophy of physics

* (06:10) Choosing to study neuroscience

* (07:15) Prof Tse’s commitments about determinism

* (10:00) Quantum theory and determinism

* (13:45) Biases/preferences in choosing theories

* (20:41) Falsifiability and scientific questions, transition from physics to neuroscience

* (30:50) How neuroscience is unusual among the sciences

* (33:20) Neuroscience and subjectivity

* (34:30) Reductionism

* (37:30) Gestalt psychology

* (41:30) Introspection in neuroscience

* (45:30) The preconscious buffer and construction of conscious experience, color constancy

* (53:00) Perceptual and cognitive inference

* (55:00) AI systems and intrinsic meaning

* (57:15) Information vs. meaning

* (1:01:45) Consciousness and representation of bodily states

* (1:05:10) Our second-order free will

* (1:07:20) Jaegwon Kim’s exclusion argument

* (1:11:45) Why Kim thought his own argument was wrong

* (1:15:00) Resistance and counterarguments to Kim

* (1:19:45) Criterial causation

* (1:23:00) How neurons evaluate inputs criterially

* (1:24:00) Concept neurons in the hippocampus

* (1:31:57) Criterial causation and physicalism, mental causation

* (1:40:10) Daniel makes another attempt to push back 🤡

* (1:45:47) More on AI

* (1:47:05) Prof Tse’s perspective on modern AI systems, differences with human cognition

* (2:17:25) Consciousness, attention, spirituality

* (2:20:10) Prof Tse’s hopes for AI

* (2:23:30) Outro

Links:

* Professor Tse’s homepage

* Papers

* Vision/Perception

* Perceptual learning based on the learning of diagnostic features

* Complete mergeability and amodal completion

* Attention

* How Attention Can Alter Appearances

* How Top-down Attention Alters Bottom-up preconscious operations

* Consciousness

* Network structure and dynamics of the mental workspace

* On free will

* NDPR review of “Neural Basis of Free Will”

* Kripke’s Category Error

* Ontological Indeterminism undermines Kim’s Exclusion Argument

Vera Liao: AI Explainability and Transparency

7 December 2023 | 97 min

Thomas Dietterich: From the Foundations

30 November 2023 | 122 min

In episode 100 of The Gradient Podcast, Daniel Bashir speaks to Professor Thomas Dietterich.

Professor Dietterich is Distinguished Professor Emeritus in the School of Electrical Engineering and Computer Science at Oregon State University. He is a pioneer in the field of machine learning, and has authored more than 225 refereed publications and two books. His current research topics include robust artificial intelligence, robust human-AI systems, and applications in sustainability. He is a former President of the Association for the Advancement of Artificial Intelligence, and the founding President of the International Machine Learning Society. Other major roles include Executive Editor of the journal Machine Learning, co-founder of the Journal of Machine Learning Research, and program chair of AAAI 1990 and NIPS 2000. He currently serves as one of the moderators for the cs.LG category on arXiv.

Outline:

* (00:00) Episode 100 Note

* (02:03) Intro

* (04:23) Prof. Dietterich’s background

* (14:20) Kuhn and theory development in AI, how Prof Dietterich thinks about the philosophy of science and AI

* (20:10) Scales of understanding and sentience, grounding, observable evidence

* (23:58) Limits of statistical learning without causal reasoning, systematic understanding

* (25:48) A challenge for the ML community: testing for systematicity

* (26:13) Forming causal understandings of the world

* (28:18) Learning at the Knowledge Level

* (29:18) Background and definitions

* (32:18) Knowledge and goals, a note on LLMs

* (33:03) What it means to learn

* (41:05) LLMs as learning results of inference without learning first principles

* (43:25) System I/II thinking in humans and LLMs

* (47:23) “Routine Science”

* (47:38) Solving multiclass learning problems via error-correcting output codes

* (52:53) Error-correcting codes and redundancy

* (54:48) Why error-correcting codes work, contra intuition

* (59:18) Bias in ML

* (1:06:23) MAXQ for hierarchical RL

* (1:15:48) Computational sustainability

* (1:19:53) Project TAHMO’s moonshot

* (1:23:28) Anomaly detection for weather stations

* (1:25:33) Robustness

* (1:27:23) Motivating The Familiarity Hypothesis

* (1:27:23) Anomaly detection and self-models of competence

* (1:29:25) Measuring the health of freshwater streams

* (1:31:55) An open set problem in species detection

* (1:33:40) Issues in anomaly detection for deep learning

* (1:37:45) The Familiarity Hypothesis

* (1:40:15) Mathematical intuitions and the Familiarity Hypothesis

* (1:44:12) What’s Wrong with LLMs and What We Should Be Building Instead

* (1:46:20) Flaws in LLMs

* (1:47:25) The systems Prof Dietterich wants to develop

* (1:49:25) Hallucination/confabulation and LLMs vs knowledge bases

* (1:54:00) World knowledge and linguistic knowledge

* (1:55:07) End-to-end learning and knowledge bases

* (1:57:42) Components of an intelligent system and separability

* (1:59:06) Thinking through external memory

* (2:01:10) Outro

Links:

* Research — Fundamentals (Philosophy of AI)

* Learning at the Knowledge Level

* What Does it Mean for a Machine to Understand?

* Research – “Routine science”

* Ensemble methods in ML and error-correcting output codes

* Solving multiclass learning problems via error-correcting output codes

* An experimental comparison of bagging, boosting, and randomization

* ML Bias, Statistical Bias, and Statistical Variance of Decision Tree Algorithms

* The definitive treatment of these questions, by Gareth James

* Discovering/Exploiting structure in MDPs:

* MAXQ for hierarchical RL

* Exogenous State MDPs (paper with George Trimponias, slides)

* Research — Ecosystem Informatics and Computational Sustainability

* Project TAHMO

* Challenges for ML in Computational Sustainability

* Research — Robustness

* Steps towards robust AI (AAAI President’s Address)

* Benchmarking NN Robustness to Common Corruptions and Perturbations with Dan Hendrycks

* The familiarity hypothesis: Explaining the behavior of deep open set methods

* Recent commentary

* Toward High-Reliability AI

* What's Wrong with Large Language Models and What We Should Be Building Instead

Martin Wattenberg: ML Visualization and Interpretability

16 November 2023 | 102 min

In episode 99 of The Gradient Podcast, Daniel Bashir speaks to Professor Martin Wattenberg.

Professor Wattenberg is a professor at Harvard and part-time member of Google Research’s People + AI Research (PAIR) initiative, which he co-founded. His work, with long-time collaborator Fernanda Viégas, focuses on making AI technology broadly accessible and reflective of human values. At Google, Professor Wattenberg, his team, and Professor Viégas have created end-user visualizations for products such as Search, YouTube, and Google Analytics. Note: Professor Wattenberg is recruiting PhD students through Harvard SEAS—info here.

Outline:

* (00:00) Intro

* (03:30) Prof. Wattenberg’s background

* (04:40) Financial journalism at SmartMoney

* (05:35) Contact with the academic visualization world, IBM

* (07:30) Transition into visualizing ML

* (08:25) Skepticism of neural networks in the 1980s

* (09:45) Work at IBM

* (10:00) Multiple scales in information graphics, organization of information

* (13:55) How much information should a graphic display to whom?

* (17:00) Progressive disclosure of complexity in interface design

* (18:45) Visualization as a rhetorical process

* (20:45) Conversation Thumbnails for Large-Scale Discussions

* (21:35) Evolution of conversation interfaces—Slack, etc.

* (24:20) Path dependence — mutual influences between user behaviors and technology, takeaways for ML interface design

* (26:30) Baby Names and Social Data Analysis — patterns of interest in baby names

* (29:50) History Flow

* (30:05) Why investigate editing dynamics on Wikipedia?

* (32:06) Implications of editing patterns for design and governance

* (33:25) The value of visualizations in this work, issues with Wikipedia editing

* (34:45) Community moderation, bureaucracy

* (36:20) Consensus and guidelines

* (37:10) “Neutral” point of view as an organizing principle

* (38:30) Takeaways

* PAIR

* (39:15) Tools for model understanding and “understanding” ML systems

* (41:10) Intro to PAIR (at Google)

* (42:00) Unpacking the word “understanding” and use cases

* (43:00) Historical comparisons for AI development

* (44:55) The birth of TensorFlow.js

* (47:52) Democratization of ML

* (48:45) Visualizing translation — uncovering and telling a story behind the findings

* (52:10) Shared representations in LLMs and their facility at translation-like tasks

* (53:50) TCAV

* (55:30) Explainability and trust

* (59:10) Writing code with LMs and metaphors for using

* More recent research

* (1:01:05) The System Model and the User Model: Exploring AI Dashboard Design

* (1:10:05) OthelloGPT and world models, causality

* (1:14:10) Dashboards and interaction design—interfaces and core capabilities

* (1:18:07) Reactions to existing LLM interfaces

* (1:21:30) Visualizing and Measuring the Geometry of BERT

* (1:26:55) Note/Correction: The “Atlas of Meaning” Prof. Wattenberg mentions is called Context Atlas

* (1:28:20) Language model tasks and internal representations/geometry

* (1:29:30) LLMs as “next word predictors” — explaining systems to people

* (1:31:15) The Shape of Song

* (1:31:55) What does music look like?

* (1:35:00) Levels of abstraction, emergent complexity in music and language models

* (1:37:00) What Prof. Wattenberg hopes to see in ML and interaction design

* (1:41:18) Outro

Links:

* Professor Wattenberg’s homepage and Twitter

* Harvard SEAS application info — Professor Wattenberg is recruiting students!

* Research

* Earlier work

* A Fuzzy Commitment Scheme

* Stacked Graphs—Geometry & Aesthetics

* A Multi-Scale Model of Perceptual Organization in Information Graphics

* Conversation Thumbnails for Large-Scale Discussions

* Baby Names and Social Data Analysis

* History Flow (paper)

* At Harvard and Google / PAIR

* Tools for Model Understanding: Facets, SmoothGrad, Attacking discrimination with smarter ML

* TensorFlow.js

* Visualizing translation

* TCAV

* Other ML papers:

* The System Model and the User Model: Exploring AI Dashboard Design (recent speculative essay)

* Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

* Visualizing and Measuring the Geometry of BERT

* Artwork

* The Shape of Song

Laurence Liew: AI Singapore

9 November 2023 | 50 min

Michael Levin & Adam Goldstein: Intelligence and its Many Scales

2 November 2023 | 57 min

Jonathan Frankle: From Lottery Tickets to LLMs

26 October 2023 | 68 min

In episode 96 of The Gradient Podcast, Daniel Bashir speaks to Jonathan Frankle.

Jonathan is the Chief Scientist at MosaicML and (as of release). Jonathan completed his PhD at MIT, where he investigated the properties of sparse neural networks that allow them to train effectively through his lottery ticket hypothesis. He also spends a portion of his time working on technology policy, and currently works with the OECD to implement the AI principles he helped develop in 2019.

Outline:

* (00:00) Intro

* (02:35) Jonathan’s background and work

* (04:25) Origins of the Lottery Ticket Hypothesis

* (06:00) Jonathan’s empiricism and approach to science

* (08:25) More Karl Popper discourse + hot takes

* (09:45) Walkthrough of the Lottery Ticket Hypothesis

* (12:00) Issues with the Lottery Ticket Hypothesis as a statement

* (12:30) Jonathan’s advice for PhD students, on asking good questions

* (15:55) Strengths and Promise of the Lottery Ticket Hypothesis

* (18:55) More Lottery Ticket Hypothesis Papers

* (19:10) Comparing Rewinding and Fine-tuning

* (23:00) Care in making experimental choices

* (25:05) Linear Mode Connectivity and the Lottery Ticket Hypothesis

* (27:50) On what is being measured and how

* (28:50) “The outcome of optimization is determined to a linearly connected region”

* (31:15) On good metrics

* (32:54) On the Predictability of Pruning Across Scales — scaling laws for pruning

* (34:40) The paper’s takeaway

* (38:45) Pruning Neural Networks at Initialization — on a scientific disagreement

* (45:00) On making takedown papers useful

* (46:15) On what can be known early in training

* (49:15) Jonathan’s perspective on important research questions today

* (54:40) MosaicML

* (55:19) How Mosaic got started

* (56:17) Mosaic highlights

* (57:33) Customer stories

* (1:00:30) Jonathan’s work and perspectives on AI policy

* (1:05:45) The key question: what we want

* (1:07:35) Outro

Links:

* Jonathan’s homepage and Twitter

* Papers

* The Lottery Ticket Hypothesis and follow-up work

* Comparing Rewinding and Fine-tuning in Neural Network Pruning

* Linear Mode Connectivity and the LTH

* On the Predictability of Pruning Across Scales

* Pruning Neural Networks at Initialization: Why Are We Missing The Mark?

* Desirable Inefficiency

Nao Tokui: "Surfing" Musical Creativity with AI

19 October 2023 | 62 min

In episode 95 of The Gradient Podcast, Daniel Bashir speaks to Nao Tokui.

Nao Tokui is an artist/DJ and researcher based in Tokyo. While pursuing his Ph.D. at The University of Tokyo, he produced his first music album and singles using AI, including a 12-inch record with Nujabes, a legendary Japanese hip-hop producer. After completing his Ph.D. research, he founded Qosmo, AI Creativity and Music Lab, in 2009. Since then, he has been actively working at the intersection of AI technology and art. Nao and his team's works have been exhibited at renowned venues such as the New York MoMA and the Barbican Centre in London. Their performances have also been showcased at various music festivals, including MUTEK and Sonar. Additionally, he is leading the development of AI-based music instruments at his newly founded company, Neutone. In 2021, Nao received the Okawa Publishing Award for his Japanese book on art, creativity, and AI. The book is scheduled to be released in English as "Surfing human creativity with AI — A user's guide" in November 2023.

Outline:

* (00:00) Intro

* (02:15) Nao’s background and how he got into AI and music

* (05:10) Nao’s experiences as a DJ, collaboration with Nujabes

* (07:10) HCI and music

* (10:35) Leveraging the difference between AI systems and humans

* (12:40) Total control vs total chaos

* (13:45) Qosmo and the Neutone Project, misusable AI tools

* (17:25) On music and “creating something new”

* (21:00) Declarative and top-down vs. bottom-up creation, individual taste

* (23:50) How generative AI enables humans

* (26:25) On misusing technology and art

* (32:00) Dawn Patrol EP

* (36:00) A two-discriminator GAN for creating music in new genres

* (37:45) The AI DJ Project

* (38:20) The interactive vision of the project

* (42:10) How AI chooses music, breaking from constraints

* (43:15) Interpretability and how an AI system DJs differently

* (45:15) How the project altered Nao’s perspective on DJing, the role of humans

* (51:40) Nao’s book Creating with AI

* (55:15) Human-AI interaction as joint improvisation

* (58:10) Nao’s advice and takeaways for thinking about AI creatively

* (1:01:32) Outro

Links:

* Nao’s homepage and Twitter

* Other links:

* Neutone, AI audio plugin

* Real-time AI-generative DJ performance

* Qosmo

* Dawn Patrol EP

* Nao’s book: Surfing human creativity with AI — A user's guide

* Paper on Creative-GAN for deviating from existing music genres

Divyansh Kaushik: The Realities of AI Policy

12 October 2023 | 78 min

In episode 94 of The Gradient Podcast, Daniel Bashir speaks to Divyansh Kaushik.

Divyansh is the Associate Director for Emerging Technologies and National Security at the Federation of American Scientists where his focus areas include, amongst other things, AI policy, STEM immigration, and US-China strategic competition. He holds a PhD from Carnegie Mellon University, where he focused on designing reliable AI systems that align with human values. In addition to his advocacy work on Capitol Hill, he also played a key role in establishing the Congressional Graduate Research and Development Caucus. He is a frequent contributor to leading publications, including Vox, National Defense Magazine, The Dispatch, Daily Caller, and Forbes.

Outline:

* (00:00) Intro

* (02:20) Divyansh intro/background

* (06:00) Zachary Lipton Appreciation Session (+ advice from Prof. Lipton)

* (08:00) How Divyansh got involved in policy

* (11:30) What does policy work look like? Divyansh’s early experiences

* (15:42) AI policy issues, divides, party lines

* (19:15) Bringing AI talent into the US

* (26:45) US/China saber rattling, impact of Xi Jinping’s presidency

* (33:49) China’s AI regulations, CCP motivations, China’s disadvantages in AI and benefits of the US policy process

* (42:42) Trading off AI governance and stifling innovation

* (51:17) AI governance comments from Jeremy Howard / Connor Leahy / Andrew Maynard, regulating use vs basic technology, limits on scaling

* (1:01:30) Articulating and communicating the issues for AI governance

* (1:03:10) Existential risk concerns in AI governance, theories of change

* (1:10:15) How can AI researchers/practitioners better communicate with policymakers?

* (1:16:57) Outro

Links:

* Divyansh’s Twitter and FAS page

* Divyansh’s policy work:

* The impact of international scientists, engineers, and students on US research outputs and global competitiveness

* How Congress can shape AI governance without stifling innovation

* How Do OpenAI’s Efforts To Make GPT-4 “Safer” Stack Up Against The NIST AI Risk Management Framework?

* Six Policy Ideas for the National AI Strategy

* Other work mentioned/discussed:

* Jeremy Howard’s AI Safety and the Age of Dislightenment

* Proposals from Connor Leahy

* Andrew Maynard’s Regulating Frontier AI: To Open Source or Not?

Tal Linzen: Psycholinguistics and Language Modeling

5 October 2023 | 75 min

In episode 93 of The Gradient Podcast, Daniel Bashir speaks to Professor Tal Linzen.

Professor Linzen is an Associate Professor of Linguistics and Data Science at New York University and a Research Scientist at Google. He directs the Computation and Psycholinguistics Lab, where he and his collaborators use behavioral experiments and computational methods to study how people learn and understand language. They also develop methods for evaluating, understanding, and improving computational systems for language processing.

Outline:

* (00:00) Intro

* (02:25) Prof. Linzen’s background

* (05:37) Back and forth between psycholinguistics and deep learning research, LM evaluation

* (08:40) How can deep learning successes/failures help us understand human language use, methodological concerns, comparing human representations to LM representations

* (14:22) Behavioral capacities and degrees of freedom in representations

* (16:40) How LMs are becoming less and less like humans

* (19:25) Assessing LSTMs’ ability to learn syntax-sensitive dependencies

* (22:48) Similarities between structure-sensitive dependencies, sophistication of syntactic representations

* (25:30) RNNs implicitly implement tensor-product representations—vector representations of symbolic structures

* (29:45) Representations required to solve certain tasks, difficulty of natural language

* (33:25) Accelerating progress towards human-like linguistic generalization

* (34:30) The pre-training agnostic identically distributed evaluation paradigm

* (39:50) Ways to mitigate differences in evaluation

* (44:20) Surprisal does not explain syntactic disambiguation difficulty

* (45:00) How to measure processing difficulty, predictability and processing difficulty

* (49:20) What other factors influence processing difficulty?

* (53:10) How to plant trees in language models

* (55:45) Architectural influences on generalizing knowledge of linguistic structure

* (58:20) “Cognitively relevant regimes” and speed of generalization

* (1:00:45) Acquisition of syntax and sampling simpler vs. more complex sentences

* (1:04:03) Curriculum learning for progressively more complicated syntax

* (1:05:35) Hypothesizing tree-structured representations

* (1:08:00) Reflecting on a prediction from the past

* (1:10:15) Goals and “the correct direction” in AI research

* (1:14:04) Outro

Links:

* Prof. Linzen’s Twitter and homepage

* Papers

* Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies

* RNNs Implicitly Implement Tensor-Product Representations

* How Can We Accelerate Progress Towards Human-like Linguistic Generalization?

* Surprisal does not explain syntactic disambiguation difficulty: evidence from a large-scale benchmark

* How to Plant Trees in LMs: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases

Kevin K. Yang: Engineering Proteins with ML

28 September 2023 | 60 min

Arjun Ramani & Zhengdong Wang: Why Transformative AI is Really, Really Hard to Achieve

21 September 2023 | 110 min

In episode 91 of The Gradient Podcast, Daniel Bashir speaks to Arjun Ramani and Zhengdong Wang.

Arjun is the global business and economics correspondent at The Economist.

Zhengdong is a research engineer at Google DeepMind.

Outline:

* (00:00) Intro

* (03:53) Arjun intro

* (06:04) Zhengdong intro

* (09:50) How Arjun and Zhengdong met in the woods

* (11:52) Overarching narratives about technological progress and AI

* (14:20) Setting up the claim: Arjun on what “transformative” means

* (15:52) What enables transformative economic growth?

* (21:19) From GPT-3 to ChatGPT; is there something special about AI?

* (24:15) Zhengdong on “real AI” and divisiveness

* (27:00) Arjun on the independence of bottlenecks to progress/growth

* (29:05) Zhengdong on bottleneck independence

* (32:45) More examples on bottlenecks and surplus wealth

* (37:06) Technical arguments—what are the hardest problems in AI?

* (38:00) Robotics

* (40:41) Challenges of deployment in high-stakes settings and data sources / synthetic data, self-driving

* (45:13) When synthetic data works

* (49:06) Harder tasks, process knowledge

* (51:45) Performance art as a critical bottleneck

* (53:45) Obligatory Taylor Swift Discourse

* (54:45) AI Taylor Swift???

* (54:50) The social arguments

* (55:20) Speed of technology diffusion — “diffusion lags” and dynamics of trust with AI

* (1:00:55) ChatGPT adoption, where major productivity gains come from

* (1:03:50) Timescales of transformation

* (1:10:22) Unpredictability in human affairs

* (1:14:07) The economic arguments

* (1:14:35) Key themes — diffusion lags, different sectors

* (1:21:15) More on bottlenecks, AI trust, premiums on human workers

* (1:22:30) Automated systems and human interaction

* (1:25:45) Campaign text reachouts

* (1:30:00) Counterarguments

* (1:30:18) Solving intelligence and solving science/innovation

* (1:34:07) Strengths and weaknesses of the broad applicability of Arjun and Zhengdong’s argument

* (1:35:34) The “proves too much” worry — how could any innovation have ever happened?

* (1:37:25) Examples of bringing down barriers to innovation/transformation

* (1:43:45) What to do with all of this information?

* (1:48:45) Outro

Links:

* Zhengdong’s homepage and Twitter

* Arjun’s homepage and Twitter

* Why transformative artificial intelligence is really, really hard to achieve

* Other resources and links mentioned:

* Allyn-Feuer and Sanders: Transformative AGI by 2043 is <1% likely

* On AlphaStar Zero

* Hardmaru on AI as applied philosophy

* Robotics Transformer 2

* Davis Blalock on synthetic data

* Matt Clancy on automating invention and bottlenecks

* Michael Webb on 80,000 Hours Podcast

* Bob Gordon: The Rise and Fall of American Growth

* OpenAI economic impact paper

* David Autor: new work paper

* Baumol effect paper

* Pew Research Center poll on public concern about AI

* Human premium Economist piece

* Callum Williams — London tube and AI/jobs

* Culture Series book 1, Iain Banks

Get full access to The Gradient at thegradientpub.substack.com/subscribe

Miles Grimshaw: Benchmark, LangChain, and Investing in AI

14 September 2023 | 61 min

Shreya Shankar: Machine Learning in the Real World

7 September 2023 | 77 min

In episode 89 of The Gradient Podcast, Daniel Bashir speaks to Shreya Shankar.

Shreya is a computer scientist pursuing her PhD in databases at UC Berkeley. Her research interest is in building end-to-end systems for people to develop production-grade machine learning applications. She was previously the first ML engineer at Viaduct, did research at Google Brain, and worked in software engineering at Facebook. She graduated from Stanford with a B.S. and M.S. in computer science with concentrations in systems and artificial intelligence. At Stanford, she helped run SHE++, an organization that helps empower underrepresented minorities in technology.

Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at [email protected]

Outline:

* (00:00) Intro

* (02:22) Shreya’s background and journey into ML / MLOps

* (04:51) ML advances in 2013-2016

* (05:45) Shift in Stanford undergrad class ecosystems, accessibility of deep learning research

* (09:10) Why Shreya left her job as an ML engineer

* (13:30) How Shreya became interested in databases, data quality in ML

* (14:50) Daniel complains about things

* (16:00) What makes ML engineering uniquely difficult

* (16:50) Being a “historian of the craft” of ML engineering

* (22:25) Levels of abstraction, what ML engineers do/don’t have to think about

* (24:16) Observability for Production ML Pipelines

* (28:30) Metrics for real-time ML systems

* (31:20) Proposed solutions

* (34:00) Moving Fast with Broken Data

* (34:25) Existing data validation measures and where they fall short

* (36:31) Partition summarization for data validation

* (38:30) Small data and quantitative statistics for data cleaning

* (40:25) Streaming ML Evaluation

* (40:45) What makes a metric actionable

* (42:15) Differences in streaming ML vs. batch ML

* (45:45) Delayed and incomplete labels

* (49:23) Operationalizing Machine Learning

* (49:55) The difficult life of an ML engineer

* (53:00) Best practices, tools, pain points

* (55:56) Pitfalls in current MLOps tools

* (1:00:30) LLMOps / FMOps

* (1:07:10) Thoughts on ML Engineering, MLE through the lens of data engineering

* (1:10:42) Building products, user expectations for AI products

* (1:15:50) Outro

Links:

* Papers

* Towards Observability for Production Machine Learning Pipelines

* Rethinking Streaming ML Evaluation

* Operationalizing Machine Learning

* Moving Fast With Broken Data

* Blog posts

* The Modern ML Monitoring Mess

* Thoughts on ML Engineering After a Year of my PhD

Stevan Harnad: AI's Symbol Grounding Problem

31 August 2023 | 118 min

In episode 88 of The Gradient Podcast, Daniel Bashir speaks to Professor Stevan Harnad.

Stevan Harnad is professor of psychology and cognitive science at Université du Québec à Montréal, adjunct professor of cognitive science at McGill University, and professor emeritus of cognitive science at the University of Southampton. His research is on category learning, categorical perception, symbol grounding, the evolution of language, and animal and human sentience (otherwise known as “consciousness”). He is also an advocate for open access and an activist for animal rights.

Outline:

* (00:00) Intro

* (05:20) Professor Harnad’s background: interests in cognitive psychobiology, editing Behavioral and Brain Sciences

* (07:40) John Searle submits the Chinese Room article

* (09:20) Early reactions to Searle and Prof. Harnad’s role

* (13:38) The core of Searle’s argument and the generator of the Symbol Grounding Problem, “strong AI”

* (19:00) Ways to ground symbols

* (20:26) The acquisition of categories

* (25:00) Pantomiming, non-linguistic category formation

* (27:45) Mathematics, abstraction, and grounding

* (36:20) Symbol manipulation and interpretation language

* (40:40) On the Whorf Hypothesis

* (48:39) Defining “grounding” and introducing the “T3” Turing Test

* (53:22) Turing’s concerns, AI and reverse-engineering cognition

* (59:25) Other Minds, T4 and zombies

* (1:05:48) Degrees of freedom in solutions to the Turing Test, the easy and hard problems of cognition

* (1:14:33) Over-interpretation of AI systems’ behavior, sentience concerns, T3 and evidence of sentience

* (1:24:35) Prof. Harnad’s commentary on claims in The Vector Grounding Problem

* (1:28:05) RLHF and grounding, LLMs’ (ungrounded) capabilities, syntactic structure and propositions

* (1:35:30) Multimodal AI systems (image-text and robotic) and grounding, compositionality

* (1:42:50) Chomsky’s Universal Grammar, LLMs and T2

* (1:50:55) T3 and cognitive simulation

* (1:57:34) Outro

Links:

* Professor Harnad’s webpage and skywritings

* Papers:

* Category Induction and Representation

* Categorical Perception

* From Sensorimotor Categories to Grounded Symbols

* Minds, machines and Searle 2

* The Latent Structure of Dictionaries

Terry Winograd: AI, HCI, Language, and Cognition

24 August 2023 | 93 min

In episode 87 of The Gradient Podcast, Daniel Bashir speaks to Professor Terry Winograd.

Professor Winograd is Professor Emeritus of Computer Science at Stanford University. His research focuses on human-computer interaction design and the design of technologies for development. He founded the Stanford Human-Computer Interaction Group, where he directed the teaching programs and HCI research. He is also a founding faculty member of the Stanford d.school and a founding member and past president of Computer Professionals for Social Responsibility.

Outline:

* (00:00) Intro

* (03:00) Professor Winograd’s background

* (05:10) At the MIT AI Lab

* (05:45) The atmosphere in the MIT AI Lab, Minsky/Chomsky debates

* (06:20) Blue-sky research, government funding for academic research

* (10:10) Isolation and collaboration between research groups

* (11:45) Phases in the development of ideas and how cross-disciplinary work fits in

* (12:26) SHRDLU and the MIT AI Lab’s intellectual roots

* (17:20) Early responses to SHRDLU: Minsky, Dreyfus, others

* (20:55) How Prof. Winograd’s thinking about AI’s abilities and limitations evolved

* (22:25) How this relates to current AI systems and discussions of intelligence

* (23:47) Repetitive debates in AI, semantics and grounding

* (27:00) The concept of investment, care, trust in human communication vs machine communication

* (28:53) Projecting human-ness onto AI systems and non-human things and what this means for society

* (31:30) Time after leaving MIT in 1973, time at Xerox PARC, how Winograd’s thinking evolved during this time

* (38:28) What Does It Mean to Understand Language? Speech acts, commitments, and the grounding of language

* (42:40) Reification of representations in science and ML

* (46:15) LLMs, their training processes, and their behavior

* (49:40) How do we coexist with systems that we don’t understand?

* (51:20) Progress narratives in AI and human agency

* (53:30) Transitioning to intelligence augmentation, founding the Stanford HCI group and d.school, advising Larry Page and Sergey Brin

* (1:01:25) Chatbots and how we consume information

* (1:06:52) Evolutions in journalism, progress in trust for modern AI systems

* (1:09:18) Shifts in the social contract, from institutions to personalities

* (1:12:05) AI and HCI in recent years

* (1:17:05) Philosophy of design and the d.school

* (1:21:20) Designing AI systems for people

* (1:25:10) Prof. Winograd’s perspective on watermarking for detecting GPT outputs

* (1:25:55) The politics of being a technologist

* (1:30:10) Echoes of the past in AI regulation and competition and learning from history

* (1:32:34) Outro

Links:

* Professor Winograd’s Homepage

* Papers/topics discussed:

* SHRDLU

* Beyond Programming Languages

* What Does It Mean to Understand Language?

* The PageRank Citation Ranking

* Stanford Digital Libraries project

* Talk: My Politics as a Technologist

Gil Strang: Linear Algebra and Deep Learning

17 August 2023 | 61 min

Anant Agarwal: AI for Education

10 August 2023 | 48 min

Raphaël Millière: The Vector Grounding Problem and Self-Consciousness

4 August 2023 | 125 min

Peli Grietzer: A Mathematized Philosophy of Literature

27 July 2023 | 154 min

Ryan Drapeau: Battling Fraud with ML at Stripe

20 July 2023 | 67 min

Shiv Rao: Enabling Better Patient Care with AI

13 July 2023 | 61 min

Hugo Larochelle: Deep Learning as Science

6 July 2023 | 108 min

Jeremie Harris: Realistic Alignment and AI Policy

29 June 2023 | 91 min

In episode 79 of The Gradient Podcast, Daniel Bashir speaks to Jeremie Harris.

Jeremie is co-founder of Gladstone AI, author of the book Quantum Physics Made Me Do It, and co-host of the Last Week in AI Podcast. Jeremie previously hosted the Towards Data Science podcast and worked on a number of other startups after leaving a PhD in physics.

Outline:

* (00:00) Intro

* (01:37) Jeremie’s physics background and transition to ML

* (05:19) The physicist-to-AI person pipeline, how Jeremie’s background impacts his approach to AI

* (08:20) A tangent on inflationism/deflationism about natural laws (I promise this applies to AI)

* (11:45) How ML implies a particular viewpoint on the above question

* (13:20) Jeremie’s first (recommendation systems) company, how startup founders can make mistakes even when they’ve read Paul Graham essays

* (17:30) Classic startup wisdom, different sorts of startups

* (19:35) OpenAI’s approach in shipping features for DALL-E 2 and generation vs. discrimination as an approach to product

* (24:55) Capabilities and risk

* (26:43) Commentary on fundamental limitations of alignment in LLMs

* (30:45) Intrinsic difficulties in alignment problems

* (41:15) Daniel tries to steel man / defend anti-longtermist arguments (nicely :) )

* (46:23) Anthropic’s paper on asking models to be less biased

* (47:20) Why Jeremie is excited about Anthropic’s Constitutional AI scheme

* (51:05) Jeremie’s thoughts on recent Eliezer discourse

* (56:50) Cheese / task vectors and steerability/controllability in LLMs

* (59:50) Difficulty of one-shot solutions in alignment work, better strategies

* (1:02:00) Lack of theoretical understanding of deep learning systems / alignment

* (1:04:50) Jeremie’s work and perspectives on AI policy

* (1:10:00) Incrementality in convincing policymakers

* (1:14:00) How recent developments impact policy efforts

* (1:16:20) Benefits and drawbacks of open source

* (1:19:30) Arguments in favor of (limited) open source

* (1:20:35) Quantum Physics (not Mechanics) Made Me Do It

* (1:24:10) Some theories of consciousness and corresponding physics

* (1:29:49) Outro

Links:

* Jeremie’s Twitter

* Quantum Physics Made Me Do It

* Gladstone AI

Antoine Blondeau: Alpha Intelligence Capital and Investing in AI

22 June 2023 | 60 min

Joon Park: Generative Agents and Human-Computer Interaction

15 June 2023 | 141 min

In episode 77 of The Gradient Podcast, Daniel Bashir speaks to Joon Park.

Joon is a third-year PhD student at Stanford, advised by Professors Michael Bernstein and Percy Liang. He designs, builds, and evaluates interactive systems that support new forms of human-computer interaction by leveraging state-of-the-art advances in natural language processing such as large language models. His research introduced the concept of, and the techniques for building, generative agents: computational software agents that simulate believable human behavior. Joon’s work has been supported by the Microsoft Research PhD Fellowship, the Stanford School of Engineering Fellowship, and the Siebel Scholarship.

Outline:

* (00:00) Intro

* (01:43) Joon’s path from studio art to social computing / AI

* (05:00) Joon’s perspectives on Human-Computer Interaction (HCI) and its recent evolution

* (06:45) How foundation models enter the picture

* (10:28) On slow algorithms and technology: A Slow Algorithm Improves Users’ Assessments of the Algorithm’s Accuracy

* (12:10) Motivations

* (17:55) The jellybean-counting task, hypotheses

* (22:00) Applications and takeaways

* (28:05) Deliberate engagement in social media / computing systems, incentives

* (32:55) Daniel rants about The Social Dilemma + anti-social media rhetoric, Joon on the role of academics, framings of addiction

* (39:05) Measuring the Prevalence of Anti-Social Behavior in Online Communities

* (48:30) Statistics on anti-social behavior and anecdotal information, limitations in the paper’s measurements

* (51:45) Participatory and value-sensitive design

* (52:50) “Interaction” in On the Opportunities and Risks of Foundation Models

* (53:45) Broader insights on foundation models and emergent behavior

* (56:50) Joon’s section on interaction

* (1:01:05) Daniel’s bad segue to Social Simulacra: Creating Populated Prototypes for Social Computing Systems

* (1:02:50) Context for Social Simulacra and Generative Agents, why Social Simulacra was tackled first

* (1:24:05) The value of norms

* (1:26:20) Collaborations between designers and developers of social simulacra

* (1:30:00) Generative Agents: Interactive Simulacra of Human Behavior

* (1:30:30) Context / intro

* (1:45:10) On (too much) coherence in generative agents and believability

* (1:52:02) Instruction tuning’s impact on generative agents, model alignment w/ believability goals, desirability of agent conflict / toxic LLMs

* (1:56:55) Release strategies and toxicity in LLMs

* (2:03:05) On designing interfaces and responsible use

* (2:09:05) Capability advances and the capability-safety research gap

* (2:14:12) Worries about LLM integration, human-centered framework for technology release / LLM incorporation

* (2:18:00) Joon’s philosophy as an HCI researcher

* (2:20:39) Outro

Links:

* Joon’s homepage and Twitter

* Research

* A Slow Algorithm Improves Users’ Assessments of the Algorithm’s Accuracy

* Measuring the Prevalence of Anti-Social Behavior in Online Communities

* On the Opportunities and Risks of Foundation Models

* Social Simulacra: Creating Populated Prototypes for Social Computing Systems

* Generative Agents: Interactive Simulacra of Human Behavior

Christoffer Holmgård: AI for Video Games

8 June 2023 | 69 min

Riley Goodside: The Art and Craft of Prompt Engineering

1 June 2023 | 60 min

In episode 75 of The Gradient Podcast, Daniel Bashir speaks to Riley Goodside.

Riley is a Staff Prompt Engineer at Scale AI. Riley began posting GPT-3 prompt examples and screenshot demonstrations in 2022. He previously worked as a data scientist at OkCupid, Grindr, and CopyAI.

Outline:

* (00:00) Intro

* (01:37) Riley’s journey to becoming the first Staff Prompt Engineer

* (02:00) Data science background in the online dating industry

* (02:15) Sabbatical + catching up on LLM progress

* (04:00) AI Dungeon and first taste of GPT-3

* (05:10) Developing on Codex, ideas about integrating Codex with Jupyter Notebooks, start of posting on Twitter

* (08:30) “LLM ethnography”

* (09:12) The history of prompt engineering: in-context learning, Reinforcement Learning from Human Feedback (RLHF)

* (10:20) Models used to be harder to talk to

* (10:45) The three eras

* (10:45) 1 - Pre-trained LM era—simple next-word predictors

* (12:54) 2 - Instruction tuning

* (16:13) 3 - RLHF and overcoming instruction tuning’s limitations

* (19:24) Prompting as subtractive sculpting, prompting and AI safety

* (21:17) Riley on RLHF and safety

* (24:55) Riley’s most interesting experiments and observations

* (25:50) Mode collapse in RLHF models

* (29:24) Prompting models with very long instructions

* (33:13) Explorations with regular expressions, chain-of-thought prompting styles

* (36:32) Theories of in-context learning and prompting, why certain prompts work well

* (42:20) Riley’s advice for writing better prompts

* (49:02) Debates over prompt engineering as a career, relevance of prompt engineers

* (58:55) Outro

Links:

* Riley’s Twitter and LinkedIn

* Talk: LLM Prompt Engineering and RLHF: History and Techniques

Talia Ringer: Formal Verification and Deep Learning

25 May 2023 | 106 min

In episode 74 of The Gradient Podcast, Daniel Bashir speaks to Professor Talia Ringer.

Professor Ringer is an Assistant Professor with the Programming Languages, Formal Methods, and Software Engineering group at the University of Illinois at Urbana-Champaign. Their research leverages proof engineering to allow programmers to more easily build formally verified software systems.

Outline:

* (00:00) Daniel’s long annoying intro

* (02:15) Origin Story

* (04:30) Why / when formal verification is important

* (06:40) Concerns about ChatGPT/AutoGPT et al failures, systems for accountability

* (08:20) Difficulties in making formal verification accessible

* (11:45) Tactics and interactive theorem provers, interface issues

* (13:25) How Prof Ringer’s research first crossed paths with ML

* (16:00) Concrete problems in proof automation

* (16:15) How ML can help people verifying software systems

* (20:05) Using LLMs for understanding / reasoning about code

* (23:05) Going from tests / formal properties to code

* (31:30) Is deep learning the right paradigm for dealing with relations for theorem proving?

* (36:50) Architectural innovations, neuro-symbolic systems

* (40:00) Hazy definitions in ML

* (41:50) Baldur: Proof Generation & Repair with LLMs

* (45:55) In-context learning’s effectiveness for LLM-based theorem proving

* (47:12) LLMs without fine-tuning for proofs

* (48:45) Something ~ surprising ~ about Baldur results (maybe clickbait or maybe not)

* (49:32) Asking models to construct proofs with restrictions, translating proofs to formal proofs

* (52:07) Methods of proofs and relative difficulties

* (57:45) Verifying / providing formal guarantees on ML systems

* (1:01:15) Verifying input-output behavior and basic considerations, nature of guarantees

* (1:05:20) Certified/verified systems vs certifying/verifying systems: getting LLMs to spit out proofs along with code

* (1:07:15) Interpretability and how much model internals matter, RLHF, mechanistic interpretability

* (1:13:50) Levels of verification for deploying ML systems, HCI problems

* (1:17:30) People (Talia) actually use Bard

* (1:20:00) Dual-use and “correct behavior”

* (1:24:30) Good uses of jailbreaking

* (1:26:30) Talia’s views on evil AI / AI safety concerns

* (1:32:00) Issues with talking about “intelligence,” assumptions about what “general intelligence” means

* (1:34:20) Difficulty in having grounded conversations about capabilities, transparency

* (1:39:20) Great quotation to steal for your next thinkpiece + intelligence as socially defined

* (1:42:45) Exciting research directions

* (1:44:48) Outro

Links:

* Talia’s Twitter and homepage

* Research

* Concrete Problems in Proof Automation

* Baldur: Whole-Proof Generation and Repair with LLMs

* Research ideas

Brigham Hyde: AI for Clinical Decision-Making

18 May 2023 | 42 min

Scott Aaronson: Against AI Doomerism

11 May 2023 | 70 min

In episode 72 of The Gradient Podcast, Daniel Bashir speaks to Professor Scott Aaronson.

Scott is the Schlumberger Centennial Chair of Computer Science at the University of Texas at Austin and director of its Quantum Information Center. His research interests focus on the capabilities and limits of quantum computers and computational complexity theory more broadly. He has recently been on leave to work at OpenAI, where he is researching theoretical foundations of AI safety.

Outline:

* (00:00) Intro

* (01:45) Scott’s background

* (02:50) Starting grad school in AI, transitioning to quantum computing and the AI / quantum computing intersection

* (05:30) Where quantum computers can give us exponential speedups, simulation overhead, Grover’s algorithm

* (10:50) Overselling of quantum computing applied to AI, Scott’s analysis on quantum machine learning

* (18:45) ML problems that involve quantum mechanics and Scott’s work

* (21:50) Scott’s recent work at OpenAI

* (22:30) Why Scott was skeptical of AI alignment work early on

* (26:30) Unexpected improvements in modern AI and Scott’s belief update

* (32:30) Preliminary Analysis of DALL-E 2 (Marcus & Davis)

* (34:15) Watermarking GPT outputs

* (41:00) Motivations for watermarking and language model detection

* (45:00) Ways around watermarking

* (46:40) Other aspects of Scott’s experience with OpenAI, theoretical problems

* (49:10) Thoughts on definitions for humanistic concepts in AI

* (58:45) Scott’s “reform AI alignment stance” and Eliezer Yudkowsky’s recent comments (+ Daniel pronounces Eliezer wrong), orthogonality thesis, cases for stopping scaling

* (1:08:45) Outro

Links:

* Scott’s blog

* AI-related work

* Quantum Machine Learning Algorithms: Read the Fine Print

* A very preliminary analysis of DALL-E 2 w/ Marcus and Davis

* New AI classifier for indicating AI-written text and Watermarking GPT Outputs

* Writing

* Should GPT exist?

* AI Safety Lecture

* Why I’m not terrified of AI

Ted Underwood: Machine Learning and the Literary Imagination

4 May 2023 | 104 min

In episode 71 of The Gradient Podcast, Daniel Bashir speaks to Ted Underwood.

Ted is a professor in the School of Information Sciences with an appointment in the Department of English at the University of Illinois at Urbana-Champaign. Trained in English literary history, he turned his research focus to applying machine learning to large digital collections. His work explores literary patterns that become visible across long timelines when we consider many works at once; it often involves correcting and enriching digital collections to make them more amenable to interesting literary research.

Outline:

* (00:00) Intro

* (01:42) Ted’s background / origin story

* (04:35) Context in interpreting statistics, “you need a model,” the need for data about human responses to literature and how that manifested in Ted’s work

* (07:25) The recognition that we can model literary prestige/genre because of ML

* (08:30) Distant reading and the import of statistics over large digital libraries

* (12:00) Literary prestige

* (12:45) How predictable is fiction? Scales of predictability in texts

* (13:55) Degrees of autocorrelation in biography and fiction and the structure of narrative, how LMs might offer more sophisticated analysis

* (15:15) Braided suspense / suspense at different scales of a story

* (17:05) The Literary Uses of High-Dimensional Space: how “big data” came to impact the humanities, skepticism from humanists and responses, what you can do with word count

* (20:50) Why we could use more time to digest statistical ML—how acceleration in AI advances might impact pedagogy

* (22:30) The value in explicit models

* (23:30) Poetic “revolutions” and literary prestige

* (25:53) Distant vs. close reading in poetry—follow-up work for “The Longue Durée”

* (28:20) Sophistication of NLP and approaching the human experience

* (29:20) What about poetry renders it prestigious?

* (32:20) Individualism/liberalism and evolution of poetic taste

* (33:20) Why there is resistance to quantitative approaches to literature

* (34:00) Fiction in other languages

* (37:33) The Life Cycles of Genres

* (38:00) The concept of “genre”

* (41:00) Inflationary/deflationary views on natural kinds and genre

* (44:20) Genre as a social and not a linguistic phenomenon

* (46:10) Will causal models impact the humanities?

* (48:30) (Ir)reducibility of cultural influences on authors

* (50:00) Machine Learning and Human Perspective

* (50:20) Fluent and perspectival categories—Miriam Posner on “the radical, unrealized potential of digital humanities.”

* (52:52) How ML’s vices can become virtues for humanists

* (56:05) Can We Map Culture? and The Historical Significance of Textual Distances

* (56:50) Are cultures and other social phenomena related to one another in a way we can “map”?

* (59:00) Is cultural distance Euclidean?

* (59:45) The KL Divergence’s use for humanists

* (1:03:32) We don’t already understand the broad outlines of literary history

* (1:06:55) Science Fiction Hasn’t Prepared us to Imagine Machine Learning

* (1:08:45) The latent space of language and what intelligence could mean

* (1:09:30) LLMs as models of culture

* (1:10:00) What it is to be a human in “the age of AI” and Ezra Klein’s framing

* (1:12:45) Mapping the Latent Spaces of Culture

* (1:13:10) Ted on Stochastic Parrots

* (1:15:55) The risk of AI enabling hermetically sealed cultures

* (1:17:55) “Postcards from an unmapped latent space,” more on AI systems’ limitations as virtues

* (1:20:40) Obligatory GPT-4 section

* (1:21:00) Using GPT-4 to estimate passage of time in fiction

* (1:23:39) Is deep learning more interpretable than statistical NLP?

* (1:25:17) The “self-reports” of language models: should we trust them?

* (1:26:50) University dependence on tech giants, open-source models

* (1:31:55) Reclaiming Ground for the Humanities

* (1:32:25) What scientists, alone, can contribute to the humanities

* (1:34:45) On the future of the humanities

* (1:35:55) How computing can enable humanists as humanists

* (1:37:05) Human self-understanding as a collaborative project

* (1:39:30) Is anything ineffable? On what AI systems can “grasp”

* (1:43:12) Outro

Links:

* Ted’s blog and Twitter

* Research

* The literary uses of high-dimensional space

* The Longue Durée of literary prestige

* The Historical Significance of Textual Distances

* Machine Learning and Human Perspective

* The life cycles of genres

* Can We Map Culture?

* Cohort Succession Explains Most Change in Literary Culture

* Other Writing

* Reclaiming Ground for the Humanities

* We don’t already understand the broad outlines of literary history

* Science fiction hasn’t prepared us to imagine machine learning.

* How predictable is fiction?

* Mapping the latent spaces of culture

* Using GPT-4 to measure the passage of time in fiction

Irene Solaiman: AI Policy and Social Impact

27 April 2023 | 72 min

In episode 70 of The Gradient Podcast, Daniel Bashir speaks to Irene Solaiman.

Irene is an expert in AI safety and policy and the Policy Director at Hugging Face, where she conducts social impact research and develops public policy. In her former role at OpenAI, she initiated and led bias and social impact research in addition to leading public policy. She built AI policy at Zillow Group and advised policymakers on responsible autonomous decision-making and privacy as a fellow at Harvard’s Berkman Klein Center.

Outline:

* (00:00) Intro

* (02:00) Intro to Irene and her work

* (03:45) What tech people need to learn about policy, and vice versa

* (06:35) Societal impact—words and reality, Irene’s experience

* (08:30) OpenAI work on GPT-2 and release strategies (yes, this was recorded on Pi Day)

* (11:00) Open-source proponents and release

* (14:00) What does a multidisciplinary approach to working on AI look like?

* (16:30) Thinking about end users and enabling contributors with different sets of expertise

* (18:00) “Preparing for AGI” and current approaches to release

* (21:00) Who constitutes a researcher? What constitutes safety and who gets resourced? Limitations in red-teaming potentially dangerous systems.

* (22:35) PALMS and Values-Targeted Datasets

* (25:52) PALMS and RLHF

* (27:00) Homogenization in foundation models, cultural contexts

* (29:45) Anthropic’s moral self-correction paper and Irene’s concerns about marketing “de-biasing” and oversimplification

* (31:50) Data work, human systemic problems → AI bias

* (33:55) Why do language models get more toxic as they get larger? (if you have ideas, let us know!)

* (35:45) The gradient of generative AI release, Irene’s experience with the open-source world, tradeoffs along the release gradient

* (38:40) More on Irene’s orientation towards release

* (39:40) Pragmatics of keeping models closed, dealing with open-source by force

* (42:22) Norm setting for release and use, normalization of documentation on social impacts

* (46:30) Race dynamics :(

* (49:45) Resource allocation and advances in ethics/policy, conversations on integrity and disinformation

* (53:10) Organizational goals, balancing technical research with policy work

* (58:10) Thoughts on governments’ AI policies, impact of structural assumptions

* (1:04:00) Approaches to AI-generated sexual content, need for more voices represented in conversations about AI

* (1:08:25) Irene’s suggestions for AI practitioners / technologists

* (1:11:24) Outro

Links:

* Irene’s homepage and Twitter

* Papers

* Release Strategies and the Social Impacts of Language Models

* Hugh Zhang’s open letter in The Gradient from 2019

* Process for Adapting Large Models to Society (PALMS) with Values-Targeted Datasets

* The Gradient of Generative AI Release: Methods and Considerations

Drago Anguelov: Waymo and Autonomous Vehicles

20 April 2023 | 65 min

In episode 69 of The Gradient Podcast, Daniel Bashir speaks to Drago Anguelov.

Drago is currently a Distinguished Scientist and Head of Research at Waymo, which he joined in 2018. Earlier, he spent eight years at Google working on 3D vision and pose estimation for Street View, then leading a research team that developed computer vision systems for annotating Google Photos. He has been involved in developing popular neural network methods such as the Inception architecture and the SSD detector. Before joining Waymo, he also led the 3D perception team at Zoox.

Outline:

* (00:00) Intro

* (02:04) Drago’s background in AI and self-driving, work with Daphne Koller + Sebastian Thrun, computer vision / pose estimation

* (14:20) One- and two-stage object detectors

* (15:15) Early experiences and thoughts on self-driving and its prospects

* (21:00) An introduction to the “self-driving stack”: mapping & localization, perception, behavior modeling & planning, simulation

* (29:25) From Stuart Russell’s comments on early Waymo’s “old-fashioned” approach

* (37:34) Scaling 3D Detection: challenges and architectural innovations

* (43:20) Behavior modeling: making decisions and modeling interactions in multi-agent environments

* (52:42) Distributional RL (+ imitation learning) in self-driving?

* (54:10) The Waymo Open Dataset

* (1:01:48) Looking forward in self-driving

* (1:04:36) Outro

Links:

* Drago’s LinkedIn and Twitter

* Research

* SSD: Single-Shot Multibox Detector

* SCAPE: Shape completion and animation of people

* Behavior Models for Autonomous Driving

* Wayformer

* Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation

* Imitation Is Not Enough

* Scaling 3D Detection to the Long Tail

Joanna Bryson: The Problems of Cognition

13 April 2023 | 73 min

Daniel Situnayake: AI on the Edge

6 April 2023 | 118 min

Soumith Chintala: PyTorch

30 March 2023 | 68 min

In episode 66 of The Gradient Podcast, Daniel Bashir speaks to Soumith Chintala.

Soumith is a Research Engineer at Meta AI Research in NYC. He is the co-creator and lead of PyTorch, and maintains a number of other open-source ML projects including Torch-7 and EBLearn. Soumith has previously worked on robotics, object and human detection, generative modeling, AI for video games, and ML systems research.

Outline:

* (00:00) Intro

* (01:30) Soumith’s intro to AI, journey to PyTorch

* (05:00) State of computer vision early in Soumith’s career

* (09:15) Institutional inertia and sunk costs in academia, identifying fads

* (12:45) How Soumith started working on GANs, frustrations

* (17:45) State of ML frameworks early in the deep learning era, differentiators

* (23:50) Frameworks and leveling the playing field, exceptions

* (25:00) Contributing to Torch and evolution into PyTorch

* (29:15) Soumith’s product vision for ML frameworks

* (32:30) From product vision to concrete features in PyTorch

* (39:15) Progressive disclosure of complexity (Chollet) in PyTorch

* (41:35) Building an open source community

* (43:25) The different players in today’s ML framework ecosystem

* (49:35) ML frameworks pioneered by Yann LeCun and Léon Bottou, their influences on PyTorch

* (54:37) PyTorch 2.0 and looking to the future

* (58:00) Soumith’s adventures in household robotics

* (1:03:25) Advice for aspiring ML practitioners

* (1:07:10) Be cool like Soumith and subscribe :)

* (1:07:33) Outro

Links:

* Soumith’s Twitter and homepage

* Papers

* Convolutional Neural Networks Applied to House Numbers Digit Classification

* GANs: LAPGAN, DCGAN, Wasserstein GAN

* Automatic differentiation in PyTorch

* PyTorch: An Imperative Style, High-Performance Deep Learning Library

Sewon Min: The Science of Natural Language

23 March 2023 | 103 min

In episode 65 of The Gradient Podcast, Daniel Bashir speaks to Sewon Min.

Sewon is a fifth-year PhD student in the NLP group at the University of Washington, advised by Hannaneh Hajishirzi and Luke Zettlemoyer. She is a part-time visiting researcher at Meta AI and a recipient of the JP Morgan PhD Fellowship. She has previously spent time at Google Research and Salesforce Research.

Outline:

* (00:00) Intro

* (03:00) Origin Story

* (04:20) Evolution of Sewon’s interests, question-answering and practical NLP

* (07:00) Methodology concerns about benchmarks

* (07:30) Multi-hop reading comprehension

* (09:30) Do multi-hop QA benchmarks actually measure multi-hop reasoning?

* (12:00) How models can “cheat” multi-hop benchmarks

* (13:15) Explicit compositionality

* (16:05) Commonsense reasoning and background information

* (17:30) On constructing good benchmarks

* (18:40) AmbigQA and ambiguity

* (22:20) Types of ambiguity

* (24:20) Practical possibilities for models that can handle ambiguity

* (25:45) FaVIQ and fact-checking benchmarks

* (28:45) External knowledge

* (29:45) Fact verification and “complete understanding of evidence”

* (31:30) Do models do what we expect/intuit in reading comprehension?

* (34:40) Applications for fact-checking systems

* (36:40) Intro to in-context learning (ICL)

* (38:55) Example of an ICL demonstration

* (40:45) Rethinking the Role of Demonstrations and what matters for successful ICL

* (43:00) Evidence for a Bayesian inference perspective on ICL

* (45:00) ICL + gradient descent and what it means to “learn”

* (47:00) MetaICL and efficient ICL

* (49:30) Distance between tasks and MetaICL task transfer

* (53:00) Compositional tasks for language models, compositional generalization

* (55:00) The number and diversity of meta-training tasks

* (58:30) MetaICL and Bayesian inference

* (1:00:30) Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations

* (1:02:00) The copying effect

* (1:03:30) Copying effect for non-identical examples

* (1:06:00) More thoughts on ICL

* (1:08:00) Understanding Chain-of-Thought Prompting

* (1:11:30) Bayes strikes again

* (1:12:30) Intro to Sewon’s text retrieval research

* (1:15:30) Dense Passage Retrieval (DPR)

* (1:18:40) Similarity in QA and retrieval

* (1:20:00) Improvements for DPR

* (1:21:50) Nonparametric Masked Language Modeling (NPM)

* (1:24:30) Difficulties in training NPM and solutions

* (1:26:45) Follow-on work

* (1:29:00) Important fundamental limitations of language models

* (1:31:30) Sewon’s experience doing a PhD

* (1:34:00) Research challenges suited for academics

* (1:35:00) Joys and difficulties of the PhD

* (1:36:30) Sewon’s advice for aspiring PhDs

* (1:38:30) Incentives in academia, production of knowledge

* (1:41:50) Outro

Links:

* Sewon’s homepage and Twitter

* Papers

* Solving and re-thinking benchmarks

* Multi-hop Reading Comprehension through Question Decomposition and Rescoring / Compositional Questions Do Not Necessitate Multi-hop Reasoning

* AmbigQA: Answering Ambiguous Open-domain Questions

* FaVIQ: FAct Verification from Information-seeking Questions

* Language Modeling

* Rethinking the Role of Demonstrations

* MetaICL: Learning to Learn In Context

* Towards Understanding CoT Prompting

* Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations

* Text representation/retrieval

* Dense Passage Retrieval

* Nonparametric Masked Language Modeling

Richard Socher: Re-Imagining Search

16 March 2023 | 98 min

In episode 64 of The Gradient Podcast, Daniel Bashir speaks to Richard Socher.

Richard is founder and CEO of you.com, a new search engine that lets you personalize your search workflow and eschews tracking and invasive ads. Richard was previously Chief Scientist at Salesforce where he led work on fundamental and applied research, product incubation, CRM search, customer service automation and a cross-product AI platform. He was an adjunct professor at Stanford’s CS department as well as founder and CEO/CTO of MetaMind, which was acquired by Salesforce in 2016. He received his PhD from Stanford’s CS Department in 2014.

Outline:

* (00:00) Intro

* (02:20) Richard Socher origin story + time at MetaMind, Salesforce (AI Economist, CTRL, ProGen)

* (22:00) Why Richard advocated for deep learning in NLP

* (27:00) Richard’s perspective on language

* (32:20) Are physical grounding and language necessary for intelligence?

* (40:10) Frankfurtian b******t and language model utterances as truth

* (47:00) Lessons from Salesforce Research

* (53:00) Balancing fundamental research with product focus

* (57:30) The AI Economist + how should policymakers account for limitations?

* (1:04:50) you.com, the chatbot wars, and taking on search giants

* (1:13:50) Re-imagining the vision for and components of a search engine

* (1:18:00) The future of generative models in search and the internet

* (1:28:30) Richard’s advice for early-career technologists

* (1:37:00) Outro

Links:

* Richard’s Twitter

* YouChat by you.com

* Careers at you.com

* Papers mentioned

* Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions

* Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank

* Grounded Compositional Semantics for Finding and Describing Images with Sentences

* The AI Economist

* ProGen

* CTRL

Joe Edelman: Meaning-Aligned AI

9 March 2023 | 66 min

In episode 63 of The Gradient Podcast, Daniel Bashir speaks to Joe Edelman.

Joe developed the meaning-based organizational metrics at Couchsurfing.com, then co-founded the Center for Humane Technology with Tristan Harris, and coined the term “Time Well Spent” for a family of metrics adopted by teams at Facebook, Google, and Apple. Since then, he's worked on the philosophical underpinnings for new business metrics, design methods, and political movements. The central idea is to make people's sources of meaning explicit, so that how meaningful or meaningless things are can be rigorously accounted for. His previous career was in HCI and programming language design.

Outline:

* (00:00) Intro (yes Daniel is trying a new intro format)

* (01:30) Joe’s origin story

* (07:15) Revealed preferences and personal meaning, recommender systems

* (12:30) Is using revealed preferences necessary?

* (17:00) What are values and how do you detect them?

* (24:00) Figuring out what’s meaningful to us

* (28:45) The decline of spaces and togetherness

* (35:00) Individualism and economic/political theory, tensions between collectivism/individualism

* (41:00) What it looks like to build spaces, Habitat

* (47:15) Cognitive effects of social platforms

* (51:45) Atomized communication, re-imagining chat apps

* (55:50) Systems for social groups and medium independence

* (1:02:45) Spaces being built today

* (1:05:15) Joe is building research groups! Get in touch :)

* (1:05:40) Outro

Links:

* Joe's 80-minute lecture on techniques for rebuilding society on meaning (YouTube, transcript)

* The discord for Rebuilding Meaning—join if you'd like to help build ML models or metrics using the methods discussed

* Writing/papers mentioned:

* Tech products (that don’t cause depression and war)

* Values, Preferences, Meaningful Choice

* Social Programming Considered as a Habitat for Groups

* Is Anything Worth Maximizing

* Joe’s homepage, Twitter, and YouTube page

Ed Grefenstette: Language, Semantics, Cohere

2 March 2023 | 74 min

Ken Liu: What Science Fiction Can Teach Us

23 February 2023 | 123 min

In episode 61 of The Gradient Podcast, Daniel Bashir speaks to Ken Liu.

Ken is an author of speculative fiction. A winner of the Nebula, Hugo, and World Fantasy awards, he is the author of the silkpunk epic fantasy series The Dandelion Dynasty and the short story collections The Paper Menagerie and Other Stories and The Hidden Girl and Other Stories. Prior to writing full-time, Ken worked as a software engineer, corporate lawyer, and litigation consultant.

Outline:

* (00:00) Intro

* (02:00) How Ken Liu became Ken Liu: A Saga

* (03:10) Time in the tech industry, interest in symbolic machines

* (04:40) Determining what stories to write, (07:00) art as failed communication

* (07:55) Law as creating abstract machines, importance of successful communication, stories in law

* (13:45) Misconceptions about science fiction

* (18:30) How we’ve been misinformed about literature and stories in school, stories as expressing multivalent truths, Dickens on narration (29:00)

* (31:20) Stories as imposing structure on the world

* (35:25) Silkpunk as aesthetic and writing approach

* (39:30) If modernity is a translated experience, what is it translated from? Alternative sources for the American pageant

* (47:30) The value of silkpunk for technologists and building the future

* (52:40) The engineer as poet

* (59:00) Technology language as constructing societies, what it is to be a technologist

* (1:04:00) The technology of language

* (1:06:10) The Google Wordcraft Workshop and co-writing with LaMDA

* (1:14:10) Possibilities and limitations of LMs in creative writing

* (1:18:45) Ken’s short fiction

* (1:19:30) Short fiction as a medium

* (1:24:45) “The Perfect Match” (from The Paper Menagerie and Other Stories)

* (1:34:00) Possibilities for better recommender systems

* (1:39:35) “Real Artists” (from The Hidden Girl and Other Stories)

* (1:47:00) The scaling hypothesis and creativity

* (1:50:25) “The Gods Have Not Died in Vain” & Moore’s Proof epigraph (The Hidden Girl)

* (1:53:10) More of The Singularity Trilogy (The Hidden Girl)

* (1:58:00) The role of science fiction today and how technologists should engage with stories

* (2:01:53) Outro

Links:

* Ken’s homepage

* The Dandelion Dynasty Series: Speaking Bones is out in paperback

* Books/Stories/Projects Mentioned

* “Evaluative Soliloquies” in Google Wordcraft

* The Paper Menagerie and Other Stories

* The Hidden Girl and Other Stories

Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

16 February 2023 | 103 min

In episode 60 of The Gradient Podcast, Daniel Bashir speaks to Hattie Zhou.

Hattie is a PhD student at the Université de Montréal and Mila. Her research focuses on understanding how and why neural networks work, based on the belief that the performance of modern neural networks exceeds our understanding and that building more capable and trustworthy models requires bridging this gap. Prior to Mila, she spent time as a data scientist at Uber and did research with Uber AI Labs.

Outline:

* (00:00) Intro

* (01:55) Hattie’s Origin Story, Uber AI Labs, empirical theory and other sorts of research

* (10:00) Intro to the Lottery Ticket Hypothesis & Deconstructing Lottery Tickets

* (14:30) Lottery tickets as lucky initialization

* (17:00) Types of masking and the “masking is training” claim

* (24:00) Type-0 masks and weight evolution over long training trajectories

* (27:00) Can you identify good masks or training trajectories a priori?

* (29:00) The role of signs in neural net initialization

* (35:27) The Supermask

* (41:00) Masks to probe pretrained models and model steerability

* (47:40) Fortuitous Forgetting in Connectionist Networks

* (54:00) Relationships to other work (double descent, grokking, etc.)

* (1:01:00) The iterative training process in fortuitous forgetting, scale and value of exploring alternatives

* (1:03:35) In-Context Learning and Teaching Algorithmic Reasoning

* (1:09:00) Learning + algorithmic reasoning, prompting strategy

* (1:13:50) What’s happening with in-context learning?

* (1:14:00) Induction heads

* (1:17:00) ICL and gradient descent

* (1:22:00) Algorithmic prompting vs discovery

* (1:24:45) Future directions for algorithmic prompting

* (1:26:30) Interesting work from NeurIPS 2022

* (1:28:20) Hattie’s perspective on scientific questions people pay attention to, underrated problems

* (1:34:30) Hattie’s perspective on ML publishing culture

* (1:42:12) Outro

Links:

* Hattie’s homepage and Twitter

* Papers

* Deconstructing Lottery Tickets: Zeros, signs, and the Supermask

* Fortuitous Forgetting in Connectionist Networks

* Teaching Algorithmic Reasoning via In-context Learning

Kyunghyun Cho: Neural Machine Translation, Language, and Doing Good Science

9 February 2023 | 128 min

In episode 59 of The Gradient Podcast, Daniel Bashir speaks to Professor Kyunghyun Cho.

Professor Cho is an associate professor of computer science and data science at New York University and CIFAR Fellow of Learning in Machines & Brains. He is also a senior director of frontier research at the Prescient Design team within Genentech Research & Early Development. He was a research scientist at Facebook AI Research from 2017-2020 and a postdoctoral fellow at University of Montreal under the supervision of Prof. Yoshua Bengio after receiving his MSc and PhD degrees from Aalto University. He received the Samsung Ho-Am Prize in Engineering in 2021.

Outline:

* (00:00) Intro

* (02:15) How Professor Cho got into AI, going to Finland for a PhD

* (06:30) Accidental and non-accidental parts of Prof Cho’s journey, the role of timing in career trajectories

* (09:30) Prof Cho’s M.Sc. thesis on Restricted Boltzmann Machines

* (17:00) The state of autodiff at the time

* (20:00) Finding non-mainstream problems and examining limitations of mainstream approaches, anti-dogmatism, Yoshua Bengio appreciation

* (24:30) Detaching identity from work, scientific training

* (26:30) The rest of Prof Cho’s PhD, the first ICLR conference, working in Yoshua Bengio’s lab

* (34:00) Prof Cho’s isolation during his PhD and its impact on his work—transcending insecurity and working on unsexy problems

* (41:30) The importance of identifying important problems and developing an independent research program, ceiling on the number of important research problems

* (46:00) Working on Neural Machine Translation, Jointly Learning to Align and Translate

* (1:01:45) What RNNs and earlier NN architectures can still teach us, why transformers were successful

* (1:08:00) Science progresses gradually

* (1:09:00) Learning distributed representations of sentences, extending the distributional hypothesis

* (1:21:00) Difficulty and limitations in evaluation—directions of dynamic benchmarks, trainable evaluation metrics

* (1:29:30) Mixout and AdapterFusion: fine-tuning and intervening on pre-trained models, pre-training as initialization, destructive interference

* (1:39:00) Analyzing neural networks as reading tea leaves

* (1:44:45) Importance of healthy skepticism for scientists

* (1:45:30) Language-guided policies and grounding, vision-language navigation

* (1:55:30) Prof Cho’s reflections on 2022

* (2:00:00) Obligatory ChatGPT content

* (2:04:50) Finding balance

* (2:07:15) Outro

Links:

* Professor Cho’s homepage and Twitter

* Papers

* M.Sc. thesis and PhD thesis

* NMT and attention

* Properties of NMT

* Learning Phrase Representations

* Neural machine translation by jointly learning to align and translate

* More recent work

* Learning Distributed Representations of Sentences from Unlabelled Data

* Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

* Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes’ Rule

* AdapterFusion: Non-Destructive Task Composition for Transfer Learning

Steve Miller: Will AI Take Your Job? It's Not So Simple.

2 February 2023 | 70 min

Blair Attard-Frost: Canada’s AI strategy and the ethics of AI business practices

26 January 2023 | 58 min

Linus Lee: At the Boundary of Machine and Mind

19 January 2023 | 149 min

In episode 56 of The Gradient Podcast, Daniel Bashir speaks to Linus Lee.

Linus is an independent researcher interested in the future of knowledge representation and creative work aided by machine understanding of language. He builds interfaces and knowledge tools that expand the domain of thoughts we can think and qualia we can feel. Linus has been writing online since 2014 (his blog boasts half a million words) and has built well over 100 side projects. He has also spent time as a software engineer at Replit, Hack Club, and Spensa, and was most recently a Researcher in Residence at Betaworks in New York.

Outline:

* (00:00) Intro

* (02:00) Linus’s background and interests, vision-language models

* (07:45) Embodiment and limits for text-image

* (11:35) Ways of experiencing the world

* (16:55) Origins of the handle “thesephist”, languages

* (25:00) Math notation, reading papers

* (29:20) Operations on ideas

* (32:45) Overview of Linus’s research and current work

* (41:30) The Oak and Ink languages, programming languages

* (49:30) Personal search engines: Monocle and Reverie, what you can learn from personal data

* (55:55) Web browsers as mediums for thought

* (1:01:30) This AI Does Not Exist

* (1:03:05) Knowledge representation and notational intelligence

* Notation vs language

* (1:07:00) What notation can/should be

* (1:16:00) Inventing better notations and expanding human intelligence

* (1:23:30) Better interfaces between humans and LMs to provide precise control, inefficiency of prompt engineering

* (1:33:00) Inexpressible experiences

* (1:35:42) Linus’s current work using latent space models

* (1:40:00) Ideas as things you can hold

* (1:44:55) Neural nets and cognitive computing

* (1:49:30) Relation to Hardware Lottery and AI accelerators

* (1:53:00) Taylor Swift Appreciation Session, mastery and virtuosity

* (1:59:30) Mastery/virtuosity and interfaces / learning curves

* (2:03:30) Linus’s stories, the work of fiction

* (2:09:00) Linus’s thoughts on writing

* (2:14:20) A piece of writing should be focused

* (2:16:15) On proving yourself

* (2:28:00) Outro

Links:

* Linus’s Twitter and website

Suresh Venkatasubramanian: An AI Bill of Rights

12 January 2023 | 101 min

In episode 55 of The Gradient Podcast, Daniel Bashir speaks to Professor Suresh Venkatasubramanian.

Professor Venkatasubramanian is a Professor of Computer Science and Data Science at Brown University, where his research focuses on algorithmic fairness and the impact of automated decision-making systems in society. He recently served as Assistant Director for Science and Justice in the White House Office of Science and Technology Policy, where he co-authored the Blueprint for an AI Bill of Rights.

Outline:

* (00:00) Intro

* (02:25) Suresh’s journey into AI and policymaking

* (08:00) The complex graph of designing and deploying “fair” AI systems

* (09:50) The Algorithmic Lens

* (14:55) “Getting people into a room” isn’t enough

* (16:30) Failures of incorporation

* (21:10) Trans-disciplinary vs interdisciplinary, the limiting nature of “my lane” / “your lane” thinking, going beyond existing scientific and philosophical ideas

* (24:50) The trolley problem is annoying, its usefulness and limitations

* (25:30) Breaking the frame of a discussion, self-driving doesn’t fit into the parameters of the trolley problem

* (28:00) Acknowledging frames and their limitations

* (29:30) Social science’s inclination to critique, flaws and benefits of solutionism

* (30:30) Computer security as a model for thinking about algorithmic protections, the risk of failure in policy

* (33:20) Suresh’s work on recourse

* (38:00) Kantian autonomy and the value of recourse, non-Western takes and issues with individual benefit/harm as the most morally salient question

* (41:00) Community as a valuable entity and its implications for algorithmic governance, surveillance systems

* (43:50) How Suresh got involved in policymaking / the OSTP

* (46:50) Gathering insights for the AI Bill of Rights Blueprint

* (51:00) One thing the Bill did miss… Struggles with balancing specificity and vagueness in the Bill

* (54:20) Should “automated system” be defined in legislation? Suresh’s approach and issues with the EU AI Act

* (57:45) The danger of definitions, overlap with chess world controversies

* (59:10) Constructive vagueness in law, partially theorized agreements

* (1:02:15) Digital privacy and privacy fundamentalism, focus on breach of individual autonomy as the only harm vector

* (1:07:40) GDPR traps, the “legacy problem” with large companies and post-hoc regulation

* (1:09:30) Considerations for legislating explainability

* (1:12:10) Criticisms of the Blueprint and Suresh’s responses

* (1:25:55) The global picture, AI legislation outside the US, legislation as experiment

* (1:32:00) Tensions in entering policy as an academic and technologist

* (1:35:00) Technologists need to learn additional skills to impact policy

* (1:38:15) Suresh’s advice for technologists interested in public policy

* (1:41:20) Outro

Links:

* Suresh is on Mastodon @[email protected] (and also Twitter)

* Suresh’s blog

* Blueprint for an AI Bill of Rights

* Papers

* Fairness and abstraction in sociotechnical systems

* A comparative study of fairness-enhancing interventions in machine learning

* The Philosophical Basis of Algorithmic Recourse

* Runaway Feedback Loops in Predictive Policing

Pete Florence: Dense Visual Representations, NeRFs, and LLMs for Robotics

5 January 2023 | 75 min

Melanie Mitchell: Abstraction and Analogy in AI

15 December 2022 | 55 min

In episode 53 of The Gradient Podcast, Daniel Bashir speaks to Professor Melanie Mitchell.

Professor Mitchell is the Davis Professor at the Santa Fe Institute. Her research focuses on conceptual abstraction, analogy-making, and visual recognition in AI systems. She is the author or editor of six books and her work spans the fields of AI, cognitive science, and complex systems. Her latest book is Artificial Intelligence: A Guide for Thinking Humans.

Outline:

* (00:00) Intro

* (02:20) Melanie’s intro to AI

* (04:35) Melanie’s intellectual influences, AI debates over time

* (10:50) We don’t have the right metrics for empirical study in AI

* (15:00) Why AI is Harder than we Think: the four fallacies

* (20:50) Difficulties in understanding what’s difficult for machines vs humans

* (23:30) Roles for humanlike and non-humanlike intelligence

* (27:25) Whether “intelligence” is a useful word

* (31:55) Melanie’s thoughts on modern deep learning advances, brittleness

* (35:35) Abstraction, Analogies, and their role in AI

* (38:40) Concepts as analogical and what that means for cognition

* (41:25) Where does analogy bottom out

* (44:50) Cognitive science approaches to concepts

* (45:20) Understanding how to form and use concepts is one of the key problems in AI

* (46:10) Approaching abstraction and analogy, Melanie’s work / the Copycat architecture

* (49:50) Probabilistic program induction as a promising approach to intelligence

* (52:25) Melanie’s advice for aspiring AI researchers

* (54:40) Outro

Links:

* Melanie’s homepage and Twitter

* Papers

* Difficulties in AI, hype cycles

* Why AI is Harder than we think

* The Debate Over Understanding in AI’s Large Language Models

* What Does It Mean for AI to Understand?

* Abstraction, analogies, and reasoning

* Abstraction and Analogy-Making in Artificial Intelligence

* Evaluating understanding on conceptual abstraction benchmarks

Marc Bellemare: Distributional Reinforcement Learning

8 December 2022 | 72 min

In episode 52 of The Gradient Podcast, Daniel Bashir speaks to Professor Marc Bellemare.

Professor Bellemare leads the reinforcement learning efforts at Google Brain Montréal and is a core industry member at Mila, where he also holds the Canada CIFAR AI Chair. His PhD work, completed at the University of Alberta, proposed the use of Atari 2600 video games to benchmark progress in reinforcement learning (RL). He was a research scientist at DeepMind from 2013-2017, and his Arcade Learning Environment was very influential in DeepMind’s early RL research and remains one of the most widely used RL benchmarks today. More recently he collaborated with Loon to deploy deep reinforcement learning to navigate stratospheric balloons. His book on distributional reinforcement learning, published by MIT Press, will be available in Spring 2023.

Outline:

* (00:00) Intro

* (03:10) Marc’s intro to AI and RL

* (07:00) Cross-pollination of deep learning research and RL at McGill and UdeM

* (09:50) PhD work at U Alberta, continual learning, origins of the Arcade Learning Environment (ALE)

* (14:40) Challenges in the ALE, how the ALE drove RL research

* (23:10) Marc’s thoughts on the Avalon benchmark and what makes a good RL benchmark

* (28:00) Opinions on “Reward is Enough” and whether RL gets us to AGI

* (32:10) How Marc thinks about priors in learning, “reincarnating RL”

* (36:00) Distributional Reinforcement Learning and the problem of distribution estimation

* (43:00) GFlowNets and distributional RL

* (45:05) Contraction in RL and distributional RL, theory-practice gaps

* (52:45) Representation learning for RL

* (55:50) Structure of the value function space

* (1:00:00) Connections to open-endedness / evolutionary algorithms / curiosity

* (1:03:30) RL for stratospheric balloon navigation with Loon

* (1:07:30) New ideas for applying RL in the real world

* (1:10:15) Marc’s advice for young researchers

* (1:12:37) Outro

Links:

* Professor Bellemare’s Homepage

* Distributional Reinforcement Learning book

* Papers

    * The Arcade Learning Environment: An Evaluation Platform for General Agents

    * A Distributional Perspective on Reinforcement Learning

    * Distributional Reinforcement Learning with Quantile Regression

    * Distributional Reinforcement Learning with Linear Function Approximation

    * Autonomous navigation of stratospheric balloons using reinforcement learning

    * A Geometric Perspective on Optimal Representations for Reinforcement Learning

    * The Value Function Polytope in Reinforcement Learning


François Chollet: Keras and Measures of Intelligence

1 December 2022 | 89 min

Yoshua Bengio: The Past, Present, and Future of Deep Learning

21 November 2022 | 74 min

Happy episode 50! This week’s episode is being released on Monday to avoid Thanksgiving.

Have suggestions for future podcast guests (or other feedback)? Let us know here!

In episode 50 of The Gradient Podcast, Daniel Bashir speaks to Professor Yoshua Bengio.

Professor Bengio is a Full Professor at the Université de Montréal as well as Founder and Scientific Director of the MILA-Quebec AI Institute and the IVADO institute. Best known for his work in pioneering deep learning, Bengio was one of three awardees of the 2018 A.M. Turing Award along with Geoffrey Hinton and Yann LeCun. He is also the awardee of the prestigious Killam prize and, as of this year, the computer scientist with the highest h-index in the world.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:20) Journey into Deep Learning, PDP and Hinton

* (06:45) “Inspired by biology”

* (08:30) “Gradient Based Learning Applied to Document Recognition” and working with Yann LeCun

* (10:00) What Bengio learned from LeCun (and Larry Jackel) about being a research advisor

* (13:00) “Learning Long-Term Dependencies with Gradient Descent is Difficult,” why people don’t understand this paper well enough

* (18:15) Bengio’s work on word embeddings and the curse of dimensionality, “A Neural Probabilistic Language Model”

* (23:00) Adding more structure / inductive biases to LMs

* (24:00) The rise of deep learning and Bengio’s experience, “you have to be careful with inductive biases”

* (31:30) Bengio’s “Bayesian posture” in response to recent developments

* (40:00) Higher level cognition, Global Workspace Theory

* (45:00) Causality, actions as mediating distribution change

* (49:30) GFlowNets and RL

* (53:30) GFlowNets and actions that are not well-defined, combining with System II and modular, abstract ideas

* (56:50) GFlowNets and evolutionary methods

* (1:00:45) Bengio on Cartesian dualism

* (1:09:30) “When you are famous, it is hard to work on hard problems” (Richard Hamming) and Bengio’s response

* (1:11:10) Family background, art and its role in Bengio’s life

* (1:14:20) Outro

Links:

* Professor Bengio’s Homepage

* Papers

    * Gradient-based learning applied to document recognition

    * Learning Long-Term Dependencies with Gradient Descent is Difficult

    * The Consciousness Prior

    * Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation


Kanjun Qiu and Josh Albrecht: Generally Intelligent

17 November 2022 | 47 min

Nathan Benaich: The State of AI Report

10 November 2022 | 79 min

* Have suggestions for future podcast guests (or other feedback)? Let us know here!

* Want to write with us? Send a pitch using this form :)

In episode 48 of The Gradient Podcast, Daniel Bashir speaks to Nathan Benaich.

Nathan is Founder and General Partner at Air Street Capital, a venture capital (VC) firm focused on investing in AI-first technology and life sciences companies. Nathan runs a number of communities focused on AI including the Research and Applied AI Summit and leads Spinout.fyi to improve the creation of university spinouts. Together with investor Ian Hogarth, Nathan co-authors the State of AI Report.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:40) Nathan’s interests in AI, life sciences, investing

* (04:10) Biotech and tech-bio companies

* (08:00) Why Nathan went into VC

* (10:15) Air Street Capital’s focus, investing in AI at an early stage

* (14:30) Why Nathan believes in specialism over generalism in AI, balancing consumer-focused ML with serious technical work

* (17:30) The European startup ecosystem

* (19:30) Spinouts and inventions born in academia

* (23:35) Spinout.fyi and issues with the European model

* (27:50) In the UK, only 4% of private AI companies are spinouts

* (30:00) Solutions

* (32:55) Origins of the State of AI Report

* (35:00) Looking back on Nathan’s 2021 predictions: Anthropic and JAX

* (39:00) AI semiconductors and the difficult reality

* (42:45) Nathan’s perspectives on AI safety/alignment

* (46:00) Long-termism and debates, safety research as an input into improving capabilities

* (49:50) Decentralization and the commercialization of open-source AI (Stability AI, Eleuther AI, etc.)

* (53:00) Second-order applications of diffusion models—chemistry, small molecule design, genome editors

* (59:00) Semiconductor restrictions and geopolitics

* (1:03:45) This year’s State of AI predictions

* (1:04:30) Trouble in semiconductor startup land

* (1:08:40) Predictions for AGI startups

* (1:14:20) How regulation of AGI startups might look

* (1:16:40) Nathan’s advice for founders, investors, and researchers

* (1:19:00) Outro

Links:

* State of AI Report

* Air Street Capital

* Spinout.fyi

* Rewriting the European spinout playbook

* Other sources mentioned

    * Bridging the Gap: the case for an Incompletely Theorized Agreement on AI policy

    * Choking Off China’s Access to the Future of AI

    * China's New AI Governance Initiatives Shouldn't Be Ignored


Matt Sheehan: China's AI Strategy and Governance

3 November 2022 | 67 min

* Have suggestions for future podcast guests (or other feedback)? Let us know here!

* Want to write with us? Send a pitch using this form :)

In episode 47 of The Gradient Podcast, Daniel Bashir speaks to Matt Sheehan.

Matt is a fellow at the Carnegie Endowment for International Peace, where he researches global technology with a focus on China. His writing and research explore China’s AI ecosystem, the future of China’s technology policy, and technology’s role in China’s political economy. Matt has also written for Foreign Affairs and The Huffington Post, among other venues.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:28) Matt’s path to analyzing China’s AI governance

* (06:50) Matt’s experience understanding daily life in China and developing a bottom-up perspective

* (09:40) The development of government constraints in technology/AI in the US and China

* (12:40) Matt’s take on China’s priorities and motivations

* (17:00) How recent history influences China’s technology ambitions

* (17:30) Matt gives an overview of the Century of Humiliation

* (22:07) Adversarial perceptions, Xi Jinping’s brashness and its effect on discourse about International Relations, how this intersects with AI

* (24:40) Self-reliance and semiconductors. Was the recent chip ban the right move?

* (36:15) Matt’s question: could foundation models be trained on trailing edge chips if necessary? Limitations

* (38:30) Silicon Valley and China, The Transpacific Experiment and stories

* (46:17) 躺平 and how trends among youth in China interact with tech development, parallel trends in the US, work culture

* (51:05) China’s recent AI governance initiatives

* (56:25) Squaring China’s AI ethics stance with its use of AI

* (59:53) The US can learn from both Chinese and European regulators

* (1:02:03) How technologists should think about geopolitics and national tensions

* (1:05:43) Outro

Links:

* Matt’s Twitter

* China’s influences/ambitions

    * Beijing’s Industrial Internet Ambitions

    * Beijing’s Tech Ambitions: What Exactly Does It Want?

* US-China exchange and US responses

    * Who benefits from American AI research in China?

    * Two New Tech Bills Could Transform US Innovation

    * Fear of Chinese Competition Won’t Preserve US Tech Leadership

* China’s tech standards, government initiatives and regulation in AI

    * How US businesses view China’s growing influence in tech standards

    * Three takeaways from China’s new standards strategy

    * China’s new AI governance initiatives shouldn’t be ignored

* Semiconductors

    * Biden’s Unprecedented Semiconductor Bet (a new piece from Matt!)

    * Choking Off China’s Access to the Future of AI


Luis Voloch: AI and Biology

27 October 2022 | 44 min

Zachary Lipton: Where Machine Learning Falls Short

13 October 2022 | 101 min

Stuart Russell: The Foundations of Artificial Intelligence

6 October 2022 | 71 min

Have suggestions for future podcast guests (or other feedback)? Let us know here!

In episode 44 of The Gradient Podcast, Daniel Bashir speaks to Professor Stuart Russell.

Stuart Russell is a Professor of Computer Science and the Smith-Zadeh Professor in Engineering at UC Berkeley, as well as an Honorary Fellow at Wadham College, Oxford. Professor Russell is the co-author with Peter Norvig of Artificial Intelligence: A Modern Approach, probably the most popular AI textbook in history. He is the founder and head of Berkeley’s Center for Human-Compatible Artificial Intelligence and recently authored the book Human Compatible: Artificial Intelligence and the Problem of Control. He has also served as co-chair on the World Economic Forum’s Council on AI and Robotics.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (02:45) Stuart’s introduction to AI

* (05:50) The two most important questions

* (07:25) Historical perspectives during Stuart’s PhD, agents and learning

* (14:30) Rationality and Intelligence, Bounded Optimality

* (20:30) Stuart’s work on Metareasoning

* (29:45) How does Metareasoning fit with Bounded Optimality?

* (37:39) “Civilization advances by reducing complex operations to be trivial”

* (39:20) Reactions to the rise of Deep Learning, connectionist/symbolic debates, probabilistic modeling

* (51:00) The Deep Learning and traditional AI communities will adopt each other’s ideas

* (51:55) Why Stuart finds the self-driving car arena interesting, Waymo’s old-fashioned AI approach

* (57:30) Effective generalization without the full expressive power of first-order logic—deep learning is a “weird way to go about it”

* (1:03:00) A very short shrift of Human Compatible and its ideas

* (1:10:42) Outro

Links:

* Stuart’s webpage

* Human Compatible page with reviews and interviews

* Papers mentioned

    * Rationality and Intelligence

    * Principles of Metareasoning


Varun Ganapathi: AKASA, AI and Healthcare

29 September 2022 | 51 min

Joel Lehman: Open-Endedness and Evolution through Large Models

22 September 2022 | 99 min

Have suggestions for future podcast guests (or other feedback)? Let us know here!

In episode 42 of The Gradient Podcast, Daniel Bashir speaks to Joel Lehman.

Joel is a machine learning scientist interested in AI safety, reinforcement learning, and creative open-ended search algorithms. Joel has spent time at Uber AI Labs and OpenAI and is the co-author of the book Why Greatness Cannot be Planned: The Myth of the Objective.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSS

Follow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:40) From game development to AI

* (03:20) Why evolutionary algorithms

* (10:00) Abandoning Objectives: Evolution Through the Search for Novelty Alone

* (24:10) Measuring a desired behavior post-hoc vs optimizing for that behavior

* (27:30) Neuroevolution through Augmenting Topologies (NEAT), Evolving a Diversity of Virtual Creatures

* (35:00) Humans are an inefficient solution to evolution’s objectives

* (47:30) Is embodiment required for understanding? Today’s LLMs as practical thought experiments in disembodied understanding

* (51:15) Evolution through Large Models (ELM)

* (1:01:07) ELM: Quality Diversity Algorithms, MAP-Elites, bootstrapping training data

* (1:05:25) Dimensions of Diversity in MAP-Elites, what is “interesting”?

* (1:12:30) ELM: Fine-tuning the language model

* (1:18:00) Results of invention in ELM, complexity in creatures

* (1:20:20) Future work building on ELM, key challenges in open-endedness

* (1:24:30) How Joel’s research affects his approach to life and work

* (1:28:30) Balancing novelty and exploitation in work

* (1:34:10) Intense competition in AI, Joel’s advice for people considering ML research

* (1:38:45) Daniel isn’t the worst interviewer ever

* (1:38:50) Outro

Links:

* Joel’s webpage

* Evolution through Large Models: The Tweet

* Papers:

    * Abandoning Objectives: Evolution through the search for novelty alone

    * Evolving a diversity of virtual creatures through novelty search and local competition

    * Designing neural networks through neuroevolution

    * Evolution through Large Models

* Resources for (aspiring) ML researchers!

    * Cohere for AI

    * ML Collective


Andrew Feldman: Cerebras and AI Hardware

15 September 2022 | 57 min

Christopher Manning: Linguistics and the Development of NLP

8 September 2022 | 72 min

Jeff Clune: Genetic Algorithms, Quality-Diversity, Curiosity

1 September 2022 | 69 min

Catherine Olsson and Nelson Elhage: Anthropic, Understanding Transformers

26 August 2022 | 47 min

Been Kim: Interpretable Machine Learning

18 August 2022 | 72 min

Laura Weidinger: Ethical Risks, Harms, and Alignment of Large Language Models

5 August 2022 | 56 min

Sebastian Raschka: AI Education and Research

29 July 2022 | 64 min

Lt. General Jack Shanahan: AI in the DoD, Project Maven, and Bridging the Tech-DoD Gap

22 July 2022 | 31 min

Sara Hooker: Cohere For AI, the Hardware Lottery, and DL Tradeoffs

14 July 2022 | 53 min

Lukas Biewald: Crowdsourcing at CrowdFlower and ML Tooling at Weights & Biases

7 July 2022 | 47 min

Chip Huyen: Machine Learning Tools and Systems

30 June 2022 | 48 min

Preetum Nakkiran: An Empirical Theory of Deep Learning

24 June 2022 | 97 min

Max Woolf: Data Science at BuzzFeed and AI Content Generation

16 June 2022 | 48 min

Rosanne Liu: Paths in AI Research and ML Collective

10 June 2022 | 75 min

Ben Green: "Tech for Social Good" Needs to Do More

2 June 2022 | 53 min

Max Braun: Teaching Robots to Help People in their Everyday Lives

26 May 2022 | 83 min

Yejin Choi: Teaching Machines Common Sense and Morality

19 May 2022 | 76 min

David Chalmers on AI and Consciousness

12 May 2022 | 52 min

Greg Yang on Communicating Research, Tensor Programs, and µTransfer

28 April 2022 | 66 min

Nick Walton on AI Dungeon and the Future of AI in Games

24 March 2022 | 63 min

Connor Leahy on EleutherAI, Replicating GPT-2/GPT-3, AI Risk and Alignment

3 February 2022 | min

Percy Liang on Machine Learning Robustness, Foundation Models, and Reproducibility

27 January 2022 | 51 min

Eric Jang on Robots Learning at Google and Generalization via Language

8 January 2022 | 93 min

Rishi Bommasani on Foundation Models

9 December 2021 | 94 min

Upol Ehsan on Human-Centered Explainable AI and Social Transparency

3 December 2021 | 95 min

Miles Brundage on AI Misuse and Trustworthy AI

23 November 2021 | 54 min

Jeffrey Ding on China's AI Dream, the AI 'Arms Race', and AI as a General Purpose Technology

18 November 2021 | 71 min

Alex Tamkin on Self-Supervised Learning and Large Language Models

11 November 2021 | 71 min

Peter Henderson on RL Benchmarking, Climate Impacts of AI, and AI for Law

28 October 2021 | 89 min

Chelsea Finn on Meta Learning & Model Based Reinforcement Learning

14 October 2021 | 50 min

Devi Parikh on Generative Art & AI for Creativity

1 October 2021 | 55 min

Sergey Levine on Robot Learning & Offline RL

16 September 2021 | 54 min

Jeremy Howard on Kaggle, Enlitic, and fast.ai

9 September 2021 | 58 min

Evan Hubinger on Effective Altruism and AI Safety

3 September 2021 | 75 min

Yannic Kilcher on Being an AI Researcher and Educator

27 August 2021 | 41 min

Alexander Veysov on Self-Teaching AI and Creating Open Speech-To-Text

19 August 2021 | 47 min

Yann LeCun on his Start in Research and Self-Supervised Learning

5 August 2021 | 56 min

Anna Rogers on the Flaws of Peer Review in AI

30 July 2021 | 63 min

Joel Simon on AI art and Artbreeder

20 July 2021 | 58 min

Abubakar Abid on AI for Genomics, Gradio, and the Fatima Fellowship

6 July 2021 | 45 min

Helena Sarin on being an AI Artist

19 June 2021 | 41 min

Hello World from The Gradient Podcast!

1 June 2021 | 22 min

The Gradient: Perspectives on AI

Deeply researched, technical interviews with experts thinking about AI and technology.

About the podcast

Episodes

Jacob Andreas: Language, Grounding, and World Models

Evan Ratliff: Our Future with Voice Agents

Meredith Ringel Morris: Generative AI's HCI Moment

Davidad Dalrymple: Towards Provably Safe AI

Clive Thompson: Tales of Technology

Judy Fan: Reverse Engineering the Human Cognitive Toolkit

L.M. Sacasas: The Questions Concerning Technology

Pete Wolfendale: The Revenge of Reason

Peter Lee: Computing Theory and Practice, and GPT-4's Impact

Manuel & Lenore Blum: The Conscious Turing Machine

Kevin Dorst: Against Irrationalist Narratives

David Pfau: Manifold Factorization and AI for Science

Dan Hart and Michelle Michael: Bringing AI to Students in New South Wales

Kristin Lauter: Private AI, Homomorphic Encryption, and AI for Cryptography

Sergiy Nesterenko: Automating Circuit Board Design

C. Thi Nguyen: Values, Legibility, and Gamification

Vivek Natarajan: Towards Biomedical AI

Thomas Mullaney: A Global History of the Information Age

Seth Lazar: Normative Philosophy of Computing

Suhail Doshi: The Future of Computer Vision

Azeem Azhar: The Exponential View

David Thorstad: Bounded Rationality and the Case Against Longtermism

Ryan Tibshirani: Statistics, Nonparametric Regression, Conformal Prediction

Sasha Luccioni: Connecting the Dots Between AI's Environmental and Social Impacts

Michael Sipser: Problems in the Theory of Computation

Andrew Lee: How AI will Shape the Future of Email

Joss Fong: Videomaking, AI, and Science Communication

Kate Park: Data Engines for Vision and Language

Ben Wellington: ML for Finance and Storytelling through Data

Venkatesh Rao: Protocols, Intelligence, and Scaling

Sasha Rush: Building Better NLP Systems

Cameron Jones & Sean Trott: Understanding, Grounding, and Reference in LLMs

Nicholas Thompson: AI and Journalism

Subbarao Kambhampati: Planning, Reasoning, and Interpretability in the Age of LLMs

Russ Maschmeyer: Spatial Commerce and AI in Retail

Benjamin Breen: The Intersecting Histories of Psychedelics and AI Research

Ted Gibson: The Structure and Purpose of Language

Harvey Lederman: Propositional Attitudes and Reference in Language Models

Eric Jang: AI is Good For You

2023 in AI, with Nathan Benaich

Kathleen Fisher: DARPA and AI for National Security

Peter Tse: The Neuroscience of Consciousness and Free Will

Vera Liao: AI Explainability and Transparency

Thomas Dietterich: From the Foundations

Martin Wattenberg: ML Visualization and Interpretability

Laurence Liew: AI Singapore

Michael Levin & Adam Goldstein: Intelligence and its Many Scales

Jonathan Frankle: From Lottery Tickets to LLMs

Nao Tokui: "Surfing" Musical Creativity with AI

Divyansh Kaushik: The Realities of AI Policy

Tal Linzen: Psycholinguistics and Language Modeling

Kevin K. Yang: Engineering Proteins with ML

Arjun Ramani & Zhengdong Wang: Why Transformative AI is Really, Really Hard to Achieve

Miles Grimshaw: Benchmark, LangChain, and Investing in AI

Shreya Shankar: Machine Learning in the Real World

Stevan Harnad: AI's Symbol Grounding Problem

Terry Winograd: AI, HCI, Language, and Cognition

Gil Strang: Linear Algebra and Deep Learning

Anant Agarwal: AI for Education

Raphaël Millière: The Vector Grounding Problem and Self-Consciousness

Peli Grietzer: A Mathematized Philosophy of Literature

Ryan Drapeau: Battling Fraud with ML at Stripe

Shiv Rao: Enabling Better Patient Care with AI

Hugo Larochelle: Deep Learning as Science

Jeremie Harris: Realistic Alignment and AI Policy

Antoine Blondeau: Alpha Intelligence Capital and Investing in AI

Joon Park: Generative Agents and Human-Computer Interaction

Christoffer Holmgård: AI for Video Games

Riley Goodside: The Art and Craft of Prompt Engineering

Talia Ringer: Formal Verification and Deep Learning

Brigham Hyde: AI for Clinical Decision-Making

Scott Aaronson: Against AI Doomerism

Ted Underwood: Machine Learning and the Literary Imagination

Irene Solaiman: AI Policy and Social Impact

Drago Anguelov: Waymo and Autonomous Vehicles

Joanna Bryson: The Problems of Cognition