Podd: Chain of Thought

AI's Two Extremes – Foundations & The Frontier | Databricks’ Denny Lee

7 maj 2025 | 44 min

The AI landscape often pulls us between the allure of cutting-edge models and the quiet necessity of foundational work—yet how do these extremes actually connect to deliver value?

Join Conor Bronsdon as he welcomes Denny Lee, a self-proclaimed "data nerd" and Product Management Director, Developer Relations at Dataricks, to unpack this very spectrum, from AI's core infrastructure to its most advanced applications. Denny explains why robust logging, tracing, and data lineage are indispensable for credible AI evaluation and feedback, ultimately making AI systems more affordable, accessible, and impactful.

The discussion ventures into strategies for democratizing AI, exploring the "GenAI ladder" from efficient inference and retrieval-augmented generation to deciding when to fine-tune or pre-train models. Denny also tackles the industry's pressing hardware bottlenecks, the critical role of open standards, and the imperative of navigating data privacy in an increasingly AI-driven world. Listen for grounded advice on moving beyond the hype and making practical, value-driven decisions in your AI journey.

Chapters

00:00 Introduction and Guest Welcome

01:31 Diving into AI Foundations

02:25 Importance of Logging and Tracing

08:40 Challenges in Data Quality and Lineage

14:49 Strategies for Cost-Effective AI

19:52 Partnerships and Collaborative Opportunities

22:10 Hardware Bottlenecks in AI

24:56 China's Power and Networking Advantage

25:26 Nvidia's Super Chip and Network Fabrics

26:39 The Growing Demand for Power in AI

29:26 Practical Advice for Data Governance

35:47 Understanding Privacy in AI

36:25 Differential Privacy and Its Challenges

41:57 Conclusion

Follow the hosts

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Atin⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Conor⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Vikram⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠Yash⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow Today's Guest(s)

Website: Databricks.com

Podcast: Data Brew by Databricks (available on major podcast platforms)

YouTube: @Databricks

LinkedIn: Denny Lee

Read

SemiAnalysis Blog: https://semianalysis.com/

Check out Galileo

⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Try Galileo⁠⁠⁠

⁠Agent Leaderboard

Why Enterprises Need a Different Approach to AI Agents | Lyzr’s Siva Surendira

30 april 2025 | 39 min

Agentic AI exploded in 2025, but how do businesses move beyond prototypes to deploy reliable, valuable agents at scale?

Join host Conor Bronsdon and Lyzr AI CEO Siva Surendira as they discuss the complexities of building and managing AI agents for enterprises. Siva shares his journey creating Lyzr, focusing on making powerful agent frameworks accessible and trustworthy for enterprise developers. They discuss the critical hurdles businesses face, including productionization challenges, ensuring responsible AI, and bridging the gap between rapid innovation and the stringent requirements of regulated industries.

Listen as Siva explains Lyzr's approach to embedding safety guardrails natively and learn about the nuances of multi-agent orchestration, including managerial, DAG, and hybrid flows. Siva also offers insights into the limitations of "vibe coding" for enterprise use cases and stresses the crucial role of robust evaluation (evals) and choosing the right models—from local open-source options to frontier LLMs. Explore the bottlenecks hindering adoption, like custom application integration and data readiness, and learn why Siva believes the biggest opportunity for agent companies may not lie in replacing SaaS platforms but rather in automating the mundane work currently performed by humans.

Chapters

00:22 Introduction and Guest Welcome

00:52 Enterprise Agent Framework

02:48 Building Enterprise-Friendly AI Frameworks

04:56 Enterprise Concerns with Vibe Coding

09:23 Safe and Responsible AI Implementation

11:05 Multi-Agent Orchestration

14:13 Challenges in Multi-Agent Systems

14:22 Enterprise Integration Bottlenecks

17:37 The Role of Low-Code and No-Code Solutions

19:55 Inter-Agent Communication Standards

21:49 Future of AI Agents in Enterprises

29:37 Evaluating AI Agents

36:34 Conclusion and Final Thoughts

Follow the hosts

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Atin⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Conor⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Vikram⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠Yash⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow Today's Guest(s)

Website: lyzr.ai

LinkedIn: Siva Surendira

Check out Galileo

⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Try Galileo⁠⁠

Agent Leaderboard

Will AI Erase All Language Barriers? | Smartling's Olga Beregovaya

23 april 2025 | 41 min

Are we on the verge of removing all language barriers with AI?

Olga Beregovaya, VP of AI at Smartling, joins host Conor Bronsdon to tackle this question, discussing the evolution from rule-based NLP to today's powerful LLMs. Together, they confront the persistent challenges that stand in the way, like the English-centric nature of AI, domain-specific inaccuracies, and the unpredictability of model hallucinations. Olga unpacks the difficulties faced when striving for accurate, nuanced translation across all languages, especially under-resourced ones.

Beyond these hurdles, the conversation explores the cutting-edge opportunities and technical innovations driving progress, including RAG, the rise of purpose-built models, agentic AI workflows, and the potential of multilingual multimodality. Olga shares insights into boosting translator productivity, achieving more predictable quality, and the path toward human parity in translation, examining how technology and human expertise will shape the future of global communication.

Chapters

00:00 Introduction and Guest Welcome

01:14 Evolution of NLP: From Rule-Based to Machine Learning

02:40 Challenges in AI Translation

04:21 Biases in Language Models

05:28 Inference Time and Latency

05:44 English-Centric AI Models

08:53 Opportunities in AI Translation

09:14 Industries Benefiting from Language AI

10:36 Human-in-the-Loop Translation

12:06 Architectural Innovations in Language AI

16:20 Success with RAG Architectures

17:58 Multilingual Vectorization

19:54 Agentic AI in Translation

24:35 Data Sets and Data Privacy

28:30 Using Smaller, Purpose-Built Models

32:10 Future of AI in Translation

36:37 Conclusion and Farewell

Follow the hosts

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Atin⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Conor⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Vikram⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠Yash⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow Today's Guest(s)

LinkedIn Olga Beregovaya

LinkedIn ⁠Smartling

Check out Galileo

⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Try Galileo⁠⁠

AI, Low-Code, and Shaping the Next Generation of Apps | OutSystems' Rodrigo Coutinho

16 april 2025 | 40 min

AI Won't Solve Your Toughest Engineering Problems | Honeycomb’s Charity Majors

9 april 2025 | 42 min

Generative AI dominates the conversation, but does it actually make it easier to build, lead, and sustain high-performing engineering teams?

Host Conor Bronsdon sits down with Charity Majors, co-founder and CTO of Honeycomb (.io), and the mind behind charity.wtf. Known for her sharp insights and unfiltered opinions, Charity kicks off the discussion by expanding on her popular article: 'Generative AI is not going to build your engineering team for you.' Together, they explore how AI has altered the dynamics for engineering teams and leaders.

The discussion navigates the complex dynamics of hiring in an AI-enabled era, challenging the "senior-only" trend and championing the vital role of junior engineers in creating learning organizations. Charity also explains why writing code is often the "easy part" compared to the full lifecycle of owning and operating systems, a challenge amplified by AI-generated code.

Finally, Conor and Charity discuss the risk of "cognitive decay" from over-reliance on AI tools and why fostering deep system understanding remains paramount for engineers and leaders.

Chapters

00:00 Introduction and Guest Welcome

01:51 Generative AI and Engineering Teams

02:26 The Writing Process and Inspiration

03:49 AI's Impact on Hiring and Team Building

05:30 Embracing AI and Automation

07:43 The Role of Junior Engineers

09:33 Building Effective Engineering Teams

17:01 Future of AI in Code Generation

20:07 High Performing Engineering Teams

21:48 Evolving Expectations for Engineering Managers

22:41 Cognitive Decay

25:00 Feedback Loops in Software Systems

26:56 Hiring for Potential vs. Experience

29:17 The Future of Observability

39:50 Closing Thoughts and Advice for Engineers

Follow the hosts

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Atin⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Conor⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠⁠⁠ Vikram⁠⁠⁠⁠⁠⁠⁠⁠

Follow⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠Yash⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Follow Today's Guest(s)

Follow Charity: charity.wtf

Learn more about Honeycomb: www.honeycomb.io

Read: Generative AI is not going to build your engineering team for you

Check out Galileo

⁠⁠⁠⁠⁠⁠⁠⁠⁠Try Galileo⁠⁠

Building IBM's watsonx & The Future of Enterprise AI | Dr. Maryam Ashoori

2 april 2025 | 45 min

What is Information Symmetry and Will AI Unlock it? | DevRev’s Manoj Agarwal

26 mars 2025 | 41 min

Is There an Agent Bubble? | Spot AI’s Kelly Vaughn

19 mars 2025 | 35 min

Using AI to Modernize Your Legacy Applications | MongoDB’s Rachelle Palmer

12 mars 2025 | 44 min

Can Your AI Strategy Be Future-Proof? | Galileo’s Vikram Chatterji

5 mars 2025 | 29 min

The Making of Gemini 2.0: DeepMind's Approach to AI Development and Deployment | Logan Kilpatrick

12 februari 2025 | 41 min

DeepSeek Fallout, Export Controls & Agentic Evals

5 februari 2025 | 33 min

AI, Open Source & Developer Safety | Block’s Rizel Scarlett

29 januari 2025 | 34 min

AI in 2025: Agents & The Rise of Evaluation Driven Development

15 januari 2025 | 33 min

Now is the Time to Build | Weaviate’s Bob van Luijt

8 januari 2025 | 35 min

How AI Assistants Can Enhance Human Connection | Twilio’s Vinnie Giarrusso

18 december 2024 | 42 min

Lessons from Deploying AI at Enterprise Scale | ServiceTitan, Indeed & Twilio

11 december 2024 | 51 min

Practical Lessons for GenAI Evals | Chip Huyen & Vivienne Zhang

4 december 2024 | 48 min

As AI agents and multimodal models become more prevalent, understanding how to evaluate GenAI is no longer optional – it's essential.

Generative AI introduces new complexities in assessment compared to traditional software, and this week on Chain of Thought we’re joined by Chip Huyen (Storyteller, Tép Studio), Vivienne Zhang (Senior Product Manager, Generative AI Software, Nvidia) for a discussion on AI evaluation best practices.

Before we hear from our guests, Vikram Chatterji (CEO, Galileo) and Conor Bronsdon (Developer Awareness, Galileo) give their takes on the complexities of AI evals and how to overcome them through the use of objective criteria in evaluating open-ended tasks, the role of hallucinations in AI models, and the importance of human-in-the-loop systems.

Afterwards, Chip and Vivienne sit down with Atin Sanyal (Co-Founder & CTO, Galileo) to explore common evaluation approaches, best practices for building frameworks, and implementation lessons. They also discuss the nuances of evaluating AI coding assistants and agentic systems.

Chapters: 00:00 Challenges in Evaluating Generative AI

05:45 Evaluating AI Agents

13:08 Are Hallucinations Bad?

17:12 Human in the Loop Systems

20:49 Panel discussion begins

22:57 Challenges in Evaluating Intelligent Systems

24:37 User Feedback and Iterative Improvement

26:47 Post-Deployment Evaluations and Common Mistakes

28:52 Hallucinations in AI: Definitions and Challenges

34:17 Evaluating AI Coding Assistants

38:15 Agentic Systems: Use Cases and Evaluations

43:00 Trends in AI Models and Hardware

45:42 Future of AI in Enterprises

47:16 Conclusion and Final Thoughts

Follow: Vikram Chatterji: https://www.linkedin.com/in/vikram-chatterji/

Atin Sanyal: ⁠⁠https://www.linkedin.com/in/atinsanyal/

Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Chip Huyen: ⁠https://www.linkedin.com/in/chiphuyen/⁠ Vivienne Zhang: ⁠⁠https://www.linkedin.com/in/viviennejiaozhang/

Show notes: Watch all of Productionize 2.0: ⁠https://www.galileo.ai/genai-productionize-2-0⁠

The Real ROI of Enterprise AI | HP, ServiceNow & Accenture

27 november 2024 | 41 min

GenAI Predictions for 2025 | Databricks & Cohere

20 november 2024 | 40 min

Got Agents? | Weaviate, Unstructured & crewAI

13 november 2024 | 31 min

Trust, Regulation & the Road Ahead | Writer's May Habib

6 november 2024 | 49 min

Welcome to Chain of Thought

29 oktober 2024 | 1 min

Chain of Thought

Introducing Chain of Thought, the podcast for software engineers and leaders that demystifies artificial intelligence.

Om podden

Avsnitt

AI's Two Extremes – Foundations & The Frontier | Databricks’ Denny Lee

Why Enterprises Need a Different Approach to AI Agents | Lyzr’s Siva Surendira

Will AI Erase All Language Barriers? | Smartling's Olga Beregovaya

AI, Low-Code, and Shaping the Next Generation of Apps | OutSystems' Rodrigo Coutinho

AI Won't Solve Your Toughest Engineering Problems | Honeycomb’s Charity Majors

Building IBM's watsonx & The Future of Enterprise AI | Dr. Maryam Ashoori

What is Information Symmetry and Will AI Unlock it? | DevRev’s Manoj Agarwal

Is There an Agent Bubble? | Spot AI’s Kelly Vaughn

Using AI to Modernize Your Legacy Applications | MongoDB’s Rachelle Palmer

Can Your AI Strategy Be Future-Proof? | Galileo’s Vikram Chatterji

The Making of Gemini 2.0: DeepMind's Approach to AI Development and Deployment | Logan Kilpatrick

DeepSeek Fallout, Export Controls & Agentic Evals

AI, Open Source & Developer Safety | Block’s Rizel Scarlett

AI in 2025: Agents & The Rise of Evaluation Driven Development

Now is the Time to Build | Weaviate’s Bob van Luijt

How AI Assistants Can Enhance Human Connection | Twilio’s Vinnie Giarrusso

Lessons from Deploying AI at Enterprise Scale | ServiceTitan, Indeed & Twilio

Practical Lessons for GenAI Evals | Chip Huyen & Vivienne Zhang

The Real ROI of Enterprise AI | HP, ServiceNow & Accenture

GenAI Predictions for 2025 | Databricks & Cohere

Got Agents? | Weaviate, Unstructured & crewAI

Trust, Regulation & the Road Ahead | Writer's May Habib

Welcome to Chain of Thought