MLOps Coffee Sessions #75 with Shreya Shankar, Towards Observability for ML Pipelines.
// Abstract
Achieving observability in ML pipelines is a mess right now. We are tracking thousands of means, percentiles, and KL divergences of features and outputs in a haphazard attempt to figure out when and how to retrain models.
In this session, we break down current unsuccessful approaches and discuss the path towards effectively maintaining ML models in production. Along the way, we introduce mltrace -- a preliminary open source project striving towards "bolt-on" observability in ML pipelines.
// Bio
Shreya Shankar is a computer scientist living in the Bay Area. She's interested in building systems to operationalize machine learning workflows. Shreya's research focus is on end-to-end observability for ML systems, particularly in the context of heterogeneous stacks of tools.
Currently, Shreya is doing her Ph.D. in the RISE lab at UC Berkeley. Previously, she was the first ML engineer at Viaduct, did research at Google Brain, and completed her BS and MS in computer science at Stanford University.
// Related Links
Shreya Shankar's blogposts: https://www.shreya-shankar.com/
Shreya Shankar's Podcasts: https://www.listennotes.com/top-episodes/shreya-shankar/
The deployment phase of machine learning by Benedict Evans: https://www.ben-evans.com/benedictevans/2019/10/4/machine-learning-deployment
Shreya Shrankar's mltrace blogpost: https://www.shreya-shankar.com/introducing-mltrace/
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with Shreya on LinkedIn: https://www.linkedin.com/in/shrshnk
Timestamps:
[00:00] Introduction to Shreya Shankar
[01:12] Shreya's background
[03:22] Contrast in scale influence
[05:28] Embedding ML and building machine learning infused products
[07:26] Management structure and professional incentive
[08:25] Organizational side of MLOps retros
[10:15] Tooling implementations
[12:00] Structured rational investment hardships
[13:17] Working at a start-up
[14:02] Academic work and entrepreneurial ambitions
[16:00] ML Monitoring Observability interest
[17:14] Where to get started
[20:47] Realization while at Viaduct
[23:30] Preventing alert fatigue
[27:04] Tooling bridging the gap
[30:40] Juncture at overall MLOps ecosystem
[33:58] The deployment phase of machine learning - it's the new SQL by Benedict Evans
[35:30] Model monitoring
[36:16] mltrace
[38:28] Introducing mltrace blog post series
[41:25] Tips to our content creators/writers
[43:47] Monitoring through the lens of the database
[47:37] Advice about picking up ML engineering and ML systems development in 2022
[49:36] Database low down the stack
[50:51] Most excited about 2022
[52:13] What MLOps space/ecosystem should change?
[53:21] Funding has changed the incentives around innovation
[54:52] Competition in million-dollar rounds
[55:25] Starting a company
[56:30] Wrap up