The Cloudcast

Managing the Business Impact of Data Quality

34 min • March 9, 2022

Elliot Shmukler (@eshmu, Co-Founder/CEO @anomalo_hq) and Jeremy Stanley (@jeremystan, Co-Founder/CTO) talk about how data integrity and changes can impact both technology and business decisions. 

SHOW: 598

CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw

CHECK OUT OUR NEW PODCAST - "CLOUDCAST BASICS"

SHOW SPONSORS:

  • Polyscale.ai - PolyScale solves global database latency and performance with intelligent serverless caching and compute at the edge.
  • Check out PolyScale’s global edge data platform at www.polyscale.ai and sign up for a free account today.
  • strongDM - Secure infrastructure access for the modern stack. 
  • Manage access to any server, database, or Kubernetes instance in minutes. Fully auditable, replayable, secure, and drag-and-drop easy. Try it free for 14 days - www.strongdm.com/signup
  • CloudZero - Cloud Cost Intelligence for Engineering Teams

SHOW NOTES:

Topic 1 - Welcome to the show. Let’s start with brief introductions and backgrounds. Elliot and Jeremy, please introduce yourselves.

Topic 2 - As mentioned, you both met and worked together at Instacart. Let’s introduce the concept of data quality and integrity to those who may not be familiar. What patterns and problems did you see? Any horror stories you can share?

Topic 3 - Up until now, I’ve often seen big data framed as finding the needle in the haystack. It’s in there somewhere; you just have to find it. But what if it is the wrong needle?

Topic 4 - Are we also talking about data drift over time as the amount of data grows? As data is added to the pool, we may come to different conclusions. How do we know if the new conclusions are based on good data or bad data?

Topic 5 - You recently announced a partnership with Snowflake. This makes sense: why build another data lake, which of course may be a barrier to entry for some, correct? Are we reaching a point with big data and data lakes where we can have one single source of truth? I know we are in the early days, but how do most customers approach big data today, and how do they keep their data sets up to date?

FEEDBACK?
