Elliot Shmukler (@eshmu, Co-Founder/CEO @anomalo_hq) and Jeremy Stanley (@jeremystan, Co-Founder/CTO) talk about how data integrity and changes can impact both technology and business decisions.
SHOW: 598
CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw
CHECK OUT OUR NEW PODCAST - "CLOUDCAST BASICS"
SHOW SPONSORS:
SHOW NOTES:
Topic 1 - Welcome to the show. Let’s start with brief introductions and backgrounds. Elliot & Jeremy please introduce yourselves.
Topic 2 - As mentioned you both met and worked together at Instacart. Let’s introduce the concept of data quality and integrity to those who may not be familiar. What patterns and problems did you see. Any possible horror stories you can share?
Topic 3 - Up until now, I’ve seen big data often more about finding the needle in the haystack. It’s in there somewhere, you just have to find it. But, what if it is the wrong needle?
Topic 4 - Are we also talking about data drift over time as the amount of data grows? As data is added to the pool, we may come to different conclusions. How do we know if the new conclusions are based on good data or bad data?
Topic 5 - You recently announced a partnership with Snowflake. This makes sense, why build another data lake and that of course may be a barrier of entry to some, correct? Are we reaching a point with big data and data lakes that we can have one single source of truth? I know we are in early days but how do most customers approach big data today and how do they keep the data sets up to date.
FEEDBACK?