100 avsnitt • Längd: 50 min • Månadsvis
Mark Rittman is joined each episode by a special guest from the world of business intelligence, analytics and big data.
The podcast Drill to Detail is created by Rittman Analytics. The podcast and the artwork on this page are embedded on this page using the public podcast feed (RSS).
Join Mark Rittman in this special end-of-year episode as he speaks with Noel Gomez, co-founder of DataCoves about the challenges and opportunities of orchestrating dbt and other tools within the open-source Modern Data Stack, navigating the evolving semantic layer landscape and the future of modular, vendor-agnostic data solutions.
Build vs Buy Analytics Platform: Hosting Open-Source Tools
Mark is joined in this episode by Johan Baltzar, previously Product Analytics Manager at Spotify and now co-founder and CEO at Steep to talk about the role analytics played in Spotify’s growth story, the startup scene in Stockholm, Sweden and Steep’s metrics-first approach to user-centric business analytics.
The Kry founders factory: Meet 15 employees-turned-founders
Steep homepage
Mark is joined in this latest Drill to Detail episode by Tobias (Toby) Mao, CTO and co-Founder at Tobiko Data to talk about SQLGlot, the innovation culture at Airbnb and the data engineering challenges solved by SQLMesh.
Mark is joined in this episode by Christian Steinert to talk about his solo journey building Steinert Analytics, a data analytics consultancy transforming data into actionable revenue and cost-saving insights for Central Ohio's SaaS startups.
Steinert Analytics homepage
Christian Steinert LinkedIn Profile
“Helping a Roofing Company Define Their Sales Process”
“Transforming a Top 5 Fast Food Company's Operational Drive-Thru Reporting”
Mark Rittman is joined in this episode by Ethan Aaron, Founder & CEO of Portable, to talk about what you should do if you’re the first data hire at a company, when and when not to hire a consultant, the founding story of Portable.io and the long tail (and economic model) of the data integration connectors market.
Ethan Aaron LinkedIn Profile
Portable.io Homepage
“The biggest misconceptions about data integrations..” (LinkedIn)
“You join a 100 person company as the head of data. What should you do?” (LinkedIn)
“The stuff no one will tell you about running a data team” (LinkedIn)
Mark Rittman is joined by returning guest David Jayatillake, VP of AI at Cube.dev, to talk about Delphi Labs’ journey from a standalone data analytics chatbot to now becoming the basis of Cube’s new AI features within its composable semantic model product.
Introducing the AI API and Chart Prototyping in Cube Cloud
Mark Rittman is joined in this final episode of the current series of Drill to Detail by returning guest Jordan Tigani, Co-Founder & CEO at Motherduck to talk about the journey from big data to small data and bringing hybrid cloud execution to DuckDB.
Announcing Motherduck: Hybrid Execution Scales Duckdb From Your Laptop Into The Cloud
Mark Rittman is joined in this special episode by previous Looker founder and inventor of the LookML language Lloyd Tabb together with Carlin Eng, Product Manager at Google Cloud to talk about Malloy, a new query language to replace SQL for analytics.
Drill to Detail Ep.60 'A Deeper Look Into Looker' With Special Guest Lloyd Tabb
Malloy : An experimental language for data
A Sequel to SQL? An introduction to Malloy
Why SQL syntax sucks, and why it matters
Dimensional Flexibility, one of the things that makes Malloy Special
Mark Rittman is joined in this episode by returning guest and Elementl Founder Nick Shrock to talk about Dagster's role in the modern data stack ecosystem and software defined assets, a new, declarative approach to managing data and orchestrating its maintenance.
Introducing Software-Defined Assets
Rethinking Orchestration as Reconciliation: Software-Defined Assets in Dagster
Optimizing Data Materialization Using Dagster’s Policies
How I use Dagster to orchestrate the production of social science data assets
Mark Rittman is joined in this special episode of Drill to Detail by Benn Stancil, CTO and Founder at Mode to talk about their recent acquisition by ThoughtSpot, the economics of the modern data stack and how AI and LLMs could likely become our industry’s next discontinuity.
We don’t need another SQL chatbot
ThoughtSpot acquires Mode Analytics, a BI platform, for $200M in cash and stock
ChatGPT-Powered Data Analysis using Cube, Delphi and the Code Interpreter Plugin
Mark Rittman is joined in this episode by Artyom Keydunov, Founder at Cube to talk about embedded analytics and Cube's origin story, headless BI, query acceleration and how Cube and Delphi enable an AI-powered conversational interface for the semantic layer.
Empower uniquely insightful AI & LLM data experiences
AI-powered conversational interface for the semantic layer
Define metrics upstream to align your team
How Rittman Analytics Delivers the Semantic Layer Today with Cube
Hightouch co-Founder and co-CEO Tejas Manohar returns as special guest to talk with Mark Rittman about the reverse ETL market today, the evolution of the composable customer data platform and new featured in Hightouch to enrich customer profiles and drive personalization across marketing campaigns.
Reverse ETL is Dead (Ethan Aaron LinkedIn Post)
Customer 360 Data Warehousing and Sync to Hubspot
Mark is joined by Shinji Kim, Founder and CEO of Select Star to talk about their mission to re-invent data catalogs, data discovery and data lineage for the modern data stack
Data Discovery vs. Data Observability: Understanding the Differences for Better DataOps
Select Star and dbt Labs Partner for Better Data Discovery on dbt
Mark Rittman is joined by Jason Pohl, Senior Director of Data Management at Databricks along with special co-host and Head of Customer Experience at Coalesce, Stewart Bryson to talk about data lakehouses, Databricks and helping to raise tigers in a Thai Tiger Temple.
The Databricks Lakehouse Platform
Mark Rittman is joined by Twilio Segment Head of Product Kevin Niparko to talk about trends in the customer data platform market, Reverse ETL and Profiles Sync, the impact of LLMs (Large Language Models) on digital customer experience and Segment Unify, a consumer scale real-time identity resolution solution that provides complete, real-time, portable customer profiles.
Segment Unify is here: complete, real-time, portable customer profiles
Activate warehouse data in all of your destination tools
Customer profiles made portable
Drill to Detail Ep. 86 'Reverse ETL, Hightouch and CDW as CDP' with Special Guest Tejas Manohar
Mark Rittman is joined by David Jayatillake, CEO and co-founder of Delphi Labs, to talk about the role of semantic models and the marketplace today, large language models and the phenomena that is ChatGPT, and how Delphi Labs are planning on bringing AI to Slack and the modern data stack.
https://www.delphihq.com/#about
https://www.linkedin.com/in/david-jayatillake/details/experience/
For B2B Generative AI Apps, Is Less More?
LinkedIn post on rationale behind Delphi Labs product development
Semantic Superiority - Part 1 and Part 2
Semantic Search product feature
LLM Implications on Analytics (and Analysts!)
“Is This You?” Entity Matching in the Modern Data Stack with Large Language models and github repo
ChatGPT, Large Language Models and the Future of dbt and Analytics Consulting
Joining Mark Rittman for this 101st Episode Special for the Drill to Detail Podcast is Tristan Handy, CEO and Founder of dbt Labs talking about what went right at RJ Metrics, how the Analyst Collective led to today’s community around the open-source dbt project and his personal journey from being in the lab building Fishtown Analytics to CEO of today’s hottest data analytics startup … and why he secretly wishes he was Mark (according to Mark).
Ep.100 Special ‘Past, Present and Future of the Modern Data Stack’ with Special Guests Keenan Rice, Stewart Bryson and Jake Stein (Drill to Detail Podcast)
My $2.6 Billion Ecosystem Fail: an RJMetrics Post Mortem (Bob Moore)
How Best-in-Class eCommerce Businesses Achieve 230% Growth (2x eCommerce)
Introducing the RA Warehouse dbt Framework : How Rittman Analytics Does Data Centralization using dbt, Google BigQuery, Stitch and Looker (Rittman Analytics Blog)
Goodbye RJMetrics, Hello Fishtown Analytics (Tristan Handy)
Ep.33 'Building Out Analytics Functions in Startups' With Special Guest Tristan Handy (Drill to Detail Podcast)
Analytics is a Trade (Tristan Handy)
Analyst Collective website (via the Internet Archive)
Building a Mature Analytics Workflow: The Analyst Collective Viewpoint (SlideShare)
Fishtown Analytics : Frequently-Asked Questions (via the Internet Archive)
Ep 41: dbt Labs + Transform join forces on metrics w/ Nick Handel + Drew Banin (Analytics Engineering Podcast via Spotify)
Joining Mark Rittman for this 100th Episode Special are Keenan Rice (previously Founding Team, GTM Executive at Looker, now GM at Firebolt), Stewart Bryson (previously CEO at Red Pill Analytics, now Head of Customer Experience at Coalesce) and joining us mid-way through the show as our even more special mystery guest, Jake Stein (previously Co-Founder and COO at RJ Metrics, CEO and Co-Founder at Stitch and now CEO and Co-Founder at Common Paper.
Drill to Detail Past Episodes discussed on the show were:
Drill to Detail Ep.23 ‘Looker, BigQuery and Analytics on Big Data’ With Special Guest Daniel Mintz
Drill to Detail Ep.71 'The Rise of Snowflake Data Warehouse' With Special Guest Kent Graziano
Drill to Detail Ep.99 “Is the Modern Data Stack Dead?” with Special Guest Chris Tabb
Other show notes below:
Gartner Makes it Official: The Age of Self-Service Is Upon Us
RJMetrics Acquired by Magento Commerce, Pipeline is now Stitch
The Startup Founders’ Guide to Analytics - Tristan Handy
Analyzing Drill to Detail Podcast Stats using Hex
Google Completes Looker Acquisition
The Drill to Detail Podcast returns for a new series, with Mark Rittman joined by Special Guest Chris Tabb, Co-Founder & CCO at LEIT DATA, to discuss dimensional models and data meshes, when to bring in a consultant and to answer the question on everyone’s lips right now … “is the modern data stack dead”?
Mark Rittman is joined by Izzy Miller to talk about Hex, a collaborative data platform that brings everyone together to explore, analyze, and share … and why the future of notebooks isn’t just about notebooks.
Public app gallery of example Hex projects .
We’re joined on this episode of Drill to Detail by fellow Brightonian Daniel Perry-Reed from Measurelab, talking with Mark about the new Google Analytics 4 release and its role within the future Marketing Technology stack.
From the wild west of data collection to conversion modelling
The sun is setting on Universal Analytics
MeasureFest April 2022: The role of GA in the future MarTech stack
Mark Rittman is joined in this very special episode by Colin Zima, ex-Head of Analytics at Looker and now co-founder of Omni, a new analytics startup looking to become the fastest way to ask the first 100 questions.
Introducing Omni: The fastest way to turn SQL into intelligence
Mark Rittman is joined in this episode by Peter Fishman to talk about Mozart Data, the modern data stack and the need to automate the role of the analytics engineer.
MozartData website
Data Bites: A Virtual Lunch and Learn with Powered by Fivetran, Snowflake and Mozart Data
Launch HN: Mozart Data (YC S20) – One-stop shop for a modern data pipeline
Mark Rittman is joined by Eric Dodds to talk about Rudderstack's founding story and open-source roots, Segment compatibility and event-based pricing vs. MTUs and Rudderstack's "Warehouse-First" approach to building customer data platforms.
Rudderstack's Data Stack : Deep Dive
Introducing RudderStack Cloud: The Warehouse-First CDP for Developers
Rudderstack, Snowplow and Open-Source CDP Alternatives to Segment
Why (and How) Customer Data Warehouses are the New Customer Data Platform
In this first episode of a new series, Mark Rittman is joined by Katie Hindson to talk about Lightdash, a new open-source alternative to Looker that uses dbt as its metrics layer and semantic model.
We're joined on this episode of Drill to Detail by Dylan Baker and Josh Temple to talk about developing multi-layer modern data stack projects, Spectacles’ recently released integration with dbt and what's now possible with their new public API
Extending Spectacles with API-Triggered Runs
Synchronizing Looker and dbt™ with Spectacles
Adding Looker Regression Tests to the dbtCloud CI/CD Test Pipeline using Spectacles
Mark Rittman is joined in this episode by Nick Schrock, ex-Facebook Engineer and now Founder of Elementl to talk about GraphQL, data platform engineers and @dagsterio, a data orchestrator for machine learning, analytics, and ETL.
Orchestrating dbt with Dagster
Moving past Airflow: Why Dagster is the next-generation data orchestrator
Mark Rittman is joined in this episode by Michale Drogalis, Product Manager at Confluent to talk about the history of Apache Kafka and real-time stream processing, the origins of kSQL and kSQLdb and the rationale and use-cases addressed by these products.
Mark Rittman is joined by Eldad Farkash, Co-Founder and CEO at Firebolt to talk about SiSense and Panorama, the history of cloud data warehouses and how Firebolt's technology delivers query results 4-6000x faster than Snowflake, Redshift and AWS Athena.
Snowflake’s Co-Founder Marcin Żukowski Reflects On His Time At CWI
Centrum Wiskunde & Informatica & MonetDB
Eldad Farkash, inventor of In-Chip technology, takes top honors with IT Software (Individual) award
Maxime Beauchemin returns to the Drill to Detail Podcast and joins Mark Rittman to talk about what's new with Apache Airflow 2.0, the origin story for Apache Superset and now Preset.io, why the future of business intelligence is open source and news on Marquez, a reference implementation of the OpenLineage open source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata sponsored by WeWork.
Apache Superset is a modern data exploration and visualization platform
The Future of Business Intelligence is Open Source
Powerful, easy to use data exploration and visualization platform, powered by Apache Superset™
Admunsen: Open source data discovery and metadata engine
Marquez: Collect, aggregate, and visualize a data ecosystem's metadata
Mark Rittman is joined in this episode by Jeff Pollock, Vice President Product Development for Fast Data at Oracle, to talk about the history of Oracle Data Integration from Warehouse Builder and Sunopsis to the move to the cloud and today’s distributed “data mesh”, based around Oracle’s fast data product, Oracle GoldenGate.
Technology Brief : Dynamic Data Fabric and Trusted Data Mesh using the Oracle GoldenGate Platform
Data Mesh Part 1: Future of Data Integration with a Deep Dive into GoldenGate, Kafka and Spark
How to Move Beyond a Monolithic Data Lake to a Distributed Data Mesh
An Introduction to Real-Time Data Integration
Drill to Detail Ep.22 'SnapLogic's Enterprise Integration Cloud' With Special Guest Craig Stewart
Mark Rittman is joined by Tejas Manohar from Hightouch to talk about the concept of "reverse ETL", his journey from working on Segment's Personas product to co-founding Hightouch and his recent guest post on the Fivetran Blog, "Why Your Customer Data Platform Should Be the Data Warehouse".
Why Your Customer Data Platform Should Be the Data Warehouse
Hightouch Ushers In The Era Of Operational Analytics
Customer 360 Data Warehousing and Sync to Hubspot using BigQuery, dbt, Looker and Hightouch
Why (and How) Customer Data Warehouses are the New Customer Data Platform
The Drill to Detail Podcast returns for a new series with Mark joined by special guest Barr Moses to talk about data-driven customer success, data quality and the story behind her new data reliability startup Monte Carlo.
Monte Carlo - Data reliability delivered
Delivering End-to-End Data Observability with Looker and Monte Carlo
Data Observability: How to Build Your Own Anomaly Detectors Using SQL
The Drill to Detail Podcast returns for an end-of-year and Coalesce 2020 special episode, with Mark Rittman joined by Tristan Handy from Fishtown Analytics along with Stewart Bryson from Red Pill Analytics.
Snowflake Stock Has More Than Tripled Since Its IPO. CEO Frank Slootman Explains What’s Next.
Mark Rittman is joined in this episode by Davis Clark, Founder at Futuremodel, to talk about democratizing Looker development by automating and abstracting the process of LookML development.
Futuremodel Homepage
Futuremodel Early Access Beta Program
Mark Rittman is joined in this episode by Josh Temple, Analytics Engineer at Spotify to talk about analytics development, automated testing and Spectacles, an open-source tool and SaaS service that automatically tests your LookML to ensure Looker always runs smoothly for your users.
Spectacles product homepage and early access program
Automated testing in the modern data warehouse
#JOIN19 - Using Customized Open-Source Tools With LookML
Continuous Integration and Automated Build Testing with dbtCloud
Mark Rittman is joined in this episode by GitLab.com Lead Developer Douwe Maan to discuss the history of the open-source Meltano project and its recent refocus on becoming the "glue" that creates open-source data pipelines, using plugin technologies such as Singer Taps and dbt transformations.
Meltano homepage and quickstart demo
Hey, data teams - We're working on a tool just for you
Revisiting the Meltano strategy: a return to our roots
Why we are building an open source platform for ELT pipelines
Singer - Simple, Composable, Open Source ETL
Mark Rittman is joined in this episode by Hubspot's Director of Data Infrastructure James Densmore to talk about distributed and remote-friendly data teams, DevOps with dbt and Apache Airflow and career path options for data engineers.
How should I structure my data team? A look inside HubSpot, Away, M.M. LaFleur, and more (dbt Blog)
Software Engineers do not Need to Become Managers to Thrive (Data Liftoff Blog)
The Misunderstood Data Engineer (Data Liftoff Blog)
Modular ELT (Data Liftoff Blog)
Test SQL Pipelines against Production Clones using DBT and Snowflake (Dan Gooden)
HubSpot Data Actions, Harvest Analytical Workflows and Looker Data Platform (Rittman Analytics Blog)
Mark Rittman is joined by special guests Drew Banin, co-founder of Fishtown Analytics and maintainer of dbt (data build tool) and Stewart Bryson, long-time friend of the show and CEO/Co-founder of Red Pill Analytics to talk about scaling modern data stack projects from startups to the enterprise; how do you deal with data quality issues when there’s no central record of customers, how do we introduce data governance, enterprise requirements and meet the needs of enterprise architects and how do we scale concepts such as agile and analytics as engineering beyond our initial champion and data team?
Fishtown Analytics and Drew Banin
Multi-Channel Marketing Attribution using Segment, Google BigQuery, dbt and Looker
In this special episode of Drill to Detail Mark Rittman is joined by Seth Rosen, Co-Founder and Principal at Hashpath to talk about the impact of COVID-19 and the subsequent economic shutdown on the business of analytics and product startups, the work being done in the community to help find a cure and the role that data and analytics can have in accelerating the post-shutdown recovery.
“A COVID-19 Dashboard for Massachusetts using Looker and BigQuery” - Hashpath.com blog
“The Virus Survival Strategy for your Startup"
“COVID-19: Implications for business” - McKinsey
“As COVID-19 data sets become more accessible, novel coronavirus pandemic may be most visualized ever” - ZDNet.com
“Leave COVID-19 analysis to the experts” - Japan Times
We return with a new episode featuring Keboola CEO Pavel Dolezal on scaling analytics adoption beyond technical developers, the original story behind Keboola and how their team and friends won the recent Looker Join 2019 Hackathon event in San Francisco.
Keboola homepage
“DataOps is NOT Just DevOps for Data” - Data Kitchen Blog -
Mark Rittman is joined in the final Drill to Detail Podcast episode of the year by Calvin French-Owen, co-founder and CTO at Segment to talk about Segment's founding story, partner ecosystems and their new customer data platform service, Segment Personas.
analytics.js: The hassle-free way to integrate analytics into any web application (Github.com)
The Million Dollar Engineering Problem (Segment Blog)
Tracking Customer Email Marketing Interactions using Segment Personas and Connections (Rittman Analytics Blog)
Segment Personas product homepage
Mark Rittman is joined in this specially-extended episode of Drill to Detail by Olivier Dupuis, founder of Lantrn Analytics to share his experiences growing a product analytics business using today’s modern, modular SaaS analytics tools
"How it Works" - Lantrns Analytics
"How Rittman Analytics does Analytics: Modern BI Stack Operational Analytics using Looker, Stitch, dbt and Google BigQuery" - Rittman Analytics Blog
Mark Rittman is joined in this episode by Bud Endress, Director, Product Management at Oracle to talk about the evolution of Oracle's query acceleration and in-database OLAP features from the acquisition Express Server back in the 90's to today's Autonomous Data Warehouse and Analytic Views.
The Drill to Detail Podcast returns for a new season with Mark Rittman joined in this episode by Colin Zima, VP of Product and Chief Analytics Officer at Looker to talk about luck, thinking differently and the deliberate design choices behind Looker Data Platform.
JOIN 2018 - Colin Zima Product Keynote - Part 1
GrowthHackers.com AMA with Colin Zima, Chief Analytics Officer & VP of Product at Looker
HotelTonight - Story of a Modern Data-Driven Business
Drill to Detail Ep.60 'A Deeper Look Into Looker' With Special Guest Lloyd Tabb
In this final Drill to Detail Episode before we take a break for the summer, Mark Rittman is joined by Bhav Patel, founder of the London Conversion Rate, Optimization and Product Analytics Meetup to talk about Conversion Rate Optimization, Experimentation and A/B Testing, Customer vs. Product Analytics, Attribution and Personalization ... and the story behind the CRAP Meetups.
CRAP Talks: CRO, Analytics and Product London
Analytics, BigQuery, Looker and How I Became an Internet Meme for 48 Hours (CRAP Talks presentation)
McDonald’s is acquiring Dynamic Yield to create a more customized drive-thru (TechCrunch)
Mark Rittman is joined in this episode of Drill to Detail by returning guest Kent Graziano, Chief Technical Evangelist for Snowflake, to talk about Snowflake Data Warehouse's cloud-first architecture and recent product announcements at Snowflake Summit 2019.
Try Snowflake Data Warehouse for free
Snowflake Materialized Views: A Fast, Zero-Maintenance, Accurate Solution
Modern Data Sharing: The Opportunities Are Endless
Drill to Detail Ep. 5 'SnowflakeDB, and is Data Modeling Dead?' with Special Guest Kent Graziano
Mark Rittman is joined by Bruno Aziza, Group Vice President, AI, Data Analytics & Cloud at Oracle to talk about the recently-updated product roadmap for Oracle Analytics, Oracle BI in the marketplace, recent acquisitions in the analytics marketplace and the recent Oracle Analytics Summit at Skywalker Ranch, California.
Oracle Analytics for Applications: Oracle Analytics Summit Product Tour
Oracle Analytics: Honing 18+ products down to a single brand
Mark Rittman, Founder and CEO of Rittman Analytics is joined in this special episode of Drill to Detail by returning guests Tristan Handy, Founder and CEO of Fishtown Analytics and Stewart Bryson, Founder and CEO of Red Pill Analytics to talk about the recent acquisitions of Looker by Google Cloud Platform and Tableau by Salesforce, the wider story about consolidation in the BI industry and whether the trend in analytics tools is towards enterprise features … or open source?
Five Thoughts on the Looker Acquisition by GCPCloud - Mark Rittman, Twitter Thread
A Wave of Acquisitions in Business Intelligence - Tristan Handy, Fishtown Analytics Blog
Google to acquire Looker - Thomas Kurian Blog Post
Google Cloud and Looker (PDF)
Siebel Analytics - The Jewel in the Project Fusion Crown? - Mark Rittman’s Oracle Weblog via Internet Archive
Salesforce Signs Definitive Agreement to Acquire Tableau
Salesforce.com's Tableau Acquisition: Admitting Organic Innovation Failure?
In this special edition of the Drill to Detail Podcast hosted by Stewart Bryson, CEO and Co-Founder of Red Pill Analytics, he is joined by Robin Moffatt and Ricardo Ferreira, Developer Advocates at Confluent, to talk about Apache Kafka and Confluent, event-first thinking and streaming real-time analytics.
Confluent Download: https://www.confluent.io/download/
Demo: https://github.com/confluentinc/cp-demo/
Slack group: http://cnfl.io/slack
Mailing list: https://groups.google.com/forum/#!forum/confluent-platform
From Zero to Hero with Kafka Connect: http://rmoff.dev/ksldn19l-kafka-connect-slides
No More Silos: Integrating Databases and Apache Kafka: http://rmoff.dev/ksny19-no-more-silos
The Changing Face of ETL: Event-Driven Architectures for Data Engineers: http://rmoff.dev/changing-face-of-etl
Mark Rittman is joined by Dylan Baker, freelance analytics consultant, to talk about thinking probabalistically, analytics within venture-funded startups, devops and its role in scaling-out the modern BI stack.
Mark Rittman is joined by Matthew Halliday to talk about the challenge of ETL and analytics on complex relational OLTP data models, previous attempts to solve these problems with products such as Oracle Essbase and Oracle E-Business Suite Extensions for Oracle Endeca and how those experiences, and others, led to his current role as co-founder and VP of Products at Incorta.
The Death of the Star Schema: 3 Key Innovations Driving the Rapid Demise
Accelerating Analytics with Direct Data Mapping
Accelerating Operational Reporting & Analytics for Oracle E-Business Suite (EBS)
The Good, the Bad, and the Ugly of Extract Transform Load (ETL)
E-Business Suite Extensions for Endeca: Technical Considerations
The Pain of Operational Reporting Solutions for Oracle E-Business Suite (EBS)
Mark Rittman is joined by CEO and Founder of Supermetrics, Mikael Thuneberg, to tell the story of how a mention in the official Google Analytics Blog and a prize of a t-shirt led to him founding and bootstrapping a €2M ARR marketing analytics business that’s probably the most important software vendor the Drill to Detail audience has never heard of, and who recently moved into the data pipelines-as-a-service market in collaboration with Google Cloud Platform and the Google BigQuery team.
Announcing Supermetrics for BigQuery: Get a marketing data warehouse up and running in minutes (Supermetrics blog)
Supermetrics, Google BigQuery and Data Pipelines for Digital Marketers (Rittman Analytics blog)
Mark Rittman is joined in this episode by Jordan Tigani, Director of Product Management at Google for Google BigQuery, to talk about the history of BigQuery and its technology origins in Dremel; how BigQuery has evolved since its original release to handle a wide range of data warehouse workloads; and new features announced for BigQuery at Google Next’19 including BI Engine, Storage API, Connected Sheets and a range of new data connectors from Supermetrics and Fivetran.
Modern Data Warehousing with BigQuery (Cloud Next '19)
Modern data warehousing with BigQuery: a Q&A with Engineering Director Jordan Tigani
Introduction to BigQuery BI Engine
Supermetrics and Fivetran BigQuery connectors
Drill to Detail Ep.2. 'Future of SQL on Hadoop', With Special Guest Dan McClary
Mark Rittman is joined by returning Special Guest Jake Stein, former co-founder and CEO of Stitch and now SVP of Stitch at Talend to talk about the evolution of the data pipeline-as-a-service, data catalogs and data governance and the vision behind Talend’s acquisition of Stitch.
”The Vision behind Talend's acquisition of Stitch”
“dbt: Analytics Engineering that Works”
Talend homepage
Mark Rittman is joined in this episode by Mike Ferguson, long-term analyst, consultant and Managing Director of Intelligent Business Strategies to talk about data warehouse modernization, analytics and big data project success within enterprise customers and the re-emergence of interest in data governance and master data management within the industry.
Mark Rittman is joined by DataRobot’s Jordan Meyer to discuss Kaggle, machine learning, deep neural networks and his team’s strategy to win the $1m ZIllow Prize, beating over 1000 other teams to come up with the most accurate home value prediction.
Zillow Prize: Zillow's Home Value Prediction (Zestimate) | Kaggle
And the Zillow Prize Goes to…Team ChaNJestimate!
Meet the ‘Zillow Prize’ winners who get $1M and bragging rights for beating the Zestimate
https://github.com/jordanmeyer
Data Scientist Spotlight: Jordan Meyer
The Drill to Detail Podcast returns for a new series with our host, Mark Rittman, joined by Lloyd Tabb, Founder CTO and Chairman of Looker to talk about the foundational story of Looker and LookML, query latency and semantic models, analytic engines and code IDEs, analytics developer workflows and the rise of cloud elastically-scalable databases, packaged applications and embedded analytics and why learning (and loving) are the long-term keys to analytics and business success.
“7 Reasons Looker Built a New Language for Data”
“How do you decide what to model in dbt vs LookML?” - Tristan Handy
“Looker co-founder finds freedom and inspiration on his bike rides” - BizJournals.com
“A simple explanation of Symmetric Aggregates or: Why On Earth Does My SQL Look Like That?” - Daniel Mintz
Mark Rittman is joined in this Looker JOIN 2018 Special by long-term friends of the show Tristan Handy from Fishtown Analytics, and Stewart Bryson from Red Pill Analytics to talk about dbt and enabling data engineering for data analysts; the state of modern data analytics consulting today, and what we’re looking forward to hearing about at next week’s Looker JOIN 2018 conference in San Francisco, CA.
Mark Rittman is joined in this episode by Jonathan Palmer from King Games to talk about the role of analytics in the development of Candy Crush Saga and other King games, their use of Looker along with Google BigQuery and Exasol to provide analytics capabilities to their game designers and product owners and his approach to doing all of this in a fast-moving, technology-driven internet business.
- Candy Crush Saga article on Wikipedia
- King Games company website
- “How King Games is Crushing Games Data” DataIQ article
- Looker and King Games case study
- Jonathan Palmer on LinkedIn
Mark Rittman is joined by Neil Barton, Chief Technology Officer at WhereScape to talk about metadata-driven data warehouse design, automating the build and management of data warehouse infrastructure and the thinking behind his company's WhereScape Red and Wherescape 3D tools.
In this specially-extended episode just before ODTUG KScope'18, Mark Rittman is joined by Matt Yorke from Qubix to talk about Oracle Essbase Cloud, Oracle Analytics Cloud and the business of Oracle Cloud analytics consulting
Oracle Analytics Partner Forum 2018
Mark Rittman is joined by Yali Sassoon from Snowplow to talk about data pipelines and Hadoop in the cloud; how web analytics evolved from counting pageviews to today's event-level analysis of consumer behavoir across all digital channels; why digital analytics is hard but interesting; and Snowplow's approach to building a successful hybrid open-source/commercial software business that competes successfully with megavendors such as Google and Adobe.
Snowplow Insights commercial hosted service details
Evolving Your Pipeline - Yali Sassoon - Snowplow Berlin Meetup #3
Mark Rittman is joined in this episode by Greg Michaelson from DataRobot, talking about the benefits of automating the discovery and automation of analytics and machine learning in financial services and other industries.
DataRobot: Automated Machine Learning for Predictive Modeling
DataRobot for Business Analysts
Automated Machine Learning Drives Intelligent Business (Jen Underwood, Information Week article)
Marketing Attribution, Artificial Intelligence, and Game Theory
Greg Michaelson LinkedIn Profile
Mark Rittman is joined by ThoughtSpot's Chief Data Evangelist Doug Bordonaro to talk about the value of data, issues around trust and consent raised by the EU's new GDPR regulations, and how ThoughtSpot are applying ideas from search engines combined with artificial intelligence smarts to surface insights and drive real value for business users from their analytics investment.
Value Becomes the 5th “V” in Big Data Factors
Mark Grover LinkedIn Profile and Github Profile
"Hadoop Application Architectures"
"Drill to Detail Ep. 7 'Apache Spark and Hadoop Application Architectures'
"Software Engineer to Product Manager" blog by Gwen Shapira
"Introduction to the Oracle Data Integrator Topology" from the Oracle Data Integrator docs site
Apache Airflow and Amazon Kinesis homepages
"Experimentation in a Ridesharing Marketplace" by Nicholas Chamandy, Head of Data Science at Lyft
"How Uber Eats Works with Restaurants"
"Deliveroo has built a bunch of tiny kitchens to feed more hungry Londoners" - Wired.co.uk
Mark Rittman is joined by Special Guest Fangjin Yang to talk about the history of Druid, a high-performance, column-oriented, distributed data store originally developed by the team at Metamarkets to provide fast ad-hoc access to large amounts of event-level marketing data, and his work at Imply to commercialise Druid and build a suite of supporting query and data management tools.
Druid project homepage
Druid - A Real-Time Analytical Data Store (pdf)
Druid - Learning about the Druid Architecture
Imply.io homepage
Druid, Imply and Looker 5 bring OLAP Analysis to BigQuery’s Data Warehouse
Mark Rittman is joined in this 50th Episode Special by our original guest on the first episode of Drill to Detail, Stewart Bryson, to talk about developing agile BI applications using FiveTran, SnowflakeDB and Looker and his recent work developing a BI solution for Google Play Marketing using Google Data Studio and Google Cloud Platform. We're also joined later in the show by Alex Gorbachev from Pythian, our mystery guest who Stewart then interviews flawlessly armed only with a set of questions given to him as the guest was unveiled ... though be sure to listen past the final closing music for the bonus out-takes.
#115 Google Play Marketing with Dom Elliott and Stewart Bryson
The Next-Generation Jump Program
Mark Rittman is joined by Will Davis from Trifacta to talk about the public beta of Google Cloud Dataprep, Trifacta's data wrangling platform and topics including metadata management, data quality and data management for big data and cloud data sources.
Google Cloud Dataprep on Google Cloud Platform
"Google Cloud Dataprep: Spreadsheet-Style Data Wrangling Powered by Google Cloud Dataflow"
"A New Cloud-Based Data Prep Solution from Google & Trifacta"
Trifacta website
"A Breakthrough Approach to Exploring and Preparing Data"
Trifacta platform architecture
- Oracle Designer page on Oracle.com
- Bitmap Index page on Wikipedia
- Mondrian project page on Github
- Mondrian OLAP Server page on Wikipedia
- MultiDimensional eXpressions (MDX) page on Wikipedia
- Apache Calcite project homepage
- Apache Calcite Introduction and Overview deck
- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California
Mark is joined by long-term industry veteran and friend Christian Berg to talk about surviving fifteen years as a contractor in analytics industry, changes he's seen in the market and in how project are approached, the value in getting involved in the community, and in a specially extended Christmas and New Year edition we look back at what was topical in 2017 and what are Christian's predictions for 2018 ... and appoint Christian as Head of our Best Practices Found on the Internet.
Mark Rittman is joined in this episode of Drill to Detail by Dr. Carsten Bange from BARC to talk about findings from the recently completed BI Survey 17 including the continuing move to modern BI platforms and self-service desktop tools, analytics adoption trends and the increasing incorporation of BI functionality within business applications, the surprising topicality of master data management and data governance ... and whatever happened to Nigel Pendse and his legendary OLAP Report?
Mark Rittman is joined in this episode by returning special guest Jen Underwood to talk about what's new and innovative in the BI and analytics industry right now, and how AI and machine learning are this year's data discovery and data visualization.
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
Mark is joined by Mike Durran from the Oracle Analytics Product Management team in this UKOUG Tech’17 special to talk about his route into product management via the Oracle Discoverer BI tool, Oracle’s latest product in this space Oracle Data Visualization Desktop 4 and its new features, and Mike’s upcoming sessions at the UK Oracle User Group’s Tech’17 event next week in Birmingham, UK.
Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differs from the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.
Mark is joined in this episode by Google Cloud Platform Developer Advocate Felipe Hoffa, talking about getting started as a developer using Google BigQuery along with Google Cloud Dataflow, Google Cloud Dataprep and Google Cloud Platform's machine learning APIs.
Mark Rittman is joined in this episode by Taylor Brown from Fivetran to talk about middleware for SaaS data, their focus on integrations with SaaS vendors and how this differentiates their offering, his thoughts on packaged analytic applications announced at the recent Looker Join conference ... and where the name "Fivetran" came from.
In this episode Mark is joined by ex-colleague and now Technical Advisor to Gluent, Michael Rainey, to talk about hybrid platforms and Gluent's new cloud offload capability, the Hadoop market in-general and his thoughts on data engineering and the recently-released AWS Glue data integration service.
Drill to Detail returns for a new season with special guest Jean-Pierre Dijcks, to talk about Oracle's Big Data Strategy now and in the past, thoughts on distributed query and storage in the cloud, and previewing themes and announcements to look forward to at the upcoming Oracle Open World 2017 event running in San Francisco next month
Mark Rittman is joined in this Summer Special episode by none other than Cameron Lackpour, Essbase expert and Oracle ACE Director, to talk about why and how Essbase won the OLAP wars, how Essbase Server works and the role it now plays in Oracle Analytics Cloud and his involvement with user groups over the years. In this specially extended edition he also gives us his reading recommendations for while you're at the pool or, as he will be, out camping, and he also shares his predictions for what we'll hear from Oracle and the analytics industry when he, and Drill to Detail, returns in the autumn after a well-deserved summer break.
Mark Rittman is joined by Industry Analyst Mark Madsen to talk about marketing analytics and the rise of the omni-channel consumer, the use of AI in analytics and personalization and what this all means for brands, for advertisers and for marketers.
In this episode Mark is joined by Jake Stein to talk about Stitch Data and their ETL tool for data engineers, the new open-source project Singer and his experiences building a software startup that both partners and competes with the big cloud platform vendors.
Mark Rittman is joined by Donald Farmer to talk about his work at Microsoft on SQL Server Analysis Services and Integration Services, why he moved to Qlik and the challenges of evolving a BI product strategy from focusing on desktops to focusing on the enterprise, and some advice for customers, software vendors and partners working with data and analytics tools.
In this episode Mark is joined by Tristan Handy from Fishtown Analytics to talk about building-out analytics functions in high-growth startups, and three related blog posts he wrote on this topic.
Mark is joined by Qubit colleague Will Browne to talk about a recent academic paper co-authored with Mike Swarbrick Jones on conversion optimisation techniques in the eCommerce industry. Using analytics and statistical analysis On 20 billion "user journeys" recorded in Qubit's Google Cloud Platform-hosted Customer Data Store this paper compares techniques using data and machine learning to those based on traditional sales techniques to see whether data trumps emotion ... or both have their place.
Mark is joined by returning special guest Dan McClary to talk about data modeling and database design on distributed query engines such as Google BigQuery, the underlying Dremel technology and columnar storage format that enables this cloud distributed data warehouse-as-a-service platform to scale to petabyte-size tables spanning tens of thousands of servers, and techniques to optimize BigQuery table joins using nested fields, table partitioning and denormalization.
Oracle's Jack Berkowitz joins Mark Rittman to talk about a new category of continuously adapting, self-learning applications being built-out by Oracle that use machine learning together with enterprise and third-party data to create a new generation of intelligent HR, CX, SCM and ERP SaaS apps.
Stewart Bryson returns to the show to join Mark Rittman to discuss new-world BI and data warehousing development using Google BigQuery and Amazon Athena, Apache Kafka and StreamSets, and talks about his experiences with Looker, the cloud-native BI tool that brings semantic modeling and modern development practices to the world of business intelligence.
Mark Rittman is joined in this episode by Independent Consultant Adrian Ward to talk about Oracle Business Analytics, Data Visualization, the BI Applications and his new book on Oracle Business Intelligence 12c.
Mark Rittman is joined by Gwen Shapira from Confluent to talk about Apache Kafka, streaming data integration and how it differs from batch-based, GUI-developed ETL development, the problem with architects, exactly-once processing and how data governance is coming to Kafka development with Confluent's new schema registry server.
Mark Rittman is joined by Maxime Beauchemin to talk about analytics and data integration at Airbnb, the Apache Airflow and Superset open-source projects he helped launch and now works with day-to-day at Airbnb , and his recent Medium article on "The Rise of the Data Engineer".
Mark Rittman is joined by Timo Elliott, originally of Business Objects and now Innovation Evangelist for SAP, to talk about the origins of self-service BI with Business Objects' innovative "Universe" and the role analytics now plays within SAP; why analytics is the most important function within your organization and why the vast majority of analytics is still reporting (which isn't so bad); and the role AI and other innovations will play in analytics going in the future.
Mark Rittman is joined by Kevin Madden and Josh Feingold to talk about graph + spatial analytics, Tom Sawyer Software ... and why a tweet about a certain WiFi kettle incident went viral last October.
Mark Rittman is joined by Daniel Mintz from Looker to talk about BI and analytics on Google BigQuery, data modelling on the new generation of cloud-based distributed-data warehousing platforms, and Looker's re-introduction of semantic models to big data analytics developers.
Mark Rittman is joined by Craig Stewart to talk about application and data integration, ODI and Sunopsis, SnapLogic's approach to hybrid on-premise/cloud integration and the rise of data preparation and dataflow-based cloud integration tools.
Mark Rittman is joined by Independent Consultant Chris Webb to talk about MDX & DAX, MSAS and SQL SQL Server and the fall ... and rise, of Microsoft BI
Mark Rittman is joined in this episode by MapR's Tugdall Grall to talk about MapR's platform differentation and relationship with open-source Hadoop, scaling and streaming, microservices, and MapR's platform strategy around big data workloads in the cloud.
Mark Rittman is joined by Elastic's Mark Walkom to talk about Elasticsearch, Kibana, Logstash and the Elastic Stack; business models built-around an open-source software core; and their move into cloud services with Elastic Cloud
Mark Rittman is joined by Vasu Murthy, Oracle's Senior Director for Product Management of Oracle Business Analytics to talk about what's new with OBIEE and Oracle Data Visualization and the recently released Oracle Analytics Cloud, a dive into the technical architecture of these new additions to Oracle's BI platform, and Oracle's vision for hybrid on-prem/cloud analytics.
En liten tjänst av I'm With Friends. Finns även på engelska.