Josh Patterson (@datametrician, Co-Founder & CEO @VoltronData) talks about the concept of composable data analytics and how it benefits our industry. What is it, why should you be using it, and how do you get started?
SHOW: 694
CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw
NEW TO CLOUD? CHECK OUT - "CLOUDCAST BASICS"
SHOW SPONSORS:
SHOW NOTES:
Topic 1 - Hello Josh, and welcome to the show. You have a very diverse and interesting background. Can you give everyone a quick introduction? As a follow-up, tell everyone a little bit about your experience as a Presidential Innovation Fellow.
Topic 2 - Before we dig into Voltron Data, we need to tell everyone about Apache Arrow. Businesses and organizations tend to be overwhelmed by big data: everything from the volume, to the tools, to the lack of data scientists and practitioners. Can you give everyone an overview of Arrow, how it came to be, and what problem it solves?
Topic 3 - Arrow has companies like Snowflake, Netflix, Meta, Databricks, Google and Microsoft all adopting it. Our listeners will be more familiar with Snowflake & Databricks and their business models. What makes Voltron Data different? How are you building a company on top of OSS?
Topic 4 - Let’s talk about communities and standards. I’ve seen various numbers on Arrow's monthly downloads, always in the tens of millions per month. Your focus appears to be providing services for Arrow and other Apache projects to simplify open source for those who don’t have the skills or time, while also working towards the goal of community standards. Is that correct?
Topic 5 - How will open source standards for data help the data analytics industry move faster? Is this a process problem? A data set problem? A tools problem?
Topic 6 - Data analytics has a reputation for a high barrier to entry. If our listeners are interested, how can they get started?
FEEDBACK?