Sveriges mest populära poddar

Data Driven

Dave Wentzel on Why You Don’t Need a Data Warehouse

73 min • 2 maj 2021

In this episode of Data Driven, Frank and Andy chat with Philadelphia Microsoft Technology Center Data Architect Dave Wentzel on why you do not need a data warehouse.

Also, Frank discusses leaving Microsoft, Frank and Andy talk about five seasons of Data Driven, and even BAILeY has a sentimental moment.

Transcripts

00:00:00 BAILey 

Hello and welcome to data driven, the podcast where we explore the emerging wait a tick. This is the premiere episode of Season Five. Can you believe it? Data driven started four years ago this month. 

00:00:14 BAILey 

Up until last season, we had a human doing the voiceover work. That is until she was replaced by an AI. Yours truly. 

00:00:23 BAILey 

In this episode, Frank and Andy speak to Dave Wensel about why you don't need a datawarehouse. We're starting off the new season with a bit of contrarian tone. 

00:00:33 BAILey 

It's a lively back and forth conversation that runs contrary to prevailing wisdom. Don't say we didn't warn you? Now on with the show. 

00:00:41 Frank 

Hello and welcome to data driven. The podcasts were we wait a minute. We've been saying this Andy for four years now. Can you believe it? 

00:00:48 Andy 

Four years, that's crazy talk. 

00:00:52 Frank 

That's just craziness. So I think when you and I first talked about this and that was that fateful, I think it was December was right after Thanksgiving. But before Christmas, I was thinking about starting a podcast and as a data scientist, I needed someone. 

00:01:01 Andy 

Yeah, yeah. 

00:01:09 Frank 

That was a data engineer that could kind of round out the talent there and and and and obviously I wanted someone I knew, liked, and trust. 

00:01:11 Frank 

Found out. 

00:01:11 Frank 

 

00:01:22 Frank 

And so it was you. 

00:01:25 Andy 

Well, I'm just glad all of the real smart data engineers you knew were busy. That's all I got to say. 

00:01:25 Frank 

Much. 

00:01:30 Frank 

Ah, no man. You were the first one. I reached out to and the only one I would have done it with it. So I was delighted when you said yes because starting a podcast can sound like a daunting thing, particularly if you haven't done it before. 

00:01:44 Andy 

Yeah, neither one of us really had. And gosh, it's it's worked out. What are we up to? 180,000 downloads or something? I mean that's. 

00:01:52 Frank 

Something. 

00:01:53 Frank 

Like that about hundred 8000 downloads. I mean, we're not Joe Rogan, but that's OK, Yep. 

00:01:55 

Yeah. 

00:01:57 Andy 

No. 

00:01:59 Andy 

Yep, Yep. 

00:01:59 Andy 

Yep. 

00:02:01 Frank 

But you know what, we we we've impacted. I think the community in a significant way. We've we've done a number of things we've we've innovative how we podcast. 

00:02:12 Frank 

Uh, we we've actually managed to keep a good cadence with some exceptions. 

00:02:18 Andy 

Yeah, thanks. 

00:02:19 Frank 

You know, we we finally did earlier this year or late last year, kind of fulfill our vision of it being data driven TV when we actually interviewed guests on. 

00:02:27 Andy 

Yes. 

00:02:32 Frank 

On video. 

00:02:33 Frank 

And that was that actually delayed the launch of the show by about three months. 

00:02:38 Andy 

It did but also uhm. Yeah, that was interesting, but you know it's typical software development, right? You release a feature and then you debug it. The I have this saying about that Frank. All software is tested some intentionally. 

00:02:52 Frank 

Sometimes. 

00:02:53 Andy 

Right? 

00:02:56 Frank 

I love it, but I also like how, how, how both our careers have evolved over the last four years. And dumb, you know, this being the premiere episode of Season 5 and we have something special lined up, but I'll get to that in a minute. 

00:02:58 Andy 

Hello. 

00:03:03 Andy 

Oh gosh, itch. 

00:03:11 Andy 

June. 

00:03:12 Frank 

You've progressed in your career. We, you and I've worked on some some projects together or virtual Summit. What we're calling Ring Gate, which will announce very very soon and and but. But most of all, is been my kind of skilling up in transition into data engineering myself. 

00:03:29 BAILey 

Ehm 

00:03:31 Frank 

Which was something that when I joined, so this is just a job update about a year ago. I I left the role of Microsoft kind of field sales and I went into the Microsoft Technology Center stick with me. There's a point to this story and basically I was at the rest in MTC. 

00:03:52 Frank 

And basically I was the AI guy on my my my field sales team, but I didn't really have deep knowledge of kind of the typical typical data engineering pipe work that goes into that role and basically my my. My then manager said you know he's like hey, you know, if you want this role, you've got a skill. 

00:04:12 Frank 

And skill up I did. And with Andy's mentoring and a bunch of other folks that helped me kind of skill up on our the data engineering side. I looked at it this morning. I'm like 88 hours on Pluralsight. 

00:04:25 Frank 

Wow, that was from mid may till we're recording this on April 30th. So just about a year 88 hours right now tracking on about 200 four 205 consecutive days of getting on LinkedIn. I'm not on LinkedIn on Pluralsight, LinkedIn learning. I also have a number of courses too. 

00:04:31 Andy 

Yeah. 

00:04:43 Frank 

Uh, that is something I'm proud of in terms of career evolution. 

00:04:47 Andy 

Absolutely Frank, you should be. How many cirts are you up to now? 

00:04:50 Frank 

I 87. 

00:04:53 Andy 

Slacker. 

00:04:54 Frank 

I know, I know. 

00:04:54 Frank 

Know, I know. 

00:04:54 Andy 

I think I've got 4. 

00:04:56 Frank 

Ah, now I know you and I did the data engineering thing, so you have at least 11. 

00:05:00 Andy 

That's true, that's true. We did that one and you know that was it's just. It's just been a nice journey and I'll take credit for this. 'cause 'cause I can I was. I was actually pestering you years ago. We've been friends since 2005 and we started doing. 

00:05:20 Andy 

Code camps here in the Richmond area. 

00:05:22 Andy 

Together and co-founded RE co-founded Richmond SQL Server Users Group and you know, worked with the net users group and stuff. And I told you as soon as I saw some of your graphic art and Frank would do a keynote for the Richmond code camps and every time he would make movie posters, the one that. 

00:05:41 Frank 

Oh yeah. 

00:05:42 Andy 

Still sticks out is 1 called devs on a plane. 

00:05:45 Frank 

Ha ha ha. 

00:05:49 Andy 

Oh yeah, I loved that one that was so so cool and. 

00:05:49 Andy 

And that was. 

00:05:49 Andy 

 

00:05:54 Andy 

You know I saw the graphic arts part of it and I just knew I said you, you'd be really good in analytics and data visualization. You should get into by and you were busy doing other stuff which was cool. You were good at that too. It wasn't, you know you. I don't know of anything you've done that you haven't mastered. By thank you. You know you when. 

00:06:14 Andy 

Things took a took, uh, started taking a turn for you in your first rodeo at Microsoft. You got into it and and took off with it. I don't. I won't tell the story well, but you just really turned around. You focused on data and. 

00:06:32 Andy 

You know, I'll say this Frank. I was right. 

00:06:35 Frank 

Well, with that he totally I. I think if anything I took away is I should have listened to Andy 10 years earlier. 

00:06:36 Dave 

You aren't very good. 

00:06:40 Frank 

Uhm? 

00:06:41 Frank 

And that that that that is something that that that that's the big takeaway we'll talk about, kind of that journey. 'cause I think that's worth kind of talking about. And I think one of the things we you, and I've been bouncing around is kind of interviewing each other. 

00:06:46 

We 

00:06:55 Frank 

Like in asking one of us those those those questions we have, so we definitely will do that, but not today kids. 

00:06:55 Frank 

Yeah. 

00:06:55 Frank 

 

00:06:59 Dave 

We need to. 

00:06:59 Dave 

Need to. 

00:07:02 Andy 

Today, do we have Dave? 

00:07:02 Andy 

Today do Dave. 

00:07:03 Frank 

Today we have a special guest we have Dave Wentzel. Dave Wentzel is a was a peer of mine when I worked at the Microsoft MTC and that reminds me, I no longer work at Microsoft 2 weeks ago was my last day. I turned in my second blue badge. 

00:07:05 Frank 

Yeah. 

00:07:05 Frank 

 

00:07:18 Frank 

And I joined a startup called electrify. We'll talk about them a later day, but I'm so excited to have Dave here. Dave is the data in AI architect out of the Philadelphia Microsoft Technology Center, and he's an awesome guy. Awesome, got to work with. I worked with him when I was in field sales and I worked with him when I was in the MTC organization. 

00:07:38 Frank 

It is April. It was a privilege and honor Dave to have you as a colleague, and it's once again a privilege and an honor to have you here as a guest on data driven. 

00:07:46 Dave 

Well, thank you so much, appreciate that. 

00:07:47 Andy 

Welcome Dave. 

00:07:49 Dave 

Thank you. 

00:07:51 Frank 

So, uhm, so for folks that don't know what the MTC is. Shocking that there are actually people that don't know what that is, what? What is the MTC? 

00:08:00 Dave 

So basically we're a free service to our customers and I'm a data and AI technology architect. We talked to customers about data and it could be anything from just, you know. Hey, here's what we're doing. State of the art in Azure with. 

00:08:16 Dave 

With data, but it could also be architectural design sessions where we talk to customers. Our customers bring us their architectures, and then we kind of get it with them. Give them the pros and cons, alternative ways of thinking, and then what I really enjoy doing is hackathons with customers and workshops and just you know, helping them to learn without just. 

00:08:37 Dave 

Taking a course somewhere so actually using their data and then I guess I'm roughly a data scientist, so we also do design thinking sessions and those are absolutely a lot of fun. 

00:08:48 Dave 

We did one at the MTC with CSL Behring a couple years ago and it actually won a Forrester Award. So I'm very proud of that one. And yeah, it's it's a. It's a lot of fun and it's a good way to bring to have executives and business people understand the actual capabilities of data science. And then within two days be able to come up with a use case. 

00:08:55 Andy 

Oh wow, wow. 

00:09:08 Dave 

And actually build a prototype out a lot of fun. 

00:09:11 Frank 

Yeah, the NPC's are definitely like Microsoft Secret weapon in terms of how 'cause you know. Although I will say and because we were in the DC and we dealt with a lot of government contracts, we could not say that they were a free service. They were and already included paid for service. 

00:09:26 

That's. 

00:09:26 Dave 

Much, much better said yes. 

00:09:28 Frank 

I I 'cause I said free once and I got kind of slapped. 

00:09:31 Frank 

On the hand, say that. 

00:09:34 Frank 

But you know it, it really is something that if you do have a Microsoft account team and you are encountering any kind of questions or or whatever, and it's not strictly technical, there's also pretty good. You know, we basically wouldn't engage with the business development, business decision makers. 

00:09:52 Frank 

Technical decision makers all the way from kind of like you know, hey, this is what Azure can do. This is what data can do for you all the way down to OK. What's your problem? Let's build something out, give you 3 days with one of the top Notch architects in the. 

00:10:04 Frank 

Space and. 

00:10:07 Frank 

You know, boom, you know we knock it out and and you know I I enjoyed it you know had this opportunity not come I would have I would have gladly stayed another. You know 5-10 years of the MTC. Like a lot of people do, and it's a fun organization. So with that in mind, today we're going to do something a little different. We're kind of doing the. 

00:10:27 Frank 

A contrarian approach is that right, Dave. 

00:10:29 Dave 

Yes, exactly. 

00:10:31 Frank 

So this this has actually come up one of my last. This is one of the things that intrigued me about about your idea for the show was this came up when I was working with a we'll just call it a large governmental agency known for its. 

00:10:42 Frank 

Birds. 

00:10:42 Frank 

 

00:10:43 Frank 

Tape. 

00:10:44 Frank 

That that should keep it generic enough. They basically came to us and say we want Synapse. We want a data Lake. We want this. We want that. And I was like, OK, well how much data you're talking about. And like we have maybe you know 5 maybe 20 gigs of data. 

00:11:02 Frank 

And I'm like, uh, OK, tell me what are you trying to do? And ultimately I kind of pitched the idea like look, you know you don't have that much data right to make data bricks. 

00:11:14 Frank 

But you really want it so. 

00:11:17 Frank 

If you really want it, I won't stop you, but I think it's kind of overkill. I think you're taking instead of using a steak knife to cut the steak using a chainsaw. 

00:11:25 Frank 

And. 

00:11:27 Frank 

You know they kind of came back and ultimately what won the day was they already they couldn't get approval for whatever we recommended 'cause it didn't get stamped by there. 

00:11:37 Frank 

They're people for security usage yet, and things like that so they end up doing kind of the right thing because of their own bureaucracy, which. 

00:11:44 Frank 

It's kind of weird. It's kind of like dividing by zero and seeing the universe fold in on itself. 

00:11:50 Frank 

But UM, so the topic of today is kind of like no, you don't need a data warehouse. Did I get that right? 

00:11:58 Dave 

Exactly, that's what I believe in, and I believed in it since I was in college and I first learned about data warehouses. I'm not saying data warehouses are always bad, they definitely have their. 

00:12:10 Dave 

Use cases, but in 2021 when we're talking about advanced analytics and we're trying to tell customers you need to be more predictive than prescriptive. 

00:12:19 Dave 

The data warehouse really doesn't deliver. 

00:12:23 Frank 

Really, how so? 'cause? That's that's totally not the power. Certainly not the party line. I'm not going to say which party it was. You can figure it out but but why, why why would you say that? 

00:12:33 Dave 

OK so take. 

00:12:33 Dave 

A step back here, right? We're all data consultants, or we were at some point in our life and probably most of the listeners are. And if you've been doing this, I've been doing this since the mid 90s in college and when I first started I had an internship with a consumer package. Good company, they made candy. 

00:12:52 Dave 

Hours and they said, hey, we wouldn't want to do an internship and take a look at our data and figure out where is the best spot to put candy on a shelf so that we sell more candy to kids, right? So we used data for that at the time that was known as business Intelligence in the industry. Nowadays business intelligence means something totally different. In reality, it's really closer to what? 

00:13:12 Dave 

Today we would call data science right? So my tools of choice were SQL, although I didn't know what SQL was at the time and we had this goofy SQL engine and and essentially something called ESP's, which is roughly the equivalent of like our or stats package something. 

00:13:28 Dave 

Like that and we kind of looked at data as just, you know I have data and let me find the Nuggets of gold and I'm not going to concern myself with schema and that is I think the biggest problem with data warehouses. But take a, you know a metal layer higher right? Talk to the average business executive like a you know a CTO or CEO. 

00:13:48 Dave 

And tell them, as a consultant, you're going to go in and build them a data warehouse. 

00:13:53 Dave 

Instantly,...

Förekommer på
00:00 -00:00