Sveriges mest populära poddar

Catalog & Cocktails: The Honest, No-BS Data Podcast

Catalog & Cocktails: Bonus Episode with John Kutay

61 min • 3 september 2022

Lucid Streaming; how to take full advantage of data streaming


  • Data streaming, Stream Processing, Real-time analytics, operational analytics — what is this? What’s the difference?
  • Most important use cases for data streaming
  • There are lots of misconceptions especially for the MDS crowd (not as much enterprise) between fast batch vs streaming
  • Memory-first processing (in-memory) vs disk space batch jobs
  • Change data capture (and only capture of change)
  • Data warehouses are now tying to support streaming more (like Snowflake)
  • This will be a big deal to make it so that more streaming can happen
  • Streaming warehouses (Rockset, Materialize) vs data streaming
  • Lineage - transformed data - can I trust this data I'm looking at
  • How does data streaming and lineage come together? What’s unique about lineage in a streaming context?
  • If time: what does it mean to do streaming data products in a data mesh context?


Kategorier
Förekommer på
00:00 -00:00