Podd: Gnarly Data Waves by Dremio

EP59: Charting the Course: The Evolution and Future of Apache Iceberg and Polaris

29 oktober 2024 | 39 min

EP58 - Transforming the Landscape of Real-Time Data Analytics

29 oktober 2024 | 15 min

EP57 - From Hadoop & Hive to Minio & Dremio: Moving Towards a Next Gen Data Architecture

8 oktober 2024 | 64 min

EP56 - What’s New in Dremio: Improved Automation, Performance + Catalog for Iceberg Lakehouses

10 september 2024 | 32 min

EP55 - Unite Data Across Dremio, Snowflake, Iceberg, and Beyond

21 augusti 2024 | 14 min

EP54 - Mastering Semantic Layers: The Key to Data-Driven Innovation

20 augusti 2024 | 51 min

EP53 - Build the next-generation Iceberg lakehouse with Dremio and NetApp

19 augusti 2024 | 48 min

Apache Iceberg Lakehouse crash course

15 juli 2024 | 4 min

EP51 - Scania’s Journey in Navigating and Implementing Data Mesh

3 juli 2024 | 58 min

EP52 - The Best of the Subsurface Data Lakehouse Conference 2024

25 juni 2024 | 42 min

Join us for a captivating recap of Subsurface 2024—the leading conference at the intersection of data engineering, open source technology, and modern data architecture. This webinar will distill: - highlights of the conference, - curated clips of inspiring keynotes, - insightful discussions on real-world data lakehouse implementations by industry leaders such as Nomura, NetApp, and Blue Cross. - and deep dives into the transformative potential of open source projects like Apache Iceberg, Apache XTable, and Ibis. Whether you missed the conference or want to revisit its most impactful moments, this webinar offers a unique opportunity to stay ahead of the curve in the rapidly evolving data landscape. Don't miss this chance to gain valuable insights from the experts and innovators who are shaping the future of data. - Article on Dremio Auto-Ingest: https://www.dremio.com/blog/introducing-auto-ingest-pipes-event-driven-ingestion-made-easy/ - Article on Dremio and Hybrid Data Lakehouses (Vast, Netapp, Minio): https://www.dremio.com/blog/3-reasons-to-create-hybrid-apache-iceberg-data-lakehouses/ --------------------------------------------------------------- Get Hands-on with the Data Lakehouse ---------------------------------------------------------------- - Apache Iceberg Lakehouse on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop - SQLServer to Iceberg to Dashboard: https://bit.ly/am-sqlserver-dashboard - MongoDB to Iceberg to Dashboard: https://bit.ly/am-mongodb-dashboard - Postgres to Iceberg to Dashboard: https://bit.ly/am-postgres-to-dashboard - MySQL to Iceberg to Dashboard: https://bit.ly/am-dremio-mysql-dashboard - Elasticsearch to Iceberg to Dashboard: https://bit.ly/am-dremio-elastic - Apache Druid to Iceberg to Dashboard: https://bit.ly/am-druid-dremio - JSON/CSV/Parquet to Iceberg to Dashboard: https://bit.ly/am-json-csv-parquet-dremio - From Kafka to Iceberg to Dremio: https://bit.ly/am-kafka-connect-dremio - Lowering Snowflake Costs with Dremio: https://bit.ly/am-dremio-snowflake-spend

EP50 - Optimize Analytics Workloads with Dremio + Snowflake

10 juni 2024 | 20 min

EP49 - What’s New in Dremio: New Capabilities for the Best Apache Iceberg Lakehouse

17 april 2024 | 28 min

EP48 - Understanding the Dremio Data Lakehouse

1 april 2024 | 28 min

GDW CE Workshop 1 - Getting Started with Dremio: Build a Data Lakehouse on your Laptop

29 mars 2024 | 58 min

Ready to revolutionize your data management approach and learn how to maximize your environment with Dremio? Watch Alex Merced in this workshop where he’ll guide you step-by-step through building a lakehouse on your laptop with Dremio, Nessie and Minio. This is a great opportunity to try out many of the best features Dremio offers. You'll learn how to: - Read and write Apache Iceberg tables on your object storage, cataloged by Nessie, - Create views in the semantic layer, - And much more GDW Community Edition Workshop Description: This hands-on workshop, participants will embark on a journey to construct their very own data lakehouse platform using their laptops. The workshop is designed to introduce and guide participants through the setup and utilization of three pivotal tools in the data lakehouse architecture: Dremio, Nessie, and Apache Iceberg. Each of these tools plays a crucial role in enabling the flexibility of data lakes with the efficiency and ease of use of data warehouses aiming to simplify and economize data management. You will start by setting up a Docker environment to run all necessary services, including a notebook server, Nessie for catalog tracking with Git-like versioning, Minio as an S3-compatible storage layer, and Dremio as the core lakehouse platform. The workshop will provide a practical, step-by-step guide to federating data sources, organizing and documenting data, and performing queries with Dremio; tracking table changes and branching with Nessie; and creating, querying, and managing Apache Iceberg tables for an ACID-compliant data lakehouse. Prerequisites for the workshop include having Docker installed on your laptop. You will be taken through the process of creating a docker-compose file to spin up the required services, configuring Dremio to connect with Nessie and Minio, and finally, executing SQL queries to manipulate and query data within their lakehouse. This immersive session aims to not just educate but to empower attendees with the knowledge and tools needed to experiment with and implement their data lakehouse solutions. By the end of the workshop, participants will have a functional data lakehouse environment on their laptops, enabling them to explore further and apply what they have learned to real-world scenarios. Whether you're looking to improve your data management strategies or curious about the data lakehouse architecture, this workshop will provide a solid foundation and practical experience.

EP47 - Learn how to reduce your Snowflake cost by 50%+ with a lakehouse

27 mars 2024 | 35 min

EP46 - Getting Started with Dremio

25 mars 2024 | 60 min

EP45 - Next-Gen Data Pipelines are Virtual: Simplify Data Pipelines with dbt, Dremio, and Iceberg

22 februari 2024 | 40 min

EP44 - How S&P Global is Building an Azure Data Lakehouse with Dremio

31 januari 2024 | 22 min

EP43 - Empowering Analytics: Unleashing the Power of Dremio Cloud on Microsoft Azure

19 januari 2024 | 54 min

In this session, Dremio and Microsoft will delve into the exciting developments surrounding the public preview launch of Dremio Cloud on Microsoft Azure. This presentation will provide a comprehensive exploration of how businesses are strategically operationalizing their data lakes, with a particular focus on unlocking the vast potential residing within Azure Storage. Attendees will gain valuable insights into the transformative journey toward harnessing the full benefits of a data lakehouse. The discussion will guide participants through the myriad possibilities that emerge when leveraging Dremio Cloud seamlessly on Azure, offering a holistic approach to executing analytics pipelines. This integration eliminates the need for costly data warehouses, presenting a revolutionary paradigm shift. A step-by-step walkthrough will illuminate the process of landing data within the lakehouse, followed by seamlessly progressing data through a virtual semantic layer. This strategic approach adds significant business meaning and value, enhancing the overall utility of the data before it is surfaced to end users. The session will also shed light on the noteworthy performance improvements and cost savings achieved by reducing data extract expenses associated with Power BI workloads. By embracing Dremio Cloud on Azure, organizations can elevate their analytical capabilities while optimizing operational costs, marking a pivotal advancement in the realm of data management and analytics. Join us as we explore the forefront of innovation in data lake operationalization and witness the tangible benefits of this dynamic integration. Watch and learn how Jonny Dixon, Sr. Product Manager at Dremio and Hanno Borns, Principal Product Manager at Microsoft Azure will look into: - Problems companies face with existing analytical architectures - How Dremio and Microsoft Azure work together - What Dremio Cloud on Azure is, and the value it provides - How the Dremio Cloud on Azure solution works, with a demo

EP42 - What's new in Dremio: New GenAI capabilities, advance for 100% query success + now on Azure

20 december 2023 | 26 min

EP41 - ZeroETL & Virtual Data Marts: The Cutting Edge of Lakehouse Architecture

18 december 2023 | 42 min

MEETUP: ZeroETL & Virtual Data Marts - Orlando Data Professionals Meetup

15 december 2023 | 66 min

Workshop: Build an Iceberg Lakehouse in 60 minutes

14 december 2023 | 59 min

EP40 - How Dremio provides you fast and easy data access while saving you money

13 december 2023 | 29 min

EP39 - How To Build an Iceberg Data Lakehouse with Fivetran and Dremio

12 december 2023 | 43 min

EP38 - Building a Data Science Platform on Apache Iceberg and Nessie

11 december 2023 | 32 min

EP37 - How NetApp is Redefining the Customer Experience with Product Analytics

19 oktober 2023 | 40 min

EP36 - Simplify Lakehouse Operations with Zero-Copy Clones and Multi-Table Transactions

13 oktober 2023 | 31 min

EP35 - Your Lakehouse Just Got Gnarlier: What’s New in Dremio, including Next Gen Reflections

9 oktober 2023 | 39 min

EP34 - Materialized Views and Dremio Reflections

8 oktober 2023 | 21 min

EP33 - The Who, What and Why of Data Lakehouse Table Formats (Apache Iceberg, Delta Lake and Apache Hudi)

6 oktober 2023 | 60 min

EP32 - Introduction to Dremio Arctic: Catalog Versioning and Iceberg Table Optimization

15 september 2023 | 49 min

EP31 - ELT, ETL and Dremio Data Lakehouse

13 september 2023 | 37 min

EP29 - Simplify data governance at scale across all your data

29 augusti 2023 | 48 min

EP28 - Apache Iceberg Office Hours

22 augusti 2023 | 42 min

EP27 - How Maersk is Building A Next Gen Data Lakehouse with Dremio

8 augusti 2023 | 50 min

EP26 - Versioning Data in the Data Lakehouse (File, Table and Catalog Versioning)

27 juli 2023 | 34 min

EP25 - Building a Data Lakehouse on Azure Data Lake Storage

27 juli 2023 | 38 min

EP24 - Simplifying Data Mesh with Dremio's Open Data Lakehouse

12 juli 2023 | 50 min

EP23 - Getting Started With Dremio Data Reflections

28 juni 2023 | 35 min

EP22 - Dremio and Data Lakehouse Table Formats (Apache Iceberg, Delta Lake and Apache Hudi & Dremio)

21 juni 2023 | 58 min

EP21 - Data as Code with Dremio Arctic: ML Experimentation & Reproducibility on the Lakehouse

16 juni 2023 | 60 min

EP20 - What's New in the Apache Iceberg Project: Updates, PyIceberg, Compute Engines

7 juni 2023 | 54 min

EP19 - Data Mesh In Practice: Accelerating Cancer Research with Dremio's Data Lakehouse

1 juni 2023 | 40 min

EP18 - Best Practices for Modernizing Your Hadoop Workloads to AWS with Dremio

24 maj 2023 | 38 min

EP17 - Unified Access for Your Data Mesh Self Service Data with Dremio's Semantic Layer

18 maj 2023 | 44 min

EP16 - Easy Data Lakehouse Management with Dremio Arctic’s Automatic Data Optimization

10 maj 2023 | 32 min

EP15 - Getting Started with Dremio’s Data Lakehouse

3 maj 2023 | 61 min

EP14 - Enabling Data Mesh with Dremio Arctic and Data as Code

1 maj 2023 | 43 min

EP13 - Making the Move: Five Factors to Consider When Migrating from Hadoop to the Data Lakehouse

25 april 2023 | 54 min

EP12 - How to Modernize Hive to the Data Lakehouse with Dremio and Apache Iceberg

12 april 2023 | 64 min

EP11 - Apache Iceberg Office Hours - Apache Iceberg 1.2.0 has been released

5 april 2023 | 36 min

EP10 - Optimizing Data Files in Apache Iceberg Performance Strategies

31 mars 2023 | 42 min

EP9 - Build your open data lakehouse Iceberg with Fivetran and Dremio

22 mars 2023 | 42 min

EP8 - Managing your data as code with Dremio Arctic

15 mars 2023 | 49 min

EP 7 - Getting Started with Hadoop Migration and Modernization

22 februari 2023 | 43 min

EP6 - Total Economic Impact of Data Lakehouse

15 februari 2023 | 44 min

EP5 - Apache Iceberg Office Hours - Apache Iceberg Partitioning Explanation

8 februari 2023 | 51 min

EP4 - Best Practices for Optimizing Tableau Dashboards with Dremio

1 februari 2023 | 50 min

EP 3 - Migrating from Delta Lake to Iceberg

25 januari 2023 | 61 min

EP2 - Migrating a BI Dashboard to your Data Lakehouse with Apache Superset and Dremio

21 januari 2023 | 55 min

EP1 - Getting Started with Dremio’s Data Lakehouse

21 januari 2023 | 58 min

Gnarly Data Waves by Dremio

Gnarly Data Waves is a weekly show about the world of Data Analytics and Data Architecture.

Om podden

Avsnitt