Sveriges mest populära poddar

The GeekNarrator

Demystifying Real-time Analytics, Search and Hybrid Search with Dhruba, CTO @Rockset

75 min • 17 maj 2024

In this video, I talk to Dhruba, CTO @Rockset about search and realtime analytics. We discussed deep internals of Rockset, its architecture and why is it a great fit for search and realtime analytics use cases. Chapters: 00:00 Introduction 02:45 The Evolution of Data Systems: From Hadoop to Rockset 07:30 Understanding Rockset: Real-Time Analytics and Search Defined 12:01 The Technical Edge: Rockset vs. Elasticsearch 18:16 Deep Dive into Rockset's Architecture and Internals 28:21 Partitioning, Hashing, and Data Distribution in Rockset 36:56 Exploring Hot Storage and Cache Layers 37:40 Why Hot Storage is Essential for Low Latency 39:05 Optimizing Data Storage with Compression and Delta Encoding 39:49 Balancing Cost and Performance in Data Storage 41:50 The Power of Converged Indexing in Rockset 45:50 Efficient Query Execution and Index Management 54:51 Leveraging Mutability for Real-Time Analytics 59:24 Deep Dive into Query Processing and Optimization 01:04:21 Understanding Joins and Reporting Queries in Rockset 01:12:23 Future Directions and Vector Search Innovations Index Conference: https://rockset.com/index-conf/ Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator If you like this episode, please hit the like button and share it with your network. Also please subscribe if you haven't yet. Database internals series: https://youtu.be/yV_Zp0Mi3xs Popular playlists: Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA- Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17 Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN Stay Curios! Keep Learning! #rockset #elasticsearch #search #vectorsearch #realtime #databases #sql #joins #indexes

Kategorier
Förekommer på
00:00 -00:00