Maintaining Your Data Lake At Scale With Spark
Data Engineering Podcast
JUNE 16, 2019
In this episode Michael Armbrust, the lead architect of Delta Lake, explains how the project is designed, how you can use it for building a maintainable data lake, and some useful patterns for progressively refining the data in your lake. How does this unified interface resolve the shortcomings and complexities of that approach?
Let's personalize your content