1. EachPod

#24 - Moving to the Lakehouse: From Hive to Iceberg

Author
Chris Lettieri
Published
Mon 25 Mar 2024
Episode Link
https://bitsofchris.com/p/24-moving-to-the-lakehouse-from-hive

Change is hard.

But it’s necessary.

In this Data Engineering episode, you'll learn:

* Hive tracks data as folders, Iceberg tracks data as files

* How this key distinction enables Iceberg with powerful metadata operations

* What a data lakehouse is in Data Engineering

* Iceberg’s schema evolution, partition evolution, and better query pruning



This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit bitsofchris.com

Share to: