The Azure Data Lakehouse Toolkit Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake

Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse usin...

Descripción completa

Detalles Bibliográficos
Otros Autores: L'Esteve, Ron, author (author)
Formato: Libro electrónico
Idioma:Inglés
Publicado: Berkeley, CA : Apress 2022.
Edición:1st ed. 2022.
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009671505606719
Tabla de Contenidos:
  • Part I: Getting Started
  • Chapter 1: The Data Lakehouse Paradigm
  • Part II: Data Platforms
  • Chapter 2: Snowflake
  • Chapter 3: Databricks
  • Chapter 4: Synapse Analytics
  • Part III: Apache Spark ELT
  • Chapter 5: Pipelines and Jobs
  • Chapter 6: Notebook Code
  • Part IV: Delta Lake.-Chapter 7: Schema Evolution
  • Chapter 8: Change Feed
  • Chapter 9: Clones
  • Chapter 10: Live Tables
  • Chapter 11: Sharing
  • Part V: Optimizing Performance
  • Chapter 12: Dynamic Partition Pruning for Querying Star Schemas
  • Chapter 13: Z-Ordering & Data Skipping
  • Chapter 14: Adaptive Query Execution
  • Chapter 15: Bloom Filter Index
  • Chapter 16: Hyperspace
  • Part VI: Advanced Capabilities
  • Chapter 17: Auto Loader
  • Chapter 18: Python Wheels
  • Chapter 19: Security & Controls.