The Azure Data Lakehouse Toolkit Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake
Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse usin...
Otros Autores: | |
---|---|
Formato: | Libro electrónico |
Idioma: | Inglés |
Publicado: |
Berkeley, CA :
Apress
2022.
|
Edición: | 1st ed. 2022. |
Materias: | |
Ver en Biblioteca Universitat Ramon Llull: | https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009671505606719 |
Tabla de Contenidos:
- Part I: Getting Started
- Chapter 1: The Data Lakehouse Paradigm
- Part II: Data Platforms
- Chapter 2: Snowflake
- Chapter 3: Databricks
- Chapter 4: Synapse Analytics
- Part III: Apache Spark ELT
- Chapter 5: Pipelines and Jobs
- Chapter 6: Notebook Code
- Part IV: Delta Lake.-Chapter 7: Schema Evolution
- Chapter 8: Change Feed
- Chapter 9: Clones
- Chapter 10: Live Tables
- Chapter 11: Sharing
- Part V: Optimizing Performance
- Chapter 12: Dynamic Partition Pruning for Querying Star Schemas
- Chapter 13: Z-Ordering & Data Skipping
- Chapter 14: Adaptive Query Execution
- Chapter 15: Bloom Filter Index
- Chapter 16: Hyperspace
- Part VI: Advanced Capabilities
- Chapter 17: Auto Loader
- Chapter 18: Python Wheels
- Chapter 19: Security & Controls.