Data engineering with Apache Spark, Delta Lake, and Lakehouse create scalable data pipelines and networks that ingest, process, and store complex data
Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an industry expert in big data Key Features Become well-versed with the core concepts of Apache Spark and Delta Lake for building data platforms Le...
Otros Autores: | |
---|---|
Formato: | Libro electrónico |
Idioma: | Inglés |
Publicado: |
Birmingham ; Mumbai :
Packt Publishing
[2021]
|
Edición: | 1st edition |
Materias: | |
Ver en Biblioteca Universitat Ramon Llull: | https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009644272506719 |
Tabla de Contenidos:
- Table of Contents The Story of Data Engineering and Analytics Discovering Storage and Compute Data Lake Architectures Data Engineering on Microsoft Azure Understanding Data Pipelines Data Collection Stage - The Bronze Layer Understanding Delta Lake Data Curation Stage - The Silver Layer Data Aggregation Stage - The Gold Layer Deploying and Monitoring Pipelines in Production Solving Data Engineering Challenges Infrastructure Provisioning Continuous Integration and Deployment (CI/CD) of Data Pipelines.