Data engineering with Apache Spark, Delta Lake, and Lakehouse create scalable data pipelines and networks that ingest, process, and store complex data

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an industry expert in big data Key Features Become well-versed with the core concepts of Apache Spark and Delta Lake for building data platforms Le...

Descripción completa

Detalles Bibliográficos
Otros Autores: Kukreja, Manoj, author (author)
Formato: Libro electrónico
Idioma:Inglés
Publicado: Birmingham ; Mumbai : Packt Publishing [2021]
Edición:1st edition
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009644272506719
Tabla de Contenidos:
  • Table of Contents The Story of Data Engineering and Analytics Discovering Storage and Compute Data Lake Architectures Data Engineering on Microsoft Azure Understanding Data Pipelines Data Collection Stage - The Bronze Layer Understanding Delta Lake Data Curation Stage - The Silver Layer Data Aggregation Stage - The Gold Layer Deploying and Monitoring Pipelines in Production Solving Data Engineering Challenges Infrastructure Provisioning Continuous Integration and Deployment (CI/CD) of Data Pipelines.