Data engineering with Apache Spark, Delta Lake, and Lakehouse create scalable data pipelines and networks that ingest, process, and store complex data

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an industry expert in big data Key Features Become well-versed with the core concepts of Apache Spark and Delta Lake for building data platforms Le...

Descripción completa

Detalles Bibliográficos
Otros Autores:	Kukreja, Manoj, author (author)
Formato:	Libro electrónico
Idioma:	Inglés
Publicado:	Birmingham ; Mumbai : Packt Publishing [2021]
Edición:	1st edition
Materias:	Spark (Electronic resource : Apache Software Foundation) Data mining. Microsoft Azure (Computing platform)
Ver en Biblioteca Universitat Ramon Llull:	https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009644272506719

Tabla de Contenidos:

Table of Contents The Story of Data Engineering and Analytics Discovering Storage and Compute Data Lake Architectures Data Engineering on Microsoft Azure Understanding Data Pipelines Data Collection Stage - The Bronze Layer Understanding Delta Lake Data Curation Stage - The Silver Layer Data Aggregation Stage - The Gold Layer Deploying and Monitoring Pipelines in Production Solving Data Engineering Challenges Infrastructure Provisioning Continuous Integration and Deployment (CI/CD) of Data Pipelines.

Data engineering with Apache Spark, Delta Lake, and Lakehouse create scalable data pipelines and networks that ingest, process, and store complex data

Ejemplares similares