Subjects within your search.
- History 203
- Universidad Pontificia de Salamanca (Spain) 136
- Medical microbiology & virology 108
- Microbiology (non-medical) 92
- Science: general issues 92
- Criticism and interpretation 87
- Cyril 54
- Bible 52
- History (Catalan heading: Història) 52
- Patristics 49
- Cyril of Alexandria 47
- Criminal law 41
- Philosophy 40
- Law 39
- Images and idols 38
- Arroyo, Eduardo 36
- Virology 36
- Collections 33
- Commercial law 30
- Spanish drama 30
- Spanish literature 29
- Spanish poetry 27
- Idols and images 27
- Education 26
- Architecture 25
- Criticism and interpretation (Catalan heading: Crítica i interpretació) 25
- Catholic Church 25
- Music 25
- Universidad Pontificia de Salamanca, Facultad de Educación 25
- Methodius 24
- 5256
Published 2024
Table of Contents: “…Chapter 5: Big Data Processing with Apache Spark -- Technical requirements -- Getting started with Spark -- Installing Spark locally -- Spark architecture -- Spark executors -- Components of execution -- Starting a Spark program -- The DataFrame API and the Spark SQL API -- Transformations -- Actions -- Lazy evaluation -- Data partitioning -- Narrow versus wide transformations -- Analyzing the titanic dataset -- Working with real data -- How Spark performs joins -- Joining IMDb tables -- Summary -- Chapter 6: Building Pipelines with Apache Airflow -- Technical requirements -- Getting started with Airflow -- Installing Airflow with Astro -- Airflow architecture -- Airflow's distributed architecture -- Building a data pipeline -- Airflow integration with other tools -- Summary -- Chapter 7: Apache Kafka for Real-Time Events and Data Ingestion -- Technical requirements -- Getting started with Kafka -- Exploring the Kafka architecture -- The PubSub design -- How Kafka delivers exactly-once semantics -- First producer and consumer -- Streaming from a database with Kafka Connect -- Real-time data processing with Kafka and Spark -- Summary -- Part 3: Connecting It All Together -- Chapter 8: Deploying the Big Data Stack on Kubernetes -- Technical requirements -- Deploying Spark on Kubernetes -- Deploying Airflow on Kubernetes -- Deploying Kafka on Kubernetes -- Summary -- Chapter 9: Data Consumption Layer -- Technical requirements -- Getting started with SQL query engines -- The limitations of traditional data warehouses -- The rise of SQL query engines -- The architecture of SQL query engines -- Deploying Trino in Kubernetes -- Connecting DBeaver with Trino -- Deploying Elasticsearch in Kubernetes -- How Elasticsearch stores, indexes and manages data -- Elasticsearch deployment -- Summary -- Chapter 10: Building a Big Data Pipeline on Kubernetes…”
E-book
- 5260
Published 2023
Table of Contents: “…Introduction to Data Ingestion -- Principals of Data Access – Accessing your Data -- Data Discovery – Understanding Our Data Before Ingesting It -- Reading CSV and JSON Files and Solving Problems -- Ingesting Data from Structured and Unstructured Databases -- Using PySpark with Defined and Non-Defined Schemas -- Ingesting Analytical Data -- Designing Monitored Data Workflows -- Putting Everything Together with Airflow -- Logging and Monitoring Your Data Ingest in Airflow -- Automating Your Data Ingestion Pipelines -- Using Data Observability for Debugging, Error Handling, and Preventing Downtime…”
E-book