Beginning Apache Spark Using Azure Databricks Unleashing Large Cluster Analytics in the Cloud

Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fractio...

Descripción completa

Detalles Bibliográficos
Autor principal: Ilijason, Robert. author (author)
Formato: Libro electrónico
Idioma:Inglés
Publicado: Berkeley, CA : Apress 2020.
Edición:1st ed. 2020.
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009631959206719
Tabla de Contenidos:
  • Chapter 1: Introduction to Large-Scale Data Analytics
  • Chapter 2: Spark and Databricks
  • Chapter 3: Getting Started with Databricks
  • Chapter 4: Workspaces, Clusters, and Notebooks
  • Chapter 5: Getting Data into Databricks
  • Chapter 6: Querying Data Using SQL
  • Chapter 7: The Power of Python
  • Chapter 8: ETL and Advanced Data Wrangling
  • Chapter 9: Connecting to and from Afar
  • Chapter 10: Running in Production
  • Chapter 11: Bits and Pieces.