Hands-on big data analytics with pyspark analyze large datasets and discover techniques for testing, immunizing, and parallelizing spark jobs

Use PySpark to easily crush messy data at-scale and discover proven techniques to create testable, immutable, and easily parallelizable Spark jobs Key Features Work with large amounts of agile data using distributed datasets and in-memory caching Source data from all popular data hosting platforms,...

Full description

Bibliographic Details
Other Authors: Lai, Rudy, author (author), Potaczek, Bartłomiej, author
Format: eBook
Language:Inglés
Published: Birmingham ; Mumbai : Packt Publishing 2019.
Edition:1st edition
Subjects:
See on Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009630455606719

Similar Items