Materias dentro de su búsqueda.
Materias dentro de su búsqueda.
- Big data 159
- Data mining 110
- Spark (Electronic resource : Apache Software Foundation) 107
- Machine learning 79
- Electronic data processing 71
- Python (Computer program language) 58
- Apache Hadoop 57
- Management 50
- Application software 49
- Cloud computing 49
- Distributed processing 49
- Development 48
- Database management 43
- Computer programs 36
- Artificial intelligence 32
- Data processing 31
- History 24
- Historia 23
- Design 21
- Open source software 21
- Leadership 19
- Novela inglesa 19
- Big Data 17
- Computer programming 17
- Java (Computer program language) 17
- Scala (Computer program language) 17
- Information technology 16
- Success in business 16
- Technological innovations 16
- Creative ability in business 15
-
41Publicado 2018Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
42por Nabi, Zubair. authorTabla de Contenidos: “…Chapter 1: The Hitchhiker's Guide to Big Data -- Chapter 2: Introduction to Spark -- Chapter 3: DStreams: Realtime RDDs -- Chapter 4: High Velocity Streams: Parallelism and Other Stories -- Chapter 5: Real-time Route 66: Linking External Data Sources -- Chapter 6: The Art of Side Effects -- Chapter 7: Getting Ready for Prime Time -- Chapter 8: Real-time ETL and Analytics Magic -- Chapter 9: Machine Learning at Scale -- Chapter 10: Of Clouds, Lambdas, and Pythons…”
Publicado 2016
Libro electrónico -
43
-
44Publicado 2015Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
-
45Publicado 2016Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
46Publicado 2017Tabla de Contenidos: “…Introduction high performance Spark -- 2. How Spark works -- 3. Dataframes, datasets, and Spark SQL -- 4. …”
Libro electrónico -
47Publicado 2018Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
-
48Publicado 2017Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
49Publicado 2017Tabla de Contenidos: “…Analyzing big data -- Introduction to data analysis with Scala and Spark -- Recommending music and the audioscrobbler data set -- Predicting forest cover with decision trees -- Anomaly detection in network traffic with K-means clustering -- Understanding Wikipedia with latent semantic analysis -- Analyzing co-occurrence networks with GraphX -- Geospatial and temporal data analysis on the New York City taxi trip data -- Estimating financial risk through Monte Carlo simulation -- Analyzing genomics data and the BDG project -- Analyzing neuroimaging data with PySpark and Thunder…”
Libro electrónico -
50por Mishra, Raju Kumar. authorTabla de Contenidos: “…Chapter 1: The Era of Big Data, Hadoop, and Other Big Data Processing Frameworks -- Chapter 2: Installation -- Chapter 3: Introduction to Python and NumPy -- Chapter 4: Spark Architecture and Resilient Distributed Dataset -- Chapter 5: The Power of Pairs: Paired RDD -- Chapter 6: IO in PySpark -- Chapter 7: Optimizing PySpark and PySpark Streaming -- Chapter 8: PySparkSQL -- Chapter 9: PySpark MLlib and Linear Regression…”
Publicado 2018
Libro electrónico -
51Publicado 2018Tabla de Contenidos: “…. -- Chapter 7: Structured Streaming with PySpark -- Introduction -- Understanding Spark Streaming…”
Libro electrónico -
52Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
53Publicado 2017Tabla de Contenidos: “…Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewer -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Getting Started with Spark SQL -- What is Spark SQL? -- Introducing SparkSession -- Understanding Spark SQL concepts -- Understanding Resilient Distributed Datasets (RDDs) -- Understanding DataFrames and Datasets -- Understanding the Catalyst optimizer -- Understanding Catalyst optimizations -- Understanding Catalyst transformations -- Introducing Project Tungsten -- Using Spark SQL in streaming applications -- Understanding Structured Streaming internals -- Summary -- Chapter 2: Using Spark SQL for Processing Structured and Semistructured Data -- Understanding data sources in Spark applications -- Selecting Spark data sources -- Using Spark with relational databases -- Using Spark with MongoDB (NoSQL database) -- Using Spark with JSON data -- Using Spark with Avro files -- Using Spark with Parquet files -- Defining and using custom data sources in Spark -- Summary -- Chapter 3: Using Spark SQL for Data Exploration -- Introducing Exploratory Data Analysis (EDA) -- Using Spark SQL for basic data analysis -- Identifying missing data -- Computing basic statistics -- Identifying data outliers -- Visualizing data with Apache Zeppelin -- Sampling data with Spark SQL APIs -- Sampling with the DataFrame/Dataset API -- Sampling with the RDD API -- Using Spark SQL for creating pivot tables -- Summary -- Chapter 4: Using Spark SQL for Data Munging -- Introducing data munging -- Exploring data munging techniques -- Pre-processing of the& -- #160 -- household electric consumption Dataset -- Computing basic statistics and aggregations -- Augmenting the Dataset -- Executing other miscellaneous processing steps -- Pre-processing of& -- #160 -- the weather Dataset…”
Libro electrónico -
54Publicado 2015Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
-
55Publicado 2018Tabla de Contenidos: “…Gentle overview of big data and Spark. What is Apache Spark? -- A gentle introduction to Spark -- A tour of Spark's toolset -- Part 2. …”
Libro electrónico -
56
-
57Publicado 2020Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
58Publicado 2017Materias: “…Spark (Electronic resource : Apache Software Foundation)…”
Libro electrónico -
59Publicado 2015Tabla de Contenidos: “…Analyzing big data -- Introduction to data analysis with Scala and Spark -- Recommending music and the audioscrobbler data set -- Predicting forest cover with decision trees -- Anomaly detection in network traffic with K-means clustering -- Understanding Wikipedia with latent semantic analysis -- Analyzing co-occurrence networks with GraphX -- Geospatial and temporal data analysis on the New York City taxi trip data -- Estimating financial risk through Monte Carlo simulation -- Analyzing genomics data and the BDG project -- Analyzing neuroimaging data with PySpark and Thunder…”
Libro electrónico -
60por Singh, Pramod. authorTabla de Contenidos: “…Chapter 1: Introduction to PySpark -- Chapter 2: Data Processing -- Chapter 3: Spark Structured Streaming -- Chapter 4: Airflow -- Chapter 5: Machine Learning Library (MLlib) -- Chapter 6: Supervised Machine Learning -- Chapter 7: Unsupervised Machine Learning -- Chapter 8: Deep Learning Using PySpark…”
Publicado 2019
Libro electrónico