Apache Oozie The Workflow Scheduler for Hadoop
Get a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases. Once you set up your...
Main Author: | |
---|---|
Other Authors: | |
Format: | eBook |
Language: | Inglés |
Published: |
Sebastopol, California :
O'Reilly Media
2015.
|
Edition: | First edition |
Subjects: | |
See on Biblioteca Universitat Ramon Llull: | https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009628671406719 |
Table of Contents:
- ""Copyright""; ""Table of Contents""; ""Foreword""; ""Preface""; ""Contents of This Book""; ""Conventions Used in This Book""; ""Using Code Examples""; ""Safari® Books Online""; ""How to Contact Us""; ""Acknowledgments""; ""Chapter 1. Introduction to Oozie""; ""Big Data Processing""; ""A Recurrent Problem""; ""A Common Solution: Oozie""; ""A Simple Oozie Job""; ""Oozie Releases""; ""Some Oozie Usage Numbers""; ""Chapter 2. Oozie Concepts""; ""Oozie Applications""; ""Oozie Workflows""; ""Oozie Coordinators""; ""Oozie Bundles""; ""Parameters, Variables, and Functions""
- ""Application Deployment Model""""Oozie Architecture""; ""Chapter 3. Setting Up Oozie""; ""Oozie Deployment""; ""Basic Installations""; ""Requirements""; ""Build Oozie""; ""Install Oozie Server""; ""Hadoop Cluster""; ""Start and Verify the Oozie Server""; ""Advanced Oozie Installations""; ""Configuring Kerberos Security""; ""DB Setup""; ""Shared Library Installation""; ""Oozie Client Installations""; ""Chapter 4. Oozie Workflow Actions""; ""Workflow""; ""Actions""; ""Action Execution Model""; ""Action Definition""; ""Action Types""; ""MapReduce Action""; ""Java Action""; ""Pig Action""
- ""FS Action""""Sub-Workflow Action""; ""Hive Action""; ""DistCp Action""; ""Email Action""; ""Shell Action""; ""SSH Action""; ""Sqoop Action""; ""Synchronous Versus Asynchronous Actions""; ""Chapter 5. Workflow Applications""; ""Outline of a Basic Workflow""; ""Control Nodes""; "" and ""; "" and ""; """"; """"; "" and ""; ""Job Configuration""; ""Global Configuration""; ""Job XML""; ""Inline Configuration""; ""Launcher Configuration""; ""Parameterization""; ""EL Variables""; ""EL Functions""; ""EL Expressions""; ""The job.properties File""
- ""Command-Line Option""""The config-default.xml File""; ""The Section""; ""Configuration and Parameterization Examples""; ""Lifecycle of a Workflow""; ""Action States""; ""Chapter 6. Oozie Coordinator""; ""Coordinator Concept""; ""Triggering Mechanism""; ""Time Trigger""; ""Data Availability Trigger""; ""Coordinator Application and Job""; ""Coordinator Action""; ""Our First Coordinator Job""; ""Coordinator Submission""; ""Oozie Web Interface for Coordinator Jobs""; ""Coordinator Job Lifecycle""; ""Coordinator Action Lifecycle""; ""Parameterization of the Coordinator""
- ""EL Functions for Frequency""""Day-Based Frequency""; ""Month-Based Frequency""; ""Execution Controls""; ""An Improved Coordinator""; ""Chapter 7. Data Trigger Coordinator""; ""Expressing Data Dependency""; ""Dataset""; ""Example: Rollup""; ""Parameterization of Dataset Instances""; ""current(n)""; ""latest(n)""; ""Parameter Passing to Workflow""; ""dataIn(eventName):""; ""dataOut(eventName)""; ""nominalTime()""; ""actualTime()""; ""dateOffset(baseTimeStamp, skipInstance, timeUnit)""; ""formatTime(timeStamp, formatString)""; ""A Complete Coordinator Application""
- ""Chapter 8. Oozie Bundles""