Giter Club home page Giter Club logo

oozie-scheduler-workflow's Introduction

Oozie-workflow-scheduler

Oozie combines multiple jobs sequentially into one logical unit of work. It is integrated with the Hadoop ecosystems, with MR1, YARN as its architectural center, and supports Hadoop jobs for MapReduce, Spark, Pig, Hive, and Sqoop. Oozie can also schedule jobs specific to a system, like Java programs, python or shell scripts. Also it acts as Cron of unix world as well. It helps scheduling the jobs and tracking the status of it.

1.Install/configure hdfs/yarn cluster. let's create the single node cluster where hdfs/yarn/oozie/db all hosted together.

2.Install/configure oozie and replace the default derby database with mysql or postgresql. here we use mysql.

3.Integrates the oozie library packages to hdfs and get the appropriate permission/ownership in place. Otherwise, it will be difficult to troubleshoot the jobs when it fails.

4.Run mapreduce jobs through Oozie. job/workflow details are in folder mapreduce-jobs.

$oozie job -oozie http://hostname:11000/oozie -config examples/apps/map-reduce/job.properties -run job: 0000014-190123083458183-oozie-oozi-W

$oozie job -oozie http://localhost:11000/oozie -info 0000014-190123083458183-oozie-oozi-W Job ID : 0000014-190123083458183-oozie-oozi-W Workflow Name : map-reduce-wf App Path : hdfs://hostname/user/oozie/wc1 Status : SUCCEEDED Run : 0 User : hadoop Group : - Created : 2019-01-23 00:31 GMT Started : 2019-01-23 00:31 GMT Last Modified : 2014-01-23 00:32 GMT Ended : 2014-01-23 00:32 GMT CoordAction ID: -

Actions ID Status Ext ID Ext Status Err Code 0000014-190123083458183-oozie-oozi-W@:start: OK - OK - 0000014-190123083458183-oozie-oozi-W@mr-node OK job_1401405229971_0022 SUCCEEDED - 0000014-190123083458183-oozie-oozi-W@end OK - OK $

5.Run spark jobs using Oozie workflow. Details are in the folder spark-jobs. follow the same steps as what follwed at running mapreduce job previously.

  1. If there is a confusion around creating/building jars and creating workflow. use the examples mentioned in apache-oozie documentation. sample jobs are added in folder "oozie-scheduler-workflow/oozie-setup/examples".

7.Scheduling Oozie jobs. Once the jobs are functioning fine, we can use the cron like scheduling jobs through "frequency" parameter. Oozie offers a cron like syntax structure to schedule/run the jon at particulat time or in regular interval.

/ / / / hdfs://hostname:8020/user/oozie / / /

oozie-scheduler-workflow's People

Contributors

balajirajan avatar

Stargazers

Pius Arhanbhunde avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.