Giter Club home page Giter Club logo

pipeline-samples's Introduction

pipeline-samples

These are simple exercises where there are pipelines and functions defined, that explore the usage of Apache Beam

Run on local

mvn package
java -jar target/pipelines-samples-0.1-shaded.jar

Run on local Flink cluster

mvn package -Pflink-runner
cd flink/flink-1.11.0
./bin/flink run /Users/user}/{somePath}/pipeline-samples/target/pipelines-samples-0.1-shaded.jar --runner=FlinkRunner

Run on Docker Flink cluster

Package the jar file as a fat.jar - dependencies included - using the shade plugin

mvn package -Pflink-runner

This will create a jar file in the /target/pipelines-samples-0.1-shaded.jar directory

From a windows Power shell command, start flink with docker compose, with the following commands:

set COMPOSE_CONVERT_WINDOWS_PATH=1
docker-compose up -d

Then bring up the Flink UI, I configured it for port 8888 localhost:8888

flink-ui-snapshot

upload the pipelines-samples-0.1-shaded.jar file, add the program argument :

--runner=FlinkRunner

flink-ui-upload-snapshot

You should be able to run your job, and see the results: screenshot-running-20

Run on Google Dataflow

export GOOGLE_APPLICATION_CREDENTIALS="/Users/{user}/{somePath}/XXX_credentials.json"
gcloud auth application-default login
mvn package -Pdataflow-runner
java -jar target/pipelines-samples-0.1-shaded.jar --runner=DataflowRunner --project=deloitte-beam-284202 --tempLocation=gs://deloitte-beam-sandbox/temp/ --region=us-west1

Run on Amazon EMR (Flink)

mvn package -Pflink-runner
scp -i ~/.ssh/keypair.pem ./target/pipeline-samples-0.1-shaded.jar ec2-user@ec2-xxx-xxx-xxx:/home/hadoop

pipeline-samples's People

Contributors

mrisney avatar

Watchers

James Cloos avatar  avatar

Forkers

somanshushekhar

pipeline-samples's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.