emr-flink-hudi's Introduction

Flink Hudi example

  • Uses the Flink DataGen connector to generate data and write it to a Hudi table; intended for testing the pipeline.
# build
mvn clean package -Dscope.type=provided 
# or use the prebuilt jar
wget https://dxs9dnjebzm6y.cloudfront.net/tmp/emr-flink-hudi-1.0.jar
# copy the EMR hudi-flink-bundle jar into Flink's lib directory
sudo cp /usr/lib/hudi/hudi-flink-bundle.jar /usr/lib/flink/lib/
# set classloader.check-leaked-classloader to false
sudo sed -i -e '$a\classloader.check-leaked-classloader: false' /etc/flink/conf/flink-conf.yaml
# run the job; replace the S3 paths with your own
sudo flink run -m yarn-cluster -yjm 1024 -ytm 2048 -d -ys 4 -p 8 -c com.aws.analytics.DataGen2Hudi /home/hadoop/emr-flink-hudi-1.0.jar -c s3://app-util/flink-data-gen/chk/ -p s3://app-util/flink-data-gen-hudi-02/ -t test_tb

# job parameters are described below
DataGen2Hudi 1.0
Usage: DataGen2Hudi [options]

  -c, --checkpointDir <value>
                           checkpoint dir
  -l, --checkpointInterval <value>
                           checkpoint interval: default 60 seconds
  -p, --hudiPath <value>   hudi path: eg. s3://xxx/xxx/
  -t, --hudiTableName <value>
                           hudi table name
  -r, --rowsPerSecond <value>
                           datagen rows-per-second, default: 100
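
The run command above leaves `-l` and `-r` at their defaults. As a minimal sketch, the script below assembles the same command with every documented option spelled out and prints it for review instead of executing it. The variable names and the `build_cmd` function are hypothetical helpers, not part of the project; the S3 paths are the placeholder values from the example and should be replaced with your own.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: builds the flink run command from variables.
# Edit these values; the S3 paths are the placeholders from the example above.
CHECKPOINT_DIR="s3://app-util/flink-data-gen/chk/"
HUDI_PATH="s3://app-util/flink-data-gen-hudi-02/"
TABLE_NAME="test_tb"
CHECKPOINT_INTERVAL=60   # -l, in seconds (default per the usage text)
ROWS_PER_SECOND=100      # -r, datagen rows per second (default per the usage text)

# Print the full command on one line. The options before the jar path go to
# Flink; the options after it go to DataGen2Hudi.
build_cmd() {
  echo "sudo flink run -m yarn-cluster -yjm 1024 -ytm 2048 -d -ys 4 -p 8" \
       "-c com.aws.analytics.DataGen2Hudi /home/hadoop/emr-flink-hudi-1.0.jar" \
       "-c ${CHECKPOINT_DIR} -l ${CHECKPOINT_INTERVAL}" \
       "-p ${HUDI_PATH} -t ${TABLE_NAME} -r ${ROWS_PER_SECOND}"
}

# Dry run: show the command so it can be checked before running it by hand.
build_cmd
```

Note that `-c` and `-p` each appear twice: before the jar path they are Flink's entry-class and parallelism options, and after it they are the job's checkpoint directory and Hudi path.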
