Giter Club home page Giter Club logo

flume-hdfs-sink's Introduction

/**
 * Licensed to the Apache Software Foundation (ASF) under one
 * or more contributor license agreements.  See the NOTICE file
 * distributed with this work for additional information
 * regarding copyright ownership.  The ASF licenses this file
 * to you under the Apache License, Version 2.0 (the
 * "License"); you may not use this file except in compliance
 * with the License.  You may obtain a copy of the License at
 *
 *     http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

This modify release can load data to parquet table

hdfs sink add impala table data loading logic.

add more jar dependencies:

	cd $FLUME_HOME/lib
	wget http://central.maven.org/maven2/org/apache/hive/hive-jdbc/1.2.1/hive-jdbc-1.2.1.jar
	wget http://central.maven.org/maven2/org/apache/hive/hive-service/1.2.1/hive-service-1.2.1.jar
	wget http://central.maven.org/maven2/org/apache/hive/hive-common/1.2.1/hive-common-1.2.1.jar
	wget http://central.maven.org/maven2/org/apache/hive/hive-metastore/1.2.1/hive-metastore-1.2.1.jar

update jar dependencies:

	cd $FLUME_HOME/lib
	rm httpcore*.jar httpclient*.jar
	wget http://central.maven.org/maven2/org/apache/httpcomponents/httpcore/4.3/httpcore-4.3.jar
	wget http://central.maven.org/maven2/org/apache/httpcomponents/httpclient/4.3/httpclient-4.3.jar

config example:

	agtest.sources =rudpl
	agtest.sinks =hdfs-sink
	agtest.channels =cudpl

	agtest.sources.rudpl.type = netcat
	agtest.sources.rudpl.bind = localhost
	agtest.sources.rudpl.port = 44444

	agtest.sinks.hdfs-sink.type = hdfs
	agtest.sinks.hdfs-sink.hdfs.path = hdfs://cdh-master:8020/tmp/test1
	agtest.sinks.hdfs-sink.hdfs.fileType = DataStream
	agtest.sinks.hdfs-sink.hdfs.batchSize = 3

	# custom hdfs-impala configure
    agtest.sinks.hdfs-sink.partitionFormat=yyyyMMddHH
    agtest.sinks.hdfs-sink.refCtimeColumn=createtime
  	agtest.sinks.hdfs-sink.tableName=default.test1,default.test1_txt
  	agtest.sinks.hdfs-sink.impalaUrl=jdbc:hive2://192.168.0.94:21050/;auth=noSasl

	agtest.channels.cudpl.type = memory
	agtest.channels.cudpl.capacity = 1000
	agtest.channels.cudpl.transactionCapacity = 100

	agtest.sources.rudpl.channels = cudpl
	agtest.sinks.hdfs-sink.channel = cudpl

start flume-ng

	bin/flume-ng agent -c conf -f conf/flume-conf.properties -name agtest &

flume-hdfs-sink's People

Contributors

lidaling avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.