144lucky Goto Github PK
Name: 墨水心
Type: User
Name: 墨水心
Type: User
12306智能刷票,订票
hbase2mongo
markdown content of garyelephant's blog.http://garyelephant.me
Hive UDF's for the data warehouse
DC/OS Build and Release tools
Quickly build arbitrary size Hadoop Cluster based on Docker
hbase-tools try easy to use and test the hbase,
hbase operations
一个支持多数据源的ETL数据导入/导出工具
Some useful custom hive udf functions, especial array and json functions.
A simple aggregate function (UDAF) for Hive -- like max() but it allows you to refer to additional columns in the maximal row.
Productivity-centric Python data analysis framework for SQL systems and the Hadoop platform. Co-founded by the creator of pandas
Mirror of Apache CarbonData (Incubating)
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Used to monitor the consumer status of Kafka clusters, as well as offsets, metadata and other information.
This application, Kafka ES Indexer, will read the messages from Kafka, processes (if needed) and batch index them into ElasticSearch.
Python client for Apache Kafka
High Performance Kafka Consumer for Spark Streaming. Now Support Kafka 0.10
Pluggable Kafka offset manager for use in Spark streaming jobs
Some information about Apache Kylin interaction with Pentaho Mondrian
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark. Kylo is licensed under Apache 2.0 and contributed by Think Big, A Teradata Company
Lock tailing on your rotating files
摩拜单车爬虫
美团对 flume 的扩展和改进
Kafka delivery semantics in the case of failure depend on how and when offsets are stored. Spark output operations are at-least-once. So if you want the equivalent of exactly-once semantics, you must either store offsets after an idempotent output, or store offsets in an atomic transaction alongside output.There is Spark Streaming how to store Kafka topic offset with HBase.
用户画像相关的参考代码
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.