handanchen Goto Github PK
Type: User
Type: User
All articles from website-www.kisspuppet.com
Java资源大全中文版,包括开发库、开发工具、网站、博客、微信、微博等,由伯乐在线持续更新。
微信小程序开发资源汇总 :100:
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
阿里巴巴mysql数据库binlog的增量订阅&消费组件
CDH集群环境Hdfs、MapReduce、Hive、Hbase、Kafka、Solr、Spark、Zookeeper、Mahout示例代码
Content Data Store (HDFS/HBase)
自己动手做聊天机器人教程
云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件
Multipurpose tool for discovering and collecting Cloudera Manager metrics.
各大电商网站数据抓取分析
2013年4月:一个爬行去哪儿网(qunar.com)数据的爬虫脚本。提供了一种爬行AJAX类型网站数据的方法。
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
Creating Scrapy scrapers via the Django admin interface
Full featured redis cache backend for Django.
Crawl book and rating infomations from Douban App
dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具
电商爬虫系统:京东,当当,一号店,国美爬虫(代理使用)
easyrec
Elite Proxies (http://elite.proxies.online) middleware for scrapy http://rev.proxies.online
Remedy small files by combining them into larger ones.
Very simple search engine "specialised" in searching financial news (written using Nutch, Hbase, Solr, SpringBoot, Bootstrap and AngularJS)
Mirror of Apache Hadoop
Example implementation of hadoop CombineFileInputFormat
hbase+solr实现hbase的二级索引
Lily HBase Indexer - indexing HBase, one row at a time
通过solr实现hbase二级索引,主要通过hbase的coprocessor的Observer实现。
Utility to easily copy files into HDFS
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.