zhangtongle Goto Github PK
Type: User
Type: User
一个python爬虫程序用于爬海量**财经法规存入mysql数据库当中,不断完善当中
1. 主要分为三个模块,一个爬虫抓取模块,一个是数据处理模块,一个是用户模块。 2. 爬虫抓取模块主要是从直播吧、新浪体育、网易体育上爬取有关足球的新闻和用户关于足球的评论,利用集群HADOOP抓取网页,分析得出URL集,提取特征URL 3. 网页linux脚本过滤得到原始网页,然后二次过滤得到文本,并使用分布式储存。 4. 处理模块主要是根据训练集规则一和规则二,得到分词器,然后对文本进行操作,得出训练结果。 5. 通过特征脚本得到训练结果的特征词分类,然后提取出球队模糊集和球星模糊集。 6. 过滤得到球队精确集和球星精确集,并存入MYSQL数据库。 7. 从数据库中提取球星和球队的信息进行图表分析,并动态显示WIKI信息,调入显示模块中和用户进行交换
django实现报表平台.目标:可以嵌入kibana报表.可以展示数据仓库报表
Java Docker API Client
A Python library for the Docker Engine API
Doraemon-接口自动化测试工具
豆瓣爬虫,爬取多部电影的短评和评分,存入数据库
python3.5下的第一个scrapy项目,抓住豆瓣某部电视剧评论,并存入mysql数据库
scrapy框架,多页面爬取豆瓣电影,并将数据写入数据库
开源的 Material Design 豆瓣客户端(A Material Design app for douban.com)
联系微信(1764328791)、抖音API、抖音接口、抖音数据、抖音直播数据、抖音直播Api、抖音视频Api、抖音爬虫、抖音去水印、抖音视频下载、抖音视频解析、抖音直播监控、抖音数据采集、xgorgon
A module that integrates selenium and requests session, encapsulates common page operations, can achieve seamless switching between the two modes.
Exploration how to "auto-implement" a DAO defined as a interface. Vaguely similar to Rails ActiveRecord, but in Java. (DwimDao = "Do What I Mean Data Access Object");
泛微OA e-cology rce批量检测工具
:tea: iOS UI Automation Test Framework
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
快速、简单避免OOM的java处理Excel工具
一款轻量级的图片加载库,图片缓存、压缩,防数据错乱
The IK Analysis plugin integrates Lucene IK analyzer into elasticsearch, support customized dictionary.
Android performance test tool-CPU,memory,network traffic,starting time,battery current and status
Catch common Java mistakes as compile-time errors
Python项目自动化多服务器部署的工具
:green_book: The Beauty of Python Programming.
A Python module making Telnet and SSH easy
Simple, Pythonic remote execution and deployment.
一款入门级的人脸、视频、文字检测以及识别的项目.
up to date simple useragent faker with real world database
前端面试每日 3+1,以面试题来驱动学习,提倡每日学习与思考,每天进步一点!每天早上5点纯手工发布面试题(死磕自己,愉悦大家),4000+道前端面试题全面覆盖,HTML/CSS/JavaScript/Vue/React/Nodejs/TypeScript/ECMAScritpt/Webpack/Jquery/小程序/软技能……
Feign makes writing java http clients easier
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.