webinfo-crawler's Introduction


Running the project

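  1. Install dependencies
pip install -r requirements.txt    # if you get a permission error, rerun with sudo
  2. Create the database schema

Run the db/data.sql script to create the tables, then edit the database connection settings in config/settings.py.

  3. Set the search keywords

Add keywords in tianyancha.py, or load them from a cache or the database.

  4. Run in the background. The project does its own log collection, so nohup's standard output is not needed and is discarded.
# Tianyancha (天眼查)
nohup python3 tianyancha.py >/dev/null 2>&1 &
# Qichacha (企查查)
nohup python3 qichacha.py >/dev/null 2>&1 &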
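
The keyword step leaves the source open: keywords can be hard-coded in tianyancha.py or read from a cache or the database. The project's actual loading code is not shown in this README, so the following is only a sketch of one way to combine such sources by trying each in order. `load_keywords`, `DEFAULT_KEYWORDS`, and the source callables are hypothetical names, not taken from the project.

```python
from typing import Callable, Iterable, List, Optional

# Hypothetical fallback; the project keeps its own keywords in tianyancha.py.
DEFAULT_KEYWORDS = ("example company",)

# A source is any zero-argument callable returning keywords (or None).
KeywordSource = Callable[[], Optional[Iterable[str]]]


def _dedupe(keywords: Iterable[str]) -> List[str]:
    """Strip whitespace, drop empty strings, remove duplicates, keep order."""
    seen = set()
    result = []
    for kw in keywords:
        kw = kw.strip()
        if kw and kw not in seen:
            seen.add(kw)
            result.append(kw)
    return result


def load_keywords(sources: Iterable[KeywordSource],
                  default: Iterable[str] = DEFAULT_KEYWORDS) -> List[str]:
    """Return keywords from the first source that yields any, else the default."""
    for source in sources:
        try:
            found = source()
        except Exception:
            # Assumption: a cache or database outage should not stop the crawl.
            continue
        cleaned = _dedupe(found or [])
        if cleaned:
            return cleaned
    return _dedupe(default)
```

A caller could pass something like `load_keywords([read_from_cache, read_from_db])`, where both functions are placeholders for whatever cache and database access the project uses.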

Project structure

├── LICENSE
├── README.md
├── config
│   ├── __init__.py
│   ├── env.py
│   └── settings.py
├── const
│   ├── __init__.py
│   └── constants.py
├── db
│   ├── __init__.py
│   ├── data.sql
│   ├── model
│   │   ├── __init__.py
│   │   └── model.py
│   └── mysql_connector.py
├── qichacha
│   ├── __init__.py
│   ├── client.py
│   ├── crawler.py
│   └── manager.py
├── qichacha.log
├── qichacha.py
├── requirements.txt
├── tianyancha
│   ├── __init__.py
│   ├── client.py
│   ├── crawler.py
│   └── manager.py
├── tianyancha.log
├── tianyancha.py
├── tree.md
└── util
    ├── __init__.py
    ├── date.py
    ├── httpclient.py
    └── log.py
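
The tree shows one log file per crawler (qichacha.log, tianyancha.log) alongside util/log.py. The contents of util/log.py are not reproduced in this README, so the following is only a sketch of how a per-crawler file logger could be built with the standard library. `get_logger` and its parameters are hypothetical, and the rotation limits are arbitrary.

```python
import logging
from logging.handlers import RotatingFileHandler


def get_logger(name: str, log_file: str,
               max_bytes: int = 10 * 1024 * 1024,
               backup_count: int = 3) -> logging.Logger:
    """Return a logger that writes to log_file, rotating when it grows large."""
    logger = logging.getLogger(name)
    if logger.handlers:
        # Already configured; avoid attaching a duplicate handler.
        return logger
    handler = RotatingFileHandler(log_file, maxBytes=max_bytes,
                                  backupCount=backup_count, encoding="utf-8")
    handler.setFormatter(logging.Formatter(
        "%(asctime)s %(levelname)s %(name)s: %(message)s"))
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    # Do not pass records to the root logger, so nothing depends on stdout.
    logger.propagate = False
    return logger
```

With a logger of this kind, a call such as `get_logger("tianyancha", "tianyancha.log")` writes records straight to the file, which is why a background run can safely send stdout and stderr to /dev/null.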

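Disclaimer: This project is for technical exchange only. The author does not provide technical support for any act that infringes on the interests of others or violates national law, and accepts no responsibility for any consequences of using this project's code for other purposes.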
