douban_movie's Introduction

Help documentation (Windows)

1. Copy chromedriver.exe from the project directory to C:\Program Files (x86)\Google\Chrome\Application on your computer.

If that directory does not exist, install the Google Chrome browser first.

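2. Add the directory above to the system PATH environment variable.

This PC > right-click > Properties > Advanced system settings > Environment Variables > System variables > Path > Edit > New > paste the directory into the input box > confirm every dialog to save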
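
To confirm that the PATH change took effect, a quick check like the following can help. It is a minimal sketch using only the Python standard library; the helper name `find_chromedriver` is hypothetical and not part of this project. Open a new command prompt before running it, since existing windows keep the old PATH.

```python
import shutil


def find_chromedriver(name="chromedriver"):
    """Return the full path of the driver if it is found on PATH, else None.

    Hypothetical helper for illustration; it is not part of the project.
    On Windows, shutil.which also tries the extensions listed in PATHEXT,
    so "chromedriver" matches "chromedriver.exe".
    """
    return shutil.which(name)


if __name__ == "__main__":
    location = find_chromedriver()
    if location:
        print("chromedriver found at:", location)
    else:
        print("chromedriver is not on PATH; re-check step 2")
```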

3. Set up the Python environment

Activate your Python environment, change to the project directory on the command line, and run: python -m pip install -r requirements.txt

4. Crawler configuration

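FILE_PATH = '.'  # Directory for the result file (`.` means the current directory)
FILE_NAME = 'result.csv'  # Result file name (the short-review file; the prefix can be changed)
MOVIE_CONFIG = {
    'page_limit': 20,  # Number of movies per request (adjustable; values that are too small make an IP ban more likely)
    'page_start': 0,  # Index of the first movie to fetch (default 0; normally left unchanged)
    # Movie category to crawl, one of ['热门', '最新', '经典', '可播放', '豆瓣高分', '冷门佳片', '华语', '欧美', '韩国', '日本', '动作', '喜剧', '爱情', '科幻', '悬疑', '恐怖', '动画']
    'tag': '最新',  # Can be changed to any category above, e.g. `'tag': '经典'`
    'page_total': 200,  # Number of movies to crawl (200-250 recommended; the category may not contain more than that)
}
SHORT_CONFIG = {
    'start': 0,  # Index of the first short review to fetch (default 0; normally left unchanged)
    'short_num': 100  # Number of short reviews to crawl (user-defined; avoid very large values)
}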
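
The crawler's own request logic is not shown here, so the following is only a sketch of how these settings could fit together, assuming the crawler pages through a category in steps of `page_limit` until `page_total` movies have been requested. The helpers `iter_movie_requests` and `result_path` are hypothetical names introduced for illustration.

```python
import os

# Values copied from the configuration shown in step 4.
FILE_PATH = '.'
FILE_NAME = 'result.csv'
MOVIE_CONFIG = {'page_limit': 20, 'page_start': 0, 'tag': '最新', 'page_total': 200}


def iter_movie_requests(config):
    """Yield one parameter dict per request (hypothetical helper)."""
    start = config['page_start']
    end = start + config['page_total']
    for offset in range(start, end, config['page_limit']):
        # The final request may need fewer items than page_limit.
        count = min(config['page_limit'], end - offset)
        yield {'tag': config['tag'], 'page_limit': count, 'page_start': offset}


def result_path(directory=FILE_PATH, name=FILE_NAME):
    """Join the output directory and the file name (hypothetical helper)."""
    return os.path.join(directory, name)


if __name__ == "__main__":
    planned = list(iter_movie_requests(MOVIE_CONFIG))
    print(len(planned), "requests planned; results would go to", result_path())
```

Under this assumption, the default settings produce 10 requests of 20 movies each, which illustrates why a smaller `page_limit` means more requests for the same `page_total`.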
