Giter Club home page Giter Club logo

data-collectors's Introduction

data-collectors

基于python开发的可以采集b站,微博,快手,小红书评论的GUI软件

使用教程

1.首先运行main.py,将图形界面窗口运行出。
image
2.再次在图形界面窗口中点击需要爬取的类型网站。
3.具体操作指南。

b站程序使用教程

  1. 找到视频网址(点击b站上面你想要看的任何一个视频)
image
  1. F12键打开开发者工具,找到红色标识的一栏

    第一步,找到network工具栏

image

第二步,找到可以输入aid号的白色框

image
  1. 程序的oid号就是这里的aid号,每个视频都会分配一个号(叫做oid号)
image
  1. 输入aid号(oid)
image
  1. 输入oid
image
  1. 评论开始爬取

image

  1. 程序结束之后,会在这个程序的同一级目录下成csv文件
image
  1. 点击这个文件,就可以看到爬取的内容

image

爬取结果图

image

data-collectors's People

Contributors

suupermans avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.