Giter Club home page Giter Club logo

darknet_chinesetrading's Introduction

mosaic.jpg

DarkNet_ChineseTrading - 暗网中文网监控实时爬虫

En_Doc 教程:如何实现暗网交易监控

监控大屏(grafana快速实现)

功能

  • Tor节点切换
  • 自动注册(中文式账户)
  • 自动登录
  • 防封禁
  • ORM交互
  • 事件详情/样本信息录入
  • 事件提醒(telegram)[图文]
  • 分类爬取
  • 裸体图片过滤(保存但不发送)
  • 残留EXIF-GPS信息提取

加入我们:https://t.me/fordarknetspiderbot

安装

  • python环境配置

    下载并安装 python 3.8

     pip install -r ./requirements.txt
     pip install -U 'requests[socks]'
    
  • tor安装

    当前需更新tor至[0.4.0.0版本],旧版将有几率无法取得数据 如果无法通过如下命令安装最新版,推荐至官网编译安装最新源码包

     brew install tor
     	
     cd /usr/local/etc/tor
     cp torrc.sample ./torrc
     vi torrc
    

    将如下配置添加到 torrc 后,运行 restart_tor.sh 开启tor

     SOCKSPort 9150 					# socks5代理地址
     Socks5Proxy 127.0.0.1:1086 		# 科学上网代理地址(如已翻墙可不填)
     RunAsDaemon 1 					# 开启后台运行
     ControlPort 9151 				# 开启控制端口
    
  • OCR(mac)

    识别率略低,可在parser.py的get_captcha处替换

     brew install tesseract
    

    snum.traineddata

  • 存储环境

    安装Docker后下载Redis Mysql即可

  • Centos下环境安装

yum install epel-release -y
yum install redis mariadb mariadb-server git tesseract tesseract-langpack-deu tor -y 
wget -P /usr/share/tesseract/tessdata/ https://pyocean.com/data/tesseract/snum.traineddata 


systemctl start mariadb   #启动mariadb
systemctl enable mariadb  #设置开机自启动
mysql_secure_installation 
  • 运行

    配置config_dev.py中的连接设定与TelegramRobotToken

     mv config_dev.py conf.py 
     bash restart_tor.sh
     bash restart_task.sh
     python run.py
     
    
  • 运行逻辑

  • 运行结果截图

    • telegram

    • run.py

  • 额外命令

     python3 run.py --help
    
     Usage: run.py [OPTIONS]
    
     Options:
       --debug        Print debug log
       --domain TEXT  Target domain.
       --save_error   Whether to save the error log
       --update       Whether it has only been updated to crawl
       --help         Show this message and exit.
    
    

darknet_chinesetrading's People

Contributors

s045pd avatar atpiu avatar imgbotapp avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.