Giter Club home page Giter Club logo

zanachka's Projects

domainspider icon domainspider

Simple web crawler that sticks to a set list of domains. Work in progress.

double-agent icon double-agent

A test suite of common scraper detection techniques. See how detectable your scraper stack is.

dragnet icon dragnet

Just the facts -- web page content extraction

dscrapy icon dscrapy

distributed scrapy 分布式网络爬虫

dukpy icon dukpy

Simple JavaScript interpreter for Python

dupuis icon dupuis

UI tools for record deduplication and linkage

e-scraper icon e-scraper

Collect product and reviews from a different e-commerce stores.

eli5 icon eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

estela icon estela

estela, an elastic web scraping cluster 🕸

ethereum-scraper icon ethereum-scraper

UNMAINTAINED! Exporter for Ethereum blocks, transactions, ERC20 transfers, contracts, using Scrapy

exporters icon exporters

Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations

extraction icon extraction

A Python library for extracting titles, images, descriptions and canonical urls from HTML.

extruct icon extruct

Extract embedded metadata from HTML markup

fakebrowser icon fakebrowser

🤖 Fake fingerprints to bypass anti-bot systems. Simulate mouse and keyboard operations to make behavior like a real person.

fingerprints icon fingerprints

Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.

flatson icon flatson

Tool to flatten stream of JSON-like objects, configured via schema

flatten-dict icon flatten-dict

A flexible utility for flattening and unflattening dict-like objects in Python.

flattering icon flattering

Flatten, format, and export any JSON-like data to CSV (or any other string output).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.