scrapy-fake-useragent's Introduction

scrapy-fake-useragent-fix

Random User-Agent middleware based on fake-useragent. It picks up User-Agent strings based on usage statistics from a real world database.

Installation

The simplest way is to install it via pip:

pip install scrapy-fake-useragent-fix

Configuration

Turn off the built-in UserAgentMiddleware and add RandomUserAgentMiddleware.

In Scrapy >=1.0:

DOWNLOADER_MIDDLEWARES = {
    'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware': None,
    'scrapy_fake_useragent.middleware.RandomUserAgentMiddleware': 400,
}

In Scrapy <1.0:

DOWNLOADER_MIDDLEWARES = {
    'scrapy.contrib.downloadermiddleware.useragent.UserAgentMiddleware': None,
    'scrapy_fake_useragent.middleware.RandomUserAgentMiddleware': 400,
}

Configuring User-Agent type

There's a configuration parameter RANDOM_UA_TYPE defaulting to random which is passed verbatim to the fake-user-agent to random choose user agents. random, chrome, firefox, safari, internetexplorer are supported. If you want to choose from a specific device type, you can use a device prefix before browse type, such as desktop.chrome, mobile.chrome, only desktop, mobile, tablet are supported.

Usage with scrapy-proxies

To use with middlewares of random proxy such as scrapy-proxies, you need:

set RANDOM_UA_PER_PROXY to True to allow switch per proxy
set priority of RandomUserAgentMiddleware to be greater than scrapy-proxies, so that proxy is set before handle UA

Configuring Fake-UserAgent fallback

There's a configuration parameter FAKEUSERAGENT_FALLBACK defaulting to None. You can set it to a string value, for example Mozilla or Your favorite browser, this configuration can completely disable any annoying exception.

Recommend Projects

qiulin-wang / scrapy-fake-useragent Goto Github PK

scrapy-fake-useragent's Introduction

scrapy-fake-useragent-fix

Installation

Configuration

Configuring User-Agent type

Usage with scrapy-proxies

Configuring Fake-UserAgent fallback

scrapy-fake-useragent's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent