Giter Club home page Giter Club logo

disboard-scraper's Introduction

What does Disboard Scraper do?

This is a simple scraper which purpose is to obtain data about servers located on Disboard, providing a simple API to describe what results you are looking for. Note that this Actor does NOT bypass the limitations of https://disboard.org/, meaning that the amount of pages scraped per tag is limited to 50 (circ. ~1200 results).

Extracts descriptions, images, ratings, reviews, user counts, even join links, and more!

Why use Disboard Scraper?

Reasons:

  • Simple, no time wasted trying to configure inputs
  • Automatic duplicate detection and filtering

Want to collect or track data about active/large servers? This is the way to do it!

How much will scraping Disboard cost you?

To obtain 1000 results, the cost is approximately $ 0.3. The Apify Free plan will therefore cover up to 15000 results!

How to scrape Disboard

  1. Go to the top of page Disboard Scraper on the Apify platform
  2. Click the Try for free button
  3. Fill the inputs with desired query values
  4. Click the Start button
  5. Wait for the Actor to finish
  6. Export your data

Is it legal to scrape Disboard?

Our scrapers are ethical and do not extract any private user data, such as email addresses, gender, or location. They only extract what the user has chosen to share publicly. We therefore believe that our scrapers, when used for ethical purposes by Apify users, are safe. However, you should be aware that your results could contain personal data. Personal data is protected by the GDPR in the European Union and by other regulations around the world. You should not scrape personal data unless you have a legitimate reason to do so. If you're unsure whether your reason is legitimate, consult your lawyers. You can also read our blog post on the legality of web scraping.

Input

Example input for getting the first 10 pages (circ. ~240 results) of largest servers with "community" tag:

{
    "startPageNumber": 1,
    "endPageNumber": 10,

    "keywords": [ "community" ],
    "sort": "member_count"
}

Click on the Input tab for more information.

Output

You can download the dataset extracted by Disboard Scraper in various formats, such as JSON, HTML, CSV, or Excel.

JSON output example (without reviews):

{
	"id": "244230771232079873",
	"name": "The Programmer's Hangout",
	"description": "Whether you've written 10 lines of code or have been writing code for 10 years, you're welcome here! The Programmer's Hangout (TPH) is an extremely active community and a great place to get a solid footing in programming.",
	"category": "Technology",
	"tags": [
		"social",
		"community",
		"programming",
		"javascript",
		"java"
	],
	"userCount": {
		"online": 25942
	},
	"disboardServerUrl": "https://disboard.org/server/244230771232079873",
	"iconUrl": "https://cdn.discordapp.com/icons/244230771232079873/a_7b187c1d9f1a61d68b1373ab69d79f86.jpg",
	"joinLinkUrl": "https://disboard.org/server/join/244230771232079873",
	"bumpedAt": "2024-04-11T19:56:35.000Z",
	"reviews": []
}

Tips

  • If you want to collect more servers overall, try to use more narrow/strict tags (not as generally used)

disboard-scraper's People

Contributors

milunnn avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.