Giter Club home page Giter Club logo

kursach's Introduction

How to setup the data sources

  1. You need to specify the data sources that will be parsed in sources.json file. It supports 2 types of sources: RSS feeds and Telegram channels.
    • RSS feeds source object structure: name - data source name, link - url to RSS feed, selector - CSS selector for the article text on the HTML page with the article
    • Telegram channels is an array of channel ids, I honestly don't know how to find them except for using Telegram API ๐Ÿ˜ข. IMPORTANT: The user you use in data-parser to log in the Telegram must be subscribed to this channels in order to have access to the messages in them. This could easily be done manually in your user's Telegram account before running the data-parser

How to run data-parser Docker container

  1. Fill in the env/.env file with values (required variable names are listed in env/example.env)
  2. Run the following command to build an image:
   docker build -t data-parser .
  1. Open the directory with the project in your terminal and run the container from it so docker volumes work properly (or change the volume paths to the absolute ones in startup.sh file)
  2. Run the following command to start a container:
   sh startup.sh
  1. (Optional) If you need to parse data from Telegram, you need to enter the verification code for the login in tg-auth/code.txt file as soon as you receive it without any whitespaces, new lines, etc.

kursach's People

Contributors

kowalski0805 avatar kuvichkamaksim avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.