Giter Club home page Giter Club logo

app_info_extracter's Introduction

Introduction

App info extractor is a utility to extract and analyze customer reviews and critical metrics of apps from multiple sources like google play store, Apple play store, and any other sources.This utility is highly configurable to extract and give relevant information about any app in a meaningful way.

How it works?

This utility takes config.yaml as input. Configuration for each app is self-contained in configs. Once a configuration is specified in config.yaml, the utility will iteratively go through each app, crawl the relevant information, and store the data locally as a CSV file.

How to use it?

  1. Check out the package
  2. Have python 3.8 on your system . Use pyenv to manage multiple version of python in your machine
  3. Open terminal and type cd app_info_extracter
  4. Setup python virtual enviorment having this will prevent you from messing up with the system-wide python installation
  5. Run pyb install_dependencies to install the dependencies for your project. Refer to the Building and testing section below incase of queries.
  6. To check if the build is successful, run pyb in the terminal.
  7. Add the required configuration in config.yaml using your favorite editor
  8. In terminal type python3 src/crawler/main.py src/crawler/config.yaml
  9. Results of the crawling will be stored in separate directories for each app in current directory
  10. In case of failure, just re-run python3 src/crawler/main.py src/crawler/config.yaml script will start from where its left off.

Building and testing

Prerequisites

  1. Verify that Python-3 and pip3 should be installed in your system. terminal>pip3 -V
  2. Install pyb: pip3 install pybuilder
  3. Verify Pyb installation :pyb --version

To build,clean and run tests, use the following command:

Build and test

pyb install_dependencies 
# Pyb should work without pyb install_dependencies, but unfortunately it doesnt.
# See details: here https://github.com/pybuilder/pybuilder/issues/727 
pyb -v
  • That's all you need to build the project.
  • Details about Test failures if any can be found in file target\reports\unittest
  • The binary artifacts will be available in the target folder.

Clean builds :

  • pyb clean

Reset environment :

  • pyb clean && rm -rf .pybuilder

working with the build system

IDE setup

  • Pycharm/intellij support is present in source. .idea files are avilable when you clone this repo. Just open the folder in pycharm.

Contributing

  • Use vim or InteliJUltimate or pycharm or visualstudio code.

app_info_extracter's People

Contributors

dhruvpeter avatar charlypa avatar 7tena avatar abose avatar vi-suji avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.