active_learning_crowds

Active learning using crowds in the loop

Install Python 2.7 or the Anaconda distribution of Python 2.

Required Python packages: pymongo, boto, and boto3.

========================================== Installation and setup

To clone the application:

git clone (git url)

Data and AWS key:

Place the data files inside the 'data' directory.

Add your AWS access key and secret access key to the credentials file located in
python_scripts/AWS_key/, separated by a newline.
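
As an illustration of the file layout described above, here is a minimal sketch of how a script might read the two values. The function name `read_aws_credentials` is hypothetical and is not taken from the repository; the sketch skips blank lines, so it accepts the key and secret on consecutive lines or with an empty line between them.

```python
def read_aws_credentials(path):
    """Return (access_key, secret_key) from a two-value credentials file."""
    with open(path) as handle:
        # Strip whitespace and ignore blank lines between the two values.
        lines = [line.strip() for line in handle if line.strip()]
    if len(lines) != 2:
        raise ValueError("expected 2 non-empty lines, found %d" % len(lines))
    return lines[0], lines[1]
```

Failing early on a malformed file is a deliberate choice here: a missing or extra line is easier to diagnose at load time than as an authentication error from AWS later.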

To install Meteor:

curl https://install.meteor.com/ | sh

To start the Meteor application and keep it running in the background:

Go to the meteor_application directory and run the command:

    nohup meteor --port 8080 &

To set up Meteor (do this once, before starting the application):

Go to the meteor_application directory and run the commands:

    meteor npm install --save babel-runtime

    meteor npm install --save core-js

    meteor add session

    meteor remove autopublish

Load data into Meteor's MongoDB:

Go to the python_scripts directory and run the commands:

    python insert_data_into_mongodb.py    # adds one year of Twitter data to MongoDB

    python insert_crowdflower_data.py     # adds the CrowdFlower data to MongoDB
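
For readers who want to see what such a loading step involves, the following is a rough sketch using pymongo, not the repository's actual scripts. It assumes the data is CSV with a header row, that the collection is called `tweets`, and that the database is Meteor's development MongoDB, which normally listens on the application port plus one (8081 when Meteor runs on 8080) with a database named `meteor`. All function names and the collection name are hypothetical.

```python
import csv


def rows_to_documents(csv_text):
    """Turn CSV text with a header row into a list of dicts, one per row."""
    reader = csv.DictReader(csv_text.splitlines())
    return [dict(row) for row in reader]


def load_documents(collection, documents):
    """Insert documents into a pymongo collection and return how many were stored."""
    if not documents:
        # insert_many rejects an empty list, so skip the call.
        return 0
    result = collection.insert_many(documents)
    return len(result.inserted_ids)


def connect(port=8081, db_name="meteor", collection_name="tweets"):
    """Open the assumed Meteor development database (hypothetical defaults)."""
    # Imported here so the helpers above can be used without pymongo installed.
    from pymongo import MongoClient
    return MongoClient("localhost", port)[db_name][collection_name]
```

With a running Meteor application, `load_documents(connect(), rows_to_documents(text))` would store every row as one document.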

Create HITs:

Go to the python_scripts directory and run the commands below.

To run in the sandbox environment:

    python create_hit.py sandbox unlabeled         # creates HITs from unlabeled data

    python create_hit.py sandbox crowdflower 10    # creates HITs from CrowdFlower data, 10 tweets per HIT

To run in the production environment:

    python create_hit.py production unlabeled         # creates HITs from unlabeled data

    python create_hit.py production crowdflower 10    # creates HITs from CrowdFlower data, 10 tweets per HIT
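
The command-line arguments above (environment, data source, optional tweets per HIT) can be sketched as follows. This is an illustrative reading of the interface, not the code in create_hit.py: the function names are hypothetical, and the endpoint URLs are the ones the MTurk requester API uses with boto3, which may differ from what the script configures if it relies on the older boto library.

```python
SANDBOX_ENDPOINT = "https://mturk-requester-sandbox.us-east-1.amazonaws.com"
PRODUCTION_ENDPOINT = "https://mturk-requester.us-east-1.amazonaws.com"


def parse_create_hit_args(argv):
    """Parse [environment, data_source, tweets_per_hit (optional)]."""
    if len(argv) < 2:
        raise ValueError("usage: <sandbox|production> <unlabeled|crowdflower> [tweets_per_hit]")
    environment, source = argv[0], argv[1]
    if environment not in ("sandbox", "production"):
        raise ValueError("unknown environment: %s" % environment)
    if source not in ("unlabeled", "crowdflower"):
        raise ValueError("unknown data source: %s" % source)
    tweets_per_hit = int(argv[2]) if len(argv) > 2 else None
    if tweets_per_hit is not None and tweets_per_hit < 1:
        raise ValueError("tweets_per_hit must be at least 1")
    return environment, source, tweets_per_hit


def endpoint_for(environment):
    """Pick the MTurk endpoint for the chosen environment."""
    return SANDBOX_ENDPOINT if environment == "sandbox" else PRODUCTION_ENDPOINT


def batch_tweets(tweets, size):
    """Split tweets into consecutive groups of at most `size`, one group per HIT."""
    return [tweets[i:i + size] for i in range(0, len(tweets), size)]
```

Testing against the sandbox first is the safer order, since sandbox HITs cost nothing and exercise the same request path.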

=============================================== Compensation HIT and Worker Payment

Get compensation report:

Go to the python_scripts directory and run the command:

    python get_report_based_on_labels.py

The report is written to a CSV file (/data/hit_report2.csv).

Create qualification type, compensation HIT and assign qualification:

Go to the python_scripts directory and run the command:

To create a qualification type and a compensation HIT, and to assign the qualification to workers, in the sandbox environment:

    python contact_users_compensation_hit.py sandbox

To do the same in the production environment:

    python contact_users_compensation_hit.py production


To monitor submissions of the compensation HIT and store the HIT information in MongoDB in the sandbox environment:

    python collect_compensation_hit_results.py sandbox

    nohup python collect_compensation_hit_results.py sandbox &    # runs the script as a background process on the server

To monitor submissions of the compensation HIT and store the HIT information in MongoDB in the production environment:

    python collect_compensation_hit_results.py production

    nohup python collect_compensation_hit_results.py production &    # runs the script as a background process on the server
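
The monitoring step is essentially a polling loop, which is why running it in the background is useful. The sketch below shows one way such a loop could look; it is an assumption about the approach, not the contents of collect_compensation_hit_results.py. It expects a boto3 MTurk client (its `list_assignments_for_hit` call is real), while `collect_submitted`, `store`, `poll_seconds` and `max_polls` are hypothetical names. Pagination via `NextToken` is left out for brevity.

```python
import time


def collect_submitted(client, hit_id, store, poll_seconds=60, max_polls=None):
    """Poll a HIT for submitted assignments and pass each new one to `store`."""
    seen = set()
    polls = 0
    while max_polls is None or polls < max_polls:
        response = client.list_assignments_for_hit(
            HITId=hit_id, AssignmentStatuses=["Submitted"]
        )
        for assignment in response.get("Assignments", []):
            # Store each assignment only once, even if it shows up in later polls.
            if assignment["AssignmentId"] not in seen:
                seen.add(assignment["AssignmentId"])
                store(assignment)
        polls += 1
        if max_polls is None or polls < max_polls:
            time.sleep(poll_seconds)
    return len(seen)
```

Here `store` could be any callable, for example the `insert_one` method of a pymongo collection.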

Pay workers:

Go to the python_scripts directory and run the command:

To pay a worker, given the worker ID and the compensation amount, in the sandbox environment:

    python worker_compensation.py sandbox 'WORKERID' 'AMOUNT'

    example: python worker_compensation.py sandbox A3VOSKJ5LS9WB 0.10

To pay a worker, given the worker ID and the compensation amount, in the production environment:

    python worker_compensation.py production 'WORKERID' 'AMOUNT'

    example: python worker_compensation.py production A3VOSKJ5LS9WB 0.10
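
Because the production command spends real money, it can help to validate the arguments before any request is made. The following is a hedged sketch of such a check; `parse_payment_args` is a hypothetical helper and is not part of worker_compensation.py. It returns the amount as a two-decimal string, on the assumption that the payment call takes monetary amounts as strings, as the boto3 MTurk client does.

```python
from decimal import Decimal, InvalidOperation


def parse_payment_args(argv):
    """Validate [environment, worker_id, amount]; return amount as a 2-decimal string."""
    if len(argv) != 3:
        raise ValueError("usage: <sandbox|production> WORKERID AMOUNT")
    environment, worker_id, raw_amount = argv
    if environment not in ("sandbox", "production"):
        raise ValueError("unknown environment: %s" % environment)
    if not worker_id.isalnum():
        raise ValueError("worker ID should be alphanumeric: %s" % worker_id)
    try:
        amount = Decimal(raw_amount)
    except InvalidOperation:
        raise ValueError("amount is not a number: %s" % raw_amount)
    # Reject NaN, infinity, zero and negative amounts.
    if not amount.is_finite() or amount <= 0:
        raise ValueError("amount must be a positive number: %s" % raw_amount)
    return environment, worker_id, str(amount.quantize(Decimal("0.01")))
```

Using `Decimal` rather than `float` avoids binary rounding surprises when the amount is formatted back into a string.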

Contributors:

tongliutl, sabarigirish
