Giter Club home page Giter Club logo

wordcount's Introduction

Canonical Word Count

This workflow counts the occurrences of words in a text corpus. It implements a Cuneiform example workflow first published in Bux et al. 2015.

A detailed description can be found on the Cuneiform website. This cookbook installs all necessary tools, downloads all necessary data, sets up Cuneiform, and places the workflow in a predetermined location. The cookbook can be run on any system in a virtual machine. For running the cookbook natively, an Ubuntu 16.04 or higher is required.

Below you find installation instructions for, both, the native and the virtual machine setup.

Prerequisites

Install the following packages:

If you want to set up a VM to test Cuneiform these additional packages are required:

Under Ubuntu you can install the ChefDK by entering on the command line

sudo dpkg -i chefdk_*.deb

Building a VM with kitchen

This section describes how to set up the workflow environment in a Virtual Machine (VM). To do this, it does not matter whether you are running Linux, Mac OS, or Windows. However, if you are running an Ubuntu and want to set up the workflow locally (without creating a VM), see Section Building locally.

To build a VM from this cookbook for the first time, change your git base directory and enter the following:

git clone https://github.com/joergen7/wordcount.git
cd wordcount
kitchen converge

You can log into the newly built VM by entering

kitchen login

You can drop the VM by entering

kitchen destroy

Building locally

This section describes how to set up this workflow locally without the indirection of a VM. If you want to try out this workflow in a VM first see Section Building a VM with kitchen.

To install this cookbook locally, create a directory "cookbooks", clone the cookbook into it and run the chef client:

mkdir cookbooks
cd cookbooks
git clone https://github.com/joergen7/wordcount.git
cd wordcount
berks vendor ..
cd ../..
sudo chef-client -z -r "wordcount::default"

Running the Workflow

If you installed the workflow on a VM log into the machine by typing

kitchen login

Execute the workflow script by entering

cuneiform -d /opt/data /opt/wf/wordcount.cfl

wordcount's People

Contributors

joergen7 avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

wordcount's Issues

provide me teh instruction for HIWAY

Hi
can you provide me the instruction to type in linux terminal for implementing this on Marc Bux's "HIWAY".
I already installed and ran Helloworld On HIWAY.
thanks in advance!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.