Data Science VM

Vagrant + VirtualBox scripts for setting up a 32-bit Ubuntu data science server. If you want to make changes or improvements, don't be shy; send me a note or a pull request.

This system takes a download-it-all approach. As a result, it can take quite a while to download everything and set up the server. To speed things up, adjust the Vagrantfile to install only the tools you need.

Installing Locally with VirtualBox

  • Install VirtualBox
  • Install Vagrant
  • Set up the Ubuntu Precise32 Vagrant box: vagrant box add precise32 http://files.vagrantup.com/precise32.box
  • Run vagrant up. This should launch the box and download and install a suite of data science tools.
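
Before running the steps above, it can help to confirm that both prerequisites are on your PATH. The following is a minimal Python sketch, not part of this repository. It assumes the executables are named vagrant and VBoxManage (the usual names of the Vagrant and VirtualBox command-line tools); the helper name missing_tools is purely illustrative.

```python
import shutil


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    # Assumed executable names; adjust if your installation differs.
    missing = missing_tools(["vagrant", "VBoxManage"])
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("Vagrant and VirtualBox look installed; try 'vagrant up'.")
```

On some systems VirtualBox installs without adding VBoxManage to the PATH, so a "not found" result here does not necessarily mean VirtualBox is missing.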

Installing on Amazon AWS

Vagrant can also launch, provision, and manage Amazon AWS instances. For more, read the vagrant-aws page.

  • Install the vagrant-aws plugin: vagrant plugin install vagrant-aws
  • Create a new AWS access key or note the location of your existing key
  • Create a 'dummy' Vagrant box that will stand in for AWS: vagrant box add awsubuntu https://github.com/mitchellh/vagrant-aws/raw/master/dummy.box
  • Change the base box in Vagrantfile.aws to 'awsubuntu'
  • Add your access ID, key name, and key file location to Vagrantfile.aws
  • Copy Vagrantfile.aws over Vagrantfile
  • Open ports 22, 8888, and 8787 in the default security group on your AWS account, or create a new security group with SSH access and add it to the Vagrantfile
  • To launch your VM, run vagrant up --provider=aws

Tools Included

  • Python NLTK
  • Python Stanford Core NLP Library, with Web API
  • iPython Notebook
  • R Studio
  • Python Scikit-learn
  • Vim
  • Screen

Accessing R Studio

Port forwarding is automatically set up on port 8787. If you're running Vagrant locally, load http://localhost:8787. If you're running it remotely, load http://SERVER_ADDRESS:8787. The default username and password are the same as Vagrant's default password.

iPython Notebook

The proxy is automatically set up on port 8888. Run ipython notebook and open localhost:8888.
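
To confirm that the ports for R Studio (8787) and the notebook (8888) are reachable once the services are running, a small check along these lines may be useful. This is a hedged sketch rather than part of the repository: the function name port_is_open and the two-second timeout are illustrative choices, and the host is assumed to be localhost (substitute your server address for a remote VM).

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution errors.
        return False


if __name__ == "__main__":
    # Ports taken from this README: 8787 for R Studio, 8888 for the notebook.
    for name, port in [("R Studio", 8787), ("iPython Notebook", 8888)]:
        state = "reachable" if port_is_open("localhost", port) else "not reachable"
        print("%s on port %d: %s" % (name, port, state))
```

A "not reachable" result only means nothing accepted a TCP connection on that port. The service may not be running yet, or the port may not be forwarded or opened in the security group.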

Accessing Web Services

If you launch other servers, you need to set up port forwarding for them. Read this guide to port forwarding on Vagrant: http://docs-v1.vagrantup.com/v1/docs/getting-started/ports.html

Running the Stanford CoreNLP Python Module

python corenlp-python/corenlp/corenlp.py

Contributors

  • natematias
