Command-line tools for data science
This repository is a collection of command-line tools for obtaining, scrubbing, and exploring data. Most of these tools are discussed in the blog post 7 command-line tools for data science.
Currently, the box directory contains a Vagrant environment for installing these command-line tools. This environment will soon be moved into a separate repository. The blog post Lean, mean data science machine contains instructions for installing this virtual environment.
License
GPLv3