RHIPE: R and Hadoop Integrated Programming Environment

RHIPE is an R package that provides a way to use Hadoop from R. It can be used on its own or as part of the Tessera environment.

Installation

Installation of RHIPE requires a working Hadoop cluster and several prerequisites. A comprehensive installation guide and other options for setting up an environment (Vagrant, Amazon Web Services, etc.) are discussed here.

If you are interested in installing and using RHIPE, please see the link just provided. The remainder of this README will focus on developers who want to build RHIPE from source or contribute to RHIPE development.

Hadoop Support

Our current development efforts are focused on Hadoop 2 (YARN), although code that works with earlier versions of Hadoop is also available in this repository.

Hadoop 2

The master branch of this repository is focused on Hadoop 2 development, and contains code to build RHIPE v0.75.x. As some aspects of YARN that we have addressed are not backward compatible, packages built from this branch will not work with Hadoop 1.

Hadoop 1

The v0.74 branch of this repository is for Hadoop 1.
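
As a rough illustration of the branch layout described above, the mapping from Hadoop major version to branch can be sketched as a small shell function. The function name `rhipe_branch_for` is hypothetical and not part of RHIPE; only the branch names `master` and `v0.74` come from this README.

```shell
# Hypothetical helper (not part of RHIPE): map a Hadoop major version
# to the branch named in this README.
rhipe_branch_for() {
  case "$1" in
    1) echo "v0.74" ;;    # Hadoop 1 -> v0.74 branch
    2) echo "master" ;;   # Hadoop 2 (YARN) -> master branch
    *) echo "unsupported Hadoop major version: $1" >&2; return 1 ;;
  esac
}

# Example use inside an existing clone of the repository:
# git checkout "$(rhipe_branch_for 2)"
```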

Hadoop Distributions

There are several Hadoop distributions available. RHIPE has been successfully built and run for Apache Hadoop 1.x, Cloudera CDH3, CDH4mr1, and CDH5mr2. There are Maven profiles set up in the POM that build against Apache Hadoop 1.x and 2.x, CDH3, CDH4, CDH5, and HDP 1, 2, and 2.2.

Building Rhipe

Probably the easiest way to build RHIPE is to provision a Vagrant machine that has all the prerequisites configured. Another option is to set up a local pseudo-distributed Hadoop cluster; for an example, see here.

If you set up your own machine, you will need to make sure the following dependencies are met, beyond Hadoop:

Rhipe is built using both Ant and Maven. Maven handles the Java build with the various distro dependencies. Ant drives the build process, including the R packaging, running R commands, and testing.

Running Ant with no targets prints help:

ant

To build Rhipe for a specific distro, run the following, setting hadoop.version to one of the bracketed values:

ant build-distro -Dhadoop.version=[hadoop-1,hadoop-2,cdh3,cdh4,cdh5,hdp-1,hdp-2]
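
Because the command takes exactly one distro value, a small wrapper can catch typos before Ant starts. The following is a minimal sketch, not part of the RHIPE build: the function name `rhipe_build_cmd` is hypothetical, and the accepted values are copied from the command line above. It only prints the command, so it can be inspected before being run.

```shell
# Hypothetical helper (not part of RHIPE): validate a distro name against
# the values listed in this README and print the matching ant command.
rhipe_build_cmd() {
  case "$1" in
    hadoop-1|hadoop-2|cdh3|cdh4|cdh5|hdp-1|hdp-2)
      printf 'ant build-distro -Dhadoop.version=%s\n' "$1"
      ;;
    *)
      printf 'unknown distro: %s\n' "$1" >&2
      return 1
      ;;
  esac
}

# Example: print the command for CDH5, then run it from the repository root.
# cmd=$(rhipe_build_cmd cdh5) && $cmd
```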

Contributors

saptarshiguha, hafen, jrounds, rperko3, amwhite, xiaosutong, agentsguha
