Hadoop Map-Reduce Design Patterns
Clone the repository:
git clone git@github.com:geftimov/MapReduce.git
Change into the folder:
cd MapReduce
Build it with Maven:
mvn clean install
Each pattern's ReadMe includes an example run.
##### 1. Numerical Summarization ReadMe
- CommentWordCount
- MinMaxCount
- Average
- MedianStdDev (With In-Memory Map)
- MedianAndStandardDeviationCommentLengthByHour (Without the Map, more efficient)
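
The MinMaxCount and Average entries above share one idea: each record is turned into a small partial summary, and partial summaries are merged per key. The block below is a minimal plain-Java sketch of that idea, not the repository's code. It uses no Hadoop classes, and the names `NumericalSummarySketch`, `Summary`, `merge`, and `summarize` are hypothetical. Carrying the sum and count (rather than a running average) is an assumption made here so that merging stays associative, which is what would let the same logic serve as a combiner.

```java
import java.util.Arrays;
import java.util.List;

/** Plain-Java sketch of numerical summarization; no Hadoop classes are used. */
final class NumericalSummarySketch {

    /** Partial summary for one key (hypothetical type, not taken from the repository). */
    static final class Summary {
        final long min;
        final long max;
        final long count;
        final long sum;

        Summary(long min, long max, long count, long sum) {
            this.min = min;
            this.max = max;
            this.count = count;
            this.sum = sum;
        }

        /** "Map" step: a single value becomes a one-element summary. */
        static Summary of(long value) {
            return new Summary(value, value, 1, value);
        }

        /** "Combine/reduce" step: merging is associative and commutative. */
        Summary merge(Summary other) {
            return new Summary(
                    Math.min(min, other.min),
                    Math.max(max, other.max),
                    count + other.count,
                    sum + other.sum);
        }

        /** The average is derived only at the end, from the merged sum and count. */
        double average() {
            return (double) sum / count;
        }
    }

    /** Folds all values seen for one key, as a reducer would. */
    static Summary summarize(List<Long> values) {
        if (values.isEmpty()) {
            throw new IllegalArgumentException("no values to summarize");
        }
        Summary result = Summary.of(values.get(0));
        for (Long value : values.subList(1, values.size())) {
            result = result.merge(Summary.of(value));
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up comment lengths for a single key.
        Summary s = summarize(Arrays.asList(12L, 3L, 40L, 5L));
        System.out.println(s.min + " " + s.max + " " + s.count + " " + s.average());
        // prints "3 40 4 15.0"
    }
}
```

Summarizing two halves of the input separately and merging the results gives the same summary as a single pass, which is the property a combiner relies on.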
##### 2. Inverted Index Summarization ReadMe
##### 3. Counting with Counters ReadMe
##### 1. Filtering ReadMe
##### 2. Bloom Filtering ReadMe
##### 3. Top Ten ReadMe
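
A common way to approach a top-N job is for each mapper to keep only its local top N and for a single reducer to merge those candidates. The following is a rough plain-Java sketch of that approach under stated assumptions: it is not the repository's implementation, it uses no Hadoop classes, and `TopNSketch` and `topN` are hypothetical names. A bounded min-heap is used here as one possible way to keep at most N values in memory.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.PriorityQueue;

/** Plain-Java sketch of a top-N selection; no Hadoop classes are used. */
final class TopNSketch {

    /** Returns the n largest values in descending order, holding at most n + 1 values at once. */
    static List<Integer> topN(Iterable<Integer> values, int n) {
        PriorityQueue<Integer> heap = new PriorityQueue<>(); // min-heap: smallest value on top
        for (Integer value : values) {
            heap.add(value);
            if (heap.size() > n) {
                heap.poll(); // drop the smallest so only the n largest remain
            }
        }
        List<Integer> result = new ArrayList<>(heap);
        Collections.sort(result, Collections.reverseOrder());
        return result;
    }

    public static void main(String[] args) {
        // Two made-up input splits, each reduced to its local top 3 ("mapper" side).
        List<Integer> split1 = Arrays.asList(5, 17, 2, 9);
        List<Integer> split2 = Arrays.asList(11, 3, 20, 8);

        // The local candidates are merged and cut down again ("reducer" side).
        List<Integer> candidates = new ArrayList<>(topN(split1, 3));
        candidates.addAll(topN(split2, 3));
        System.out.println(topN(candidates, 3)); // prints "[20, 17, 11]"
    }
}
```

Because every split forwards at most N candidates, the final merge sees only a small, bounded number of values however large the input is.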
##### 4. Distinct ReadMe
##### 1. Structured to Hierarchical ReadMe
##### 2. Partitioning ReadMe
##### 3. Binning ReadMe
##### 4. Total Order Sorting ReadMe
##### 5. Shuffling ReadMe
##### 1. Reduce Side Join ReadMe
##### 2. Replicated Join ReadMe
##### 3. Composite Join ReadMe
##### 4. Cartesian Product ReadMe
Georgi Kalinov Eftimov
[email protected]