We used the C4.5 Algorithm to create decision trees by making Map-Reduce functions to calculate Gain Ratio in a distributed manner. Then we compared the performance of the MR and Spark frameworks by creating the decision trees using different sizes of datasets.
Team :
- Pathik Patel
- Nisarg Suthar
- Jugal Patidaar
- Manjal Shah
- Hemanya Chaudhary