Giter Club home page Giter Club logo

frequent-pattern-mining's Introduction

Frequent-Pattern-Mining

Course: CSC 240

DATE: Oct.24th , 2016

Project 1

MAIN CONTINENT IN THE FOLDER:

test_short.txt

	Short test dataset from textbook problem 6.6.

test.txt

	Test dataset generated by my classmate Xuan Tang.

adult.txt

	Required adult dataset from UCI database. (originally named adult.data)

OUTPUT.pdf

	Output of the program for the adult dataset with minimum support of 0.6.

src (folder)

	java source files:

	Apriori.java

		Class implementing Apriori algorithm which containing a main method

	FPGrowth.java

		Class implementing FP-Growth algorithm which containing a main method

	FPTree.java

		Class of FPTree object

	HeaderNode.java

		Class of header node object used to generate the header table in FP-Growth

	HeaderComparator.java

		Class of comparator used to sort the header table

	AprioriImproved.java

		Class implementing my improved Apriori algorithm which containing a main method

BRIEF DESCRIPTION

This program implements Apriori, FP-Growth, my improved Apriori algorithms.

Apriori and FP-Growth are generally based on the description and the pseudocode provided in the textbook.

For my improved algorithm, I used the hash table improvement and transaction scan reduction improvement strategies, for more details, please see my report and code.

HOW TO COMPILE AND RUN THE CODE

javac Apriori.java

java Apriori

OR

javac FPGrowth.java

java FPGrowth

OR

javac AprioriImproved.java

java AprioriImproved

CITATION:

Data Set:

	Adult Data Set, https://archive.ics.uci.edu/ml/datasets/Adult , Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.


TextBook:

	Jiawei Han, Micheline, Kamber, Jian Pei, Data Mining: Concepts and Techniques, 3/E, Morgan Kaufmann, 2011


Open Source:

	Monperrus, Martin. Java implementation of the Apriori algorithm for mining frequent itemsets. GitHub repository, https://gist.github.com/monperrus/7157717

	Nobahar, Kamran. An implementation of FP-Growth algorithm in Java, GitHub repository, https://github.com/goodinges/FP-Growth-Java (referred for my insert method in FP-Growth)

	Generating all permutations of a given string, Stack Overflow, http://stackoverflow.com/questions/4240080/generating-all-permutations-of-a-given-string (referred for my combinations generation method in FP-Growth)

frequent-pattern-mining's People

Contributors

kellyyutonghe avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.