Giter Club home page Giter Club logo

data-warehousing_project-1's Introduction

CITE3401-Data Warehousing - Project-1

Project 1: Data Warehousing and Association Rule Mining

Result: 45/50

Outline of the project

The project focuses on performing data warehousing techniques on real crime datasets to answer some business queries.

Submission file included:

  • PowerBI file
  • PDF report generated from PowerBI
  • Data cleaning/ETL script
  • SQL script and cleaned CSV files
  • Solution project file (and its folder) of the SSDT analysis service multi-dimensional project
  • PDF file discribing the association rule mining process and results.

Data Warehousing Design and Implementation

  • Consider a few business questions my data warehouse could help answer.
  • Draw a StarNet with the aim to identify the dimensions and concept hierarchies for each dimension. This should be based on the lowest information level I can access.
  • Use the StarNet footprints to illustrate how my design can answer business queries.
  • Once the StarNet diagram is completed, draw it using a tool or software - in this project, diagram.net was used, and paste it into PowerBI
  • Implement a star schema using SQL Server Management Studio (SSMS) and paste the database ER diagram generated by SSMS onto Power BI Dashboard.
  • Load the data from the CSV files to populate the tables by using the SQL code command
  • Use SQL Server Data Tools to build a multi-dimensional analysis service solution, with a cube designed to answer my business queries. Make sure the concept hierarchies match StarNet design. Paste the cube diagram to your Power BI Dashboard.
  • Use Power BI to visualise the data returned from the business queries.

Association Rule Mining

The Association Rule Mining is applied in this crime dataset.

  • Process the dataset into a case table and a nested table.
  • Explain the top k rules (according to importance or probability) that have the "crime" type (or other suitable columns) on the right-hand side, where k>=1.
  • Explain the meaning of the k rules in plain English.

data-warehousing_project-1's People

Contributors

nguyenhaitran avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.