Giter Club home page Giter Club logo

nba-simulator---simple-model's Introduction

NBA Simulator (Simple Model) - Exploratory Analysis & Multivariable Regression Project

Author: Patrick de Guzman

Primary Data Source: Basketball Reference

Project Components

This NBA Simulator Project will contain several components:

  • Getting, Wrangling, Compiling, and Cleaning the data (Github)
  • Exploratory Data Analysis (Rpubs)
  • Multivariable Regression Model to Predict W/L% and 2019-2020 season results (Rpubs)
  • Advanced Model, Interactive Trade System to Incorporate Individual Player Stats (In Progress!)

The final goal of this project is to develop an advanced regression/ML model with interactivity through a Shiny app to allow for the dynamic movement of NBA players from team to team. By doing this, we can improve our prediction results by incorporating the effect that individual players have on the teams that they play for (since our multivariable model will be shown to base itself on the assumption that teams remain static, i.e., no trades/waivers/injuries/etc. occur throughout the forecast horizon).

Synopsis

Data on the National Basketball Association (NBA) seasons from the Basketball Reference website was compiled, cleaned, and merged to create a final dataset of the following variables (definitions for all variables can be found in the basketball reference glossary):

  • Team per game statistics (such as points per game, assists per game, etc.)
  • Opponent Team per game statistics (same as above but for opponent teams)
  • Advanced statistics (higher-level stats such as margin-of-victory, strength of schedule, etc.)

From this compiled data, exploratory data analysis was performed in an attempt to discover the likely traits of 'winning' teams. For the purposes of this project, 'winning' teams are those that qualify for the NBA playoffs and/or win the NBA championships within a given year. Trends within the league were also explored, and features within the dataset were identified to correlate with a team's success rate (i.e., their Win/Loss percentages in the regular season).

The following are included in the repository:

  • Scripts (Folder): Contains Data cleaning & compiling script "Data Collection & Cleaning V2.R" and Exploratory Analysis 'Rmd' file (click here for the published version)
  • Team Data (Folder): Contains 3 types of datasets (team statistics, opponent team statistics, and advanced statistics) for the NBA seasons ranging from 1980-2019 (seasons split into separate files and extracted manually from Basketball Reference).
  • NBA Data by Team and Season.csv : Compiled & cleaned dataset containing all 3 types of data (noted above) for all seasons and all teams in the NBA from 1980-2019.
  • README.md: README file
  • Team by Season by Advanced/Game Stats/Opp Game Stats.csv : Files produced from "Data Collection & Cleaning V2.R" script prior to final consolidation to 'NBA Data by Team and Season.csv' dataset.

nba-simulator---simple-model's People

Contributors

patrickdg avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Forkers

phuclinh9802

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.