Giter Club home page Giter Club logo

decoding-ipl-match-analysis's Introduction

Decoding IPL: Match Analysis

About the Project

Given the recent advancement in technology, analytics through Machine Learning and Artificial Intelligence has become crucial in many areas, including sports. A considerable amount of data is now collected and used to predict scores and future trends, make managerial decisions and analyse the performance of the players in various sports, using Machine Learning techniques.

We tried to use these advancements to analyse and explore the Indian Premier League (IPL).

Our project has two aims:

  1. Predict the outcome of matches, i.e. the winner of the match, based on previous years’ data.
  2. Predict the scores during certain intervals of the game. For example, the score in overs 1-15, 1-10 or 3-8 etc.

To accomplish this, we used algorithms such as Decision Trees, Linear Regression, Random Forest and Support Vector Machines.

File Structure

The file structure is as follows:

  1. Dataset/: This directory contains all the data files, stored weights and the embedding matrix.

    1. Dataset/Embeddings/: Contains the embedding matrix
      1. emb_player_vec_dict: contains the embedding matrix and vector information.
      2. emb_player_stoi_dict: contains the string to index information for player names.
      3. del_to_emb_final: stores the mapping of player names from deliveries.csv to embedding matrix.
      4. emb_player_vec_dict: stores the embedding vector player_name wise.
      5. not_found: contains the list of players who had different names in both the datasets.
    2. Dataset/cricksheet_ipl_csv: Data collected from Cricsheet's website in .csv format.
    3. Dataset/cricsheet_ipl_yaml: Data collected from Cricsheet's website in .yaml format.
    4. Dataset/ipl_stats: Data scraped from the IPL's official website for season wise player points.
    5. Dataset/kaggle_data: contains matches.csv (match-wise information about every IPL season from 2008 to 2020) and deliveries.csv (delivery-wise information about every IPL match).
    6. Dataset/Ball_by_ball.csv: contains modified deliveries.csv with 7-dimensional vectors instead of player names.
  2. Reports/:

    1. Project_Presentation.pptx: Project presentation
    2. Project_Report.pdf: A detailed project report
  3. plots/: Folder containing plots created during Exploratory Data Analysis.

  4. Ball_by_Ball_Regression.ipynb: Code to perform regression on ball-by-ball dataset. It predicts the score for each ball.

  5. Create_ball_by_ball_dataset.ipynb: Converts the categorized features (batsman, non-striker and bowler) in deliveries.csv into their corresponding vectors.

  6. Create_variable_balls_dataset.ipynb: Groups together variable number of balls from the deliveries.csv.

  7. EDA.ipynb: Contains the Exploratory Data Analysis performed on the collected data.

  8. HyperparameterTuning_Regression.ipynb: Regression based models with hyperparameter tuning for the best model.

  9. Player_Embedding_Vectors.ipynb: Creates player embeddings based on the player-wise performance scraped from IPL's official website.

  10. match_classification.ipynb: Classification models with hyperparameter tuning for the match result prediction task.

Contributors

  1. Mudit Dhawan
  2. Samik Prakash
  3. Shivangi Dhiman

decoding-ipl-match-analysis's People

Contributors

shivangidhiman avatar mudit-dhawan avatar samikprakash avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.