This project uses Python with pandas, SQL and principal component analysis (PCA). The database was downloaded from Kaggle:
https://www.kaggle.com/hugomathien/soccer/data
https://www.kaggle.com/dimarudov/data-analysis-using-sql/data
This is the ultimate soccer database for data analysis and machine learning.
European Soccer Database (from Kaggle.com)
- +25,000 matches
- +10,000 players
- 11 European countries with their lead championship
- Seasons 2008 to 2016
- Player and team attributes sourced from EA Sports' FIFA video game series
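As a rough sketch of how SQL and pandas fit together here, the snippet below runs a query and loads the result into a DataFrame. The table name `Player_Attributes`, the column names, and the file name `database.sqlite` mentioned in the comment are assumptions about the Kaggle download, not something this README states. The rows are made up so that the example runs on its own.

```python
import sqlite3

import pandas as pd

# Assumption: with the real download this would be something like
# sqlite3.connect("database.sqlite"). An in-memory database with
# made-up rows keeps the sketch self-contained.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Player_Attributes ("
    "player_api_id INTEGER, overall_rating INTEGER, "
    "potential INTEGER, reactions INTEGER)"
)
conn.executemany(
    "INSERT INTO Player_Attributes VALUES (?, ?, ?, ?)",
    [(1, 67, 71, 60), (2, 74, 76, 72), (3, 81, 84, 80), (4, None, None, None)],
)
conn.commit()

# Select only rows that have a rating, then hand the result to pandas.
query = """
SELECT player_api_id, overall_rating, potential, reactions
FROM Player_Attributes
WHERE overall_rating IS NOT NULL
"""
attributes = pd.read_sql_query(query, conn)
conn.close()

print(attributes.head())
```

Filtering in SQL before loading keeps the DataFrame small. The same filter could equally be applied in pandas with `dropna`.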
- Which player attribute contributes most to a player's overall rating?
- What attributes set players apart?
- Strong positive linear correlation between
  a. Overall Rating and Potential (Coeff. = 0.7840)
  b. Overall Rating and Reactions (Coeff. = 0.7248)
- Defending and goalkeeping attributes as a whole set players into subgroups
  a. Total Defending Score: marking, standing tackle and sliding tackle
  b. Total Goalkeeping Score: diving, handling, kicking, positioning and reflexes
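The findings above could be reproduced along the following lines. This is a minimal sketch, not the project's actual code. The snake_case column names are assumed, the rows are invented for illustration (so the coefficients will not match the ones reported above), and PCA is done with a plain NumPy SVD on standardized columns, which may differ from the implementation the project used.

```python
import numpy as np
import pandas as pd

# Made-up rows standing in for the player attribute table (illustrative only).
players = pd.DataFrame({
    "overall_rating": [62, 68, 74, 79, 85, 70],
    "potential": [70, 72, 78, 80, 88, 71],
    "reactions": [55, 66, 70, 78, 86, 64],
    "marking": [60, 25, 70, 30, 20, 12],
    "standing_tackle": [63, 28, 74, 33, 22, 13],
    "sliding_tackle": [61, 24, 72, 29, 19, 11],
    "gk_diving": [8, 10, 9, 12, 7, 72],
    "gk_handling": [9, 11, 8, 10, 9, 70],
    "gk_kicking": [10, 9, 11, 12, 8, 68],
    "gk_positioning": [7, 12, 10, 9, 11, 71],
    "gk_reflexes": [9, 10, 12, 11, 8, 75],
})

# Pearson correlation of every attribute with the overall rating,
# strongest positive relationship first.
corr_with_rating = (
    players.corr(method="pearson")["overall_rating"]
    .drop("overall_rating")
    .sort_values(ascending=False)
)
print(corr_with_rating)

# Composite scores: sum the attributes listed for each group.
defending_cols = ["marking", "standing_tackle", "sliding_tackle"]
goalkeeping_cols = [
    "gk_diving", "gk_handling", "gk_kicking", "gk_positioning", "gk_reflexes",
]
players["total_defending"] = players[defending_cols].sum(axis=1)
players["total_goalkeeping"] = players[goalkeeping_cols].sum(axis=1)

# PCA via SVD: standardize each column, then decompose.
features = players[defending_cols + goalkeeping_cols]
standardized = ((features - features.mean()) / features.std(ddof=0)).to_numpy()
_, singular_values, components = np.linalg.svd(standardized, full_matrices=False)

# Share of total variance captured by each principal component.
explained_variance_ratio = singular_values**2 / np.sum(singular_values**2)

# Coordinates of each player along the principal components.
scores = standardized @ components.T
print(explained_variance_ratio.round(3))
```

Plotting the first two columns of `scores` is one way to check whether defenders and goalkeepers separate into visible subgroups.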