This Jupyter notebook provides a comprehensive analysis of IMDb data, uncovering insights into movies, ratings, and trends over the years. It also includes machine learning models to predict movie ratings and identify key factors influencing a movie's success.
- Data Cleaning: Preprocessing steps to clean and prepare the data for analysis.
- Exploratory Data Analysis (EDA): Visualizations and statistics to explore the dataset and understand its characteristics.
- Machine Learning:
  - Rating Prediction: Building and evaluating models to predict movie ratings based on various features.
  - Feature Importance: Identifying key factors that influence a movie's rating and success.
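The cleaning, rating-prediction, and feature-importance steps listed above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the data is synthetic, the column names (`runtime_minutes`, `num_votes`, `release_year`, `rating`) are hypothetical, and the random forest is only one plausible model choice.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the IMDb table; all column names are hypothetical.
rng = np.random.default_rng(0)
n = 300
movies = pd.DataFrame({
    "runtime_minutes": rng.integers(70, 200, size=n).astype(float),
    "num_votes": 10 ** rng.uniform(2, 6, size=n),
    "release_year": rng.integers(1980, 2024, size=n).astype(float),
})
# Made-up target: rating driven mostly by log(num_votes) plus noise.
movies["rating"] = 3.0 + np.log10(movies["num_votes"]) + rng.normal(0, 0.3, size=n)
# Inject a few missing values so the cleaning step has something to do.
movies.loc[::25, "runtime_minutes"] = np.nan


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop exact duplicate rows and fill missing runtimes with the median."""
    out = df.drop_duplicates().copy()
    median_runtime = out["runtime_minutes"].median()
    out["runtime_minutes"] = out["runtime_minutes"].fillna(median_runtime)
    return out


cleaned = clean(movies)
features = ["runtime_minutes", "num_votes", "release_year"]
X_train, X_test, y_train, y_test = train_test_split(
    cleaned[features], cleaned["rating"], test_size=0.2, random_state=0
)

# Fit a regressor and evaluate it on the held-out rows.
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
mae = mean_absolute_error(y_test, model.predict(X_test))

# Impurity-based importances, one value per feature, summing to 1.
importances = pd.Series(model.feature_importances_, index=features)
importances = importances.sort_values(ascending=False)

print(f"MAE: {mae:.3f}")
print(importances)
```

Impurity-based importances are convenient but can favor high-cardinality numeric features; `sklearn.inspection.permutation_importance` is a common cross-check.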
- Python 3.x
- Jupyter Notebook
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Scikit-learn
- NetworkX: For constructing and analyzing the network graph.
- Pandas and NumPy: For data manipulation and analysis.
- Matplotlib and Seaborn: For data visualization.
- Machine learning libraries (like scikit-learn) for predictive modeling.