Giter Club home page Giter Club logo

used-automobile-price-prediction's Introduction

Used Automobile Price Prediction

Overview

This project aims to predict automobile prices based on a dataset containing information about various automobile attributes. It involves data analysis, preprocessing, modeling, and interpretability techniques. The primary goal is to develop a regression model that accurately predicts the price of automobiles.

Table of Contents

Dataset

The dataset used in this project is named 'autos.csv.' It contains information about automobiles, including features such as 'price,' 'yearOfRegistration,' 'powerPS,' 'kilometer,' 'model,' 'vehicleType,' 'gearbox,' and more. The dataset is used to train and evaluate the regression model.

Data Analysis

The project begins with data analysis, where we explore the dataset to understand its structure. This includes examining the shape, columns, data types, and summary statistics of the dataset.

Capture d’écran 2023-11-05 222135

Data Preprocessing

Data preprocessing involves several steps:

  • Handling missing values in categorical features.
  • Extracting date-related features from 'dateCreated,' 'dateCrawled,' and 'lastSeen.'
  • Handling outliers in specific columns.
  • Feature selection to remove constant and quasi-constant features.

Modeling

The project implements and evaluates several regression models, including:

  • Linear Regression
  • Decision Tree Regression
  • Random Forest Regression
  • XGBoost Regression

The models are trained to predict automobile prices based on the dataset's features.

Model Evaluation

The models' performance is evaluated on a test dataset. Metrics like R-squared (R2), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) are used to assess model accuracy and generalization.

Model Interpretability

To gain insights into the models' decision-making processes, SHAP (SHapley Additive exPlanations) values are used for interpretability. SHAP values help understand feature importances and the impact of each feature on predictions.

Capture d’écran 2023-11-05 222059

Usage

You can use this project to:

  • Predict automobile prices based on given attributes.
  • Analyze the importance of different features in predicting prices.
  • Customize and improve the regression models.

Feel free to adapt and extend the code and analysis for your specific needs.

used-automobile-price-prediction's People

Contributors

khames-lab avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.