Giter Club home page Giter Club logo

predicting-apple-quality's Introduction

๐ŸŽ Predictive Classification of Apples ๐ŸŽ

Table of Contents

  1. Project Overview
  2. Data
  3. Methodology
  4. EDA
  5. Modification of Data
  6. Visualization
  7. Discussion of Results

Project Overview

The purpose of this project is to create predictive classifiers to determine the quality of apples. These predictions are based on variables related to the quality of apples using features such as ripeness, crunchiness, weight, acidity, etc. The models can be utilized by farmers or produce sellers to determine the financial value of their product or to predict the quality of future anticipated harvests.

Data

The data was retrieved in the form of a CSV file from Kaggle.com and uploaded by Nidula Elgiriyewithana who states that the set was provided by a nameless American agriculture company. https://www.kaggle.com/datasets/nelgiriyewithana/apple-quality

Methodology

  • Python: EDA, modification, cleaning, and modeling

EDA

The uploader kindly took the liberty of scaling and cleaning the data. There were some remaining NA values which I cleaned prior to modeling.

Modification of Data

  • Column: A_id | Represents unique identifiers for each fruit. This feature was not required for modeling and was dropped from the dataframe.
  • Column: Quality | Initially the quality of the apples was listed as either "good" or "bad". For the sake of convenience these values were re-mapped to numerical binary values. Good apples are transformed to "1" and bad apples are "0"

Visualization

The data can be visualized utilizing a kernel density estimate (KDE) plot which covers all of the variables in the set. Apple KDE

Discussion of Results

model accuracy

Observing the results the K-Nearest Neighbor(KNN) model is the most accurate at 90.08% while our least accurate model is Naive Bayes at 72.25%. Ideally we would like to see values of >= 85% and 6 out of 10 models were able to meet this success rate. Implementation of the KNN model will lead to accurate predictions in 9 out of 10 evaluations.

predicting-apple-quality's People

Contributors

danielwrightgit avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.