Giter Club home page Giter Club logo

breast_cancer_prediction's Introduction

Breast Cancer Predictor: Project Overview

Computer Vision Project Kaggle Dataset

  • Built a model that accepts cell nucleus values features of a breast cancer tumor as input and predicts if the cancer is Benign or Malignant.
  • Model is trained on a dataset of 570 Breast Cancer Images from the Kaggle Wisconsin UCI Breast Cancer dataset.
  • Data was trained on 5 different models. K-fold cross-validation was performed to validate for overfitting and a final trained Support Vector Machine (SVM) model was used to build the predictor.

Code and Resources used

Libraries Badge
GitHub Notebook YouTube Video Article

For Web Framework Requirements: pip install -r requirements.txt

Data Cleaning

Following changes were made to the data to make it usable for a model:

  • Column with Null Values was removed.
  • Got the count of malignant vs benign tumor cells.
  • Performed encoding to to represent categorical variables as numerical values to use it in the ML model.

EDA

Various analysis was made related to the dataset and the models. Below are a few highlights.

alt text alt text

Model Building

StandardScaler method was used to remove the mean and scale each feature/variable to unit variance. The data was split into train and test sets with a test size of 20%.
Five different models were tried and evaluated based on their metrics:

  • Logistic Regression
  • K-Nearest Neighbor
  • Decision Tree Classifier
  • Random Forest Method
  • Support Vector Machines

Model performance

The SVM model outperformed the other approaches on the test and validation sets.

  • Random Forest : Accuracy = 94.73%
  • Decision Tree : Accuracy = 93.85%
  • Logistic Regression: Accuracy = 96.49%
  • K-NN: Accuracy = 95.61%
  • SVM: Accuracy = 98.24%

Deployment

A Final Trained model was built on SVM where the input of the nucleus features are accepted from the user and the model predicts if the tumor is malignant or benign. The Final model can be downloaded from svm_model.pkl

breast_cancer_prediction's People

Contributors

sughoshdeshpande7 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.