shrijayan / heart-disease-prediction

Heart disease is a major global health concern that affects millions of people around the world. Early detection and accurate prediction of heart disease can help to prevent the progression of the disease and save lives. In this project, we aim to develop a predictive model for heart disease using various machine learning algorithms.

Jupyter Notebook 100.00%
machine-learning decision-tree lightgbm logistic-regression naive-bayes random-forest svm-classifier xgboost

Heart Disease Prediction

In this project we predict the risk of heart disease using a real dataset obtained from a medical university in Chennai.

DATASET

The dataset we used contains more than 150 patient records, including data on the patients' daily routines, so the model is trained on real-world data. The dataset contains many features, such as age, dietary habit, and whether or not the patient consumes alcohol.

Dataset : https://archive.ics.uci.edu/ml/datasets/Heart+Disease

Data Preprocessing:

We preprocessed the dataset using scikit-learn (sklearn):

The sklearn.preprocessing package provides several common utility functions and transformer classes to change raw feature vectors into a representation that is more suitable for the downstream estimators. In general, learning algorithms benefit from standardization of the data set.
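As a minimal sketch of the standardization step described above, the following fits a `StandardScaler` on the training split only and reuses it on the test split. The feature values, their meaning, and the split ratio are made up for illustration and are not taken from the project's notebook.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Hypothetical feature matrix: two numeric columns standing in for
# patient attributes (illustrative values only).
X = np.array([
    [63, 145], [37, 130], [41, 130], [56, 120], [57, 140],
    [44, 120], [52, 172], [54, 150], [48, 110], [49, 130],
], dtype=float)
y = np.array([1, 1, 1, 0, 0, 1, 0, 1, 0, 0])

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Fit the scaler on the training data only, then apply the same
# transformation to the test data so no test statistics leak in.
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

print(X_train_scaled.mean(axis=0))  # close to 0 for each column
print(X_train_scaled.std(axis=0))   # close to 1 for each column
```

Fitting on the training split alone is a common design choice: the test split is then transformed with statistics it did not contribute to.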

Models Used:

  1. SVM
  2. Naive Bayes
  3. Logistic Regression
  4. Decision Tree
  5. Random Forest
  6. LightGBM
  7. XGBoost
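A minimal sketch of how models like those listed above could be trained and compared on training and testing scores. It uses synthetic data from `make_classification` rather than the project's dataset, and the default hyperparameters are an assumption. LightGBM and XGBoost are left out so the sketch depends only on scikit-learn; `lightgbm.LGBMClassifier` and `xgboost.XGBClassifier` follow the same `fit`/`score` interface and could be added to the dictionary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): not the project's patient records.
X, y = make_classification(n_samples=200, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

models = {
    "SVM": SVC(),
    "Naive Bayes": GaussianNB(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    # For scikit-learn classifiers, score() returns mean accuracy.
    train_score = model.score(X_train, y_train)
    test_score = model.score(X_test, y_test)
    scores[name] = (train_score, test_score)
    print(f"{name}: train={train_score:.3f}, test={test_score:.3f}")
```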

Model Prediction Scores (fractions of 1):

SVM: Support Vector Machine

    ~Training Set Prediction : 0.6694214876033058
    
    ~Testing Set Prediction : 0.5737704918032787

Naive Bayes: A probabilistic classifier that applies Bayes' theorem, assuming the features are conditionally independent given the class.

    ~Training Set Prediction : 0.8677685950413223
    
    ~Testing Set Prediction : 0.7868852459016393

Logistic Regression: Logistic regression estimates the probability of disease from a given set of independent variables.

    ~Training Set Prediction : 0.8636363636363636
    
    ~Testing Set Prediction : 0.8032786885245902
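To make the probability estimate concrete, here is a minimal sketch of how a fitted logistic regression turns feature values into a probability. The weights, intercept, and feature values are hypothetical and are not the coefficients learned in this project.

```python
import math


def predict_probability(features, weights, intercept):
    """Return sigmoid(intercept + sum(w_i * x_i)) as the disease probability."""
    z = intercept + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical standardized features and coefficients, for illustration only.
features = [0.5, -1.2, 0.3]
weights = [0.8, -0.4, 1.1]
intercept = -0.2

p = predict_probability(features, weights, intercept)
print(f"Estimated probability: {p:.3f}")
# A threshold of 0.5 is a common default, assumed here.
print("Predicted class:", 1 if p >= 0.5 else 0)
```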

Decision Tree: A decision tree is a non-parametric supervised learning algorithm, which is utilized for both classification and regression tasks.

    ~Training Set Prediction : 1.0
    
    ~Testing Set Prediction : 0.7704918032786885
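A training score of 1.0 next to a lower testing score is typical of an unrestricted decision tree memorizing its training data. The sketch below, on synthetic data and with a hypothetical `max_depth` of 3, shows how one might compare an unrestricted tree against a depth-limited one; it does not reproduce the project's numbers.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): not the project's patient records.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

depth_scores = {}
for max_depth in (None, 3):  # None lets the tree grow until leaves are pure
    tree = DecisionTreeClassifier(max_depth=max_depth, random_state=0)
    tree.fit(X_train, y_train)
    depth_scores[max_depth] = (
        tree.score(X_train, y_train),
        tree.score(X_test, y_test),
    )
    print(f"max_depth={max_depth}: train={depth_scores[max_depth][0]:.3f}, "
          f"test={depth_scores[max_depth][1]:.3f}")
```

Whether a shallower tree improves the testing score depends on the data, so a setting like this is best chosen by validation rather than fixed in advance.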

Random Forest: An ensemble of decision trees, each trained on a bootstrap sample of the data, whose predictions are combined by majority vote.

    ~Training Set Prediction : 1.0
    
    ~Testing Set Prediction : 0.7704918032786885

LightGBM: Gradient Boosting Decision Tree (GBDT) algorithm with the addition of two novel techniques: Gradient-based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB).

    ~Training Set Prediction : 0.9958677685950413
    
    ~Testing Set Prediction : 0.7704918032786885

XGBoost: A gradient-boosted decision tree library. Its GPU tree-construction algorithm builds the decision tree for a given boosting iteration one level at a time, processing the entire dataset concurrently on the GPU.

    ~Training Set Prediction : 0.987603305785124
    
    ~Testing Set Prediction : 0.7540983606557377      

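CONCLUDED MODEL FROM PREDICTION

From the prediction results, Logistic Regression is the most suitable model. It has the highest testing-set score (0.8033) and the smallest gap between its training and testing scores, whereas Decision Tree and Random Forest reach 1.0 on the training set but only 0.7705 on the testing set.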
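The comparison behind this conclusion can be reproduced from the scores reported above. The sketch below ranks the models by testing score and prints the train-test gap; reading that gap as a rough overfitting indicator is an interpretation added here, not something stated in the original notebook.

```python
# Scores copied from the results listed above: (training, testing).
results = {
    "SVM": (0.6694214876033058, 0.5737704918032787),
    "Naive Bayes": (0.8677685950413223, 0.7868852459016393),
    "Logistic Regression": (0.8636363636363636, 0.8032786885245902),
    "Decision Tree": (1.0, 0.7704918032786885),
    "Random Forest": (1.0, 0.7704918032786885),
    "LightGBM": (0.9958677685950413, 0.7704918032786885),
    "XGBoost": (0.987603305785124, 0.7540983606557377),
}

# Rank by testing score, highest first.
ranked = sorted(results.items(), key=lambda item: item[1][1], reverse=True)
for name, (train, test) in ranked:
    print(f"{name:<20} test={test:.4f}  gap={train - test:.4f}")

best_model = ranked[0][0]
smallest_gap = min(results, key=lambda name: results[name][0] - results[name][1])
print("Highest testing score:", best_model)
print("Smallest train-test gap:", smallest_gap)
```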

Contributors

shrijayan, shangamesh2805
