med-264's Introduction

Investigating the association between Fitbit wearable data and self-reported measures of life satisfaction

This repository contains the code base for our MED-264 group's Final Project.

Data

The data is part of the All of Us Registered Tier Dataset (version 7). The notebooks were developed with Python 3.7 in the All of Us Jupyter Notebook environment.

Files

  1. data_collection.ipynb - Extracts the data from the All of Us dataset with a Google BigQuery query and saves it to the persistent disk of the created workspace.
  2. data_preprocessing.ipynb - Reads the saved dataframes and filters the feature list based on the observed missingness.
  3. data_cleaning.ipynb - Imputes the missing data for each feature with the patient-level mean.
  4. data_splitting.ipynb - Splits the dataset into train and test sets after feature engineering. The split ensures that no patient-level data leaks between the train and test sets.
  5. model_building.ipynb - Trains traditional machine learning models (Logistic Regression, Decision Tree Classifier, Random Forest Classifier, and XGBoost Classifier) on both multi-class and binary classification tasks. The results are available in the notebooks.
  6. data_correlation_and_statistics.ipynb - Captures general statistics about the population and the correlations among features.
  7. python_ordinal_regression.ipynb - Carries out ordinal regression to obtain the odds ratios and 95% confidence intervals. The statistical significance (p-values) is also reported in this notebook.
  8. assets/ - Contains all the illustrations derived from our study.
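
The cleaning and splitting steps (notebooks 3 and 4) can be illustrated with a minimal, dependency-free sketch. This is not the code from the notebooks: the `person_id` and `steps` field names, the helper names `impute_patient_mean` and `split_by_patient`, and the toy records are all assumptions made for illustration.

```python
import random
from collections import defaultdict
from statistics import mean


def impute_patient_mean(rows, feature):
    """Return copies of `rows` where missing (None) values of `feature`
    are replaced by the mean of that same patient's observed values."""
    observed = defaultdict(list)
    for row in rows:
        if row[feature] is not None:
            observed[row["person_id"]].append(row[feature])

    imputed = []
    for row in rows:
        new_row = dict(row)  # copy, so the input is not mutated
        values = observed.get(row["person_id"])
        if new_row[feature] is None and values:
            new_row[feature] = mean(values)
        imputed.append(new_row)
    return imputed


def split_by_patient(rows, test_fraction=0.2, seed=0):
    """Split rows so that every patient lands entirely in train or in test."""
    patient_ids = sorted({row["person_id"] for row in rows})
    rng = random.Random(seed)
    rng.shuffle(patient_ids)
    n_test = max(1, round(len(patient_ids) * test_fraction))
    test_ids = set(patient_ids[:n_test])
    train = [row for row in rows if row["person_id"] not in test_ids]
    test = [row for row in rows if row["person_id"] in test_ids]
    return train, test


# Hypothetical toy records: one row per patient-day (field names are assumed).
rows = [
    {"person_id": 1, "steps": 4000.0},
    {"person_id": 1, "steps": None},
    {"person_id": 1, "steps": 6000.0},
    {"person_id": 2, "steps": 9000.0},
    {"person_id": 2, "steps": None},
    {"person_id": 3, "steps": 7000.0},
    {"person_id": 4, "steps": None},  # no observed values, so it stays missing
]

clean = impute_patient_mean(rows, "steps")
train, test = split_by_patient(clean, test_fraction=0.25, seed=42)
```

Splitting on patient identifiers rather than on individual rows is what keeps one person's records from appearing on both sides of the split. A patient with no observed values for a feature cannot be imputed this way and would need separate handling.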

Explainability of Random Forest Classifier

Random Forest Feature Importance (Binary Classification)

[Figure: Random Forest Feature Importance]

Random Forest Feature Importance (Multi-class Classification)

[Figure: Random Forest Feature Importance]

Acknowledgements

We would like to thank Dr. Tsung-Ting Kuo (instructor) for arranging lectures by various other lecturers for our sessions. We would also like to thank the TAs of this course, Grace Yufei Yu and Aaron Boussina.

Contributors

vishaln15
