Giter Club home page Giter Club logo

Welcome!

🔬 Who Am I?: I am a scientist that wears many hats. By training, I'm a Biostatistician with a Master's degree from California State University, but I also have a background in data analytics and data science. My journey in data analysis and statistical programming has been driven by a long-standing deep interest in public health epidemiology.

📊 My Experience: Previously, I've done modeling work to learn about the impact of diet on cardiovascular disease in US adults. My current focus: working with my lab to analyze the impact of certain xenobiotics on bronchopulmonary dysplasia in infants.

🌟 My Goal: My current goal is to improve and expand my statistical knowledge and skillset to better answer unsolved questions in the public health arena. I hope to one day give back and make great contributions to the field of biostatistics in a way that has a massive positive impact on people's health from all walks of life--whether locally or globally.

❤️‍🔥 My Hobbies: In my free time I like to learn new things, meditate, read, write, and analyze sports data. My current passion projects include: learning Python, studying African history, making art, and indoor-gardening.

Highlighted Projects

🧬 Survival Analysis: Unraveling Factors in Post-Transplant Disease

Paper | Repository

  • Objective: To identify factors influencing chronic graft-versus-host disease onset in allogeneic bone marrow transplant recipients across 4 different clinical sites: The Ohio State University Hospitals, Hahnemann in Philadephia, St. Vincent's Hospital in Sydney, Australia, and Alfred Hospital in Melbourne.
  • Analysis/Tools: Survival analysis techniques in SAS: Kaplan-Meier, AFT modeling.
  • Outcome: Uncovered disparities in CGVHD onset across different demographics and clinical sites; St. Vincent patients experiencing a 35% longer time to CGVHD and Alfred patients experiencing a 10.3% shorter time.

🏀 Forecasting Wins and Losses with Machine Learning

Presentation | Repository

  • Focus: Forecasted the full 82 game season performance of the Lakers using data from 73 games.
  • Analysis/Tools: Applied linear discriminant analysis to predict win-loss records.
  • Outcome: Forecasted that had the Lakers played all 82 games, they would have won 55 games, and lost 27, with a final win percentage of 67% and a loss percentage of 33%.

⛹🏾 ️ Regression Analysis of the Lakers' Regular Season

Presentation | Repository

  • Focus: Developing a predictive model for the Lakers' points per game using 5 key features.
  • Analysis: Multiple linear regression in R.
  • Outcome: Model explained 86% of the variance in the team's performance.

🫀 Tall People and Risk of Cardiovascular Disease

Presentation | Repository

  • Focus: Investigating the correlation between height and cardiovascular disease.
  • Analysis: Examined a large dataset from Kaggle.
  • Outcome: Identified a 47% probability of CVD in taller individuals.

🫘 Fixed-Effect Analysis of Baseline Serum Creatinine across sex and blood pressure levels

Presentation | Repository

  • Focus: Is there an observable difference in baseline creatinine levels and blood pressure between sexes.
  • Analysis: Examined a kaggle dataset of electronic medical records of 491 patients from Tawam hospital in Al-Ain city in the United Arab Emirates in 2008 using SAS.
  • Outcome: There is a difference in baseline creatinine across sexes, and it is statistically significant. On average, women over 20 µmol/L lower compared to men in the dataset. There was no effect of blood pressure on serum creatinine.

Azuka Atum's Projects

cardiostatsusa icon cardiostatsusa

R Shiny application for exploring 20 years of blood pressure data from NHANES

ckdproj icon ckdproj

ANOVA on Chronic Kidney Disease from data collected Electronic medical records of 491 patients collected at the Tawam Hospital in Al-Ain city (Abu Dhabi, United Arab Emirates), between 1st January and 31st December 2008.

cvdheight icon cvdheight

Analysis of patient data from a kaggle dataset to assess if tall people's risk of developing cardiovascular disease was higher than short people's.

forecasting-wins-and-losses-with-machine-learning icon forecasting-wins-and-losses-with-machine-learning

Using Linear Discriminant Analysis - analyzing 2019-2020 regular season data to determine if there would be a difference in Lakers' regular season standing if the 2020 pause during the COVID-19 pandemic never happened.

jasa icon jasa

Quarto template for the American Statistical Association Journals

jss icon jss

Quarto template for the Journal of Statistical Software

markdown-templates icon markdown-templates

Templates for creating scientific, academic and technical documents using Markdown and Pandoc that support equations, code blocks, citations, Unicode symbols and vector graphics.

mlrlakers2122 icon mlrlakers2122

Multiple Linear Regression Analysis of the Lakers 21-22 Regular Season - assessing if 5 key predictors (FT%, FG%, games played, personal fouls, steals, and blocks) are good indicators of PPG performance during the regular season.

survgvhd icon survgvhd

Survival Analysis of time-to-Chronic Graft-versus-Host Disease in leukemia patient event data across 4 clinical sites with respect to 5 clinical characteristics: age, sex, cytomegalovirus immune status, waiting time to transplant, and disease group.

swimmer-plots icon swimmer-plots

Slides for my R/Medicine 2022 talk, "Making Swimmer Plots for Longitudinal Data using {ggplot}".

yihui icon yihui

Personal website of Yihui Xie

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.