Shan Tong 's Projects
An analysis on what factor determines in-state tuition and the difference between African-American enrollment percentage in different institutions compared to other major races
This is an analysis on movies created in 2014 and 2015 which utilizes multiple linear regression analysis to predict the financial success of films and investigate the relationship between screens and year.
Forecasting Short-term Future COVID19 Cases
This is an SQL project focused on the use of SQL for data exploration and data cleaning on COVID data.
This folder contains my progress and solutions for the case studies for the 8 Week SQL Challenge prepared by Danny Ma.
The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.
Collection of useful data science topics along with code and articles
This project aims to recreate some of the machine learning methods Nate Silver used in 2016, by using the actual election data and determining how accurate some of these methods were at predicting the final results. The methods we used were Principal Component Analysis, Hierarchical Clustering, Decision Trees, Logistic Regression and Lasso Regularization. In addition to using those methods, we performed other classification methods such as K-Nearest Neighbors and Random Forest and explored the possibility of Simpsonβs Paradox in our dataset used for the algorithms.
Our project aims to explore whether police shootings are racially biased and if certain racial groups are targeted more. We also aim to look specifically at California which is the State that has the most police shootings and to see if there are any racial disparities in its victims.
A robot powered training repository :robot:
These are homework assignments from my Introduction to Machine Learning course for Fall 2020. In this class, I learned about the concepts of basic statistical machine learning and applying them to discover patterns and relationships in large data sets.
A testing repo for the UCSB Data Science Capstone Preparation Workshop