Emily Maynard's Projects
Pick one of 50 datasets and use PySpark to perform the ETL process to extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into pgAdmin. Next, use PySpark to determine if there is any bias toward favorable reviews from Vine members.
Use bike sharing data from one city to help determine if the program will be successful in another city. Use Tableau to create a dashboard with data visualizations.
Using supervised machine learning to predict credit risk. Trying oversampling, under sampling, combination sampling and ensemble learning to find the model with the best fit
Columbia University final group project
Unsupervised machine learning to preform principal component analysis, and use clustering to determine trends in different crytocurrencies.
Using Python to automate an election analysis. Using for loops and practice coding!
Config files for my GitHub profile.
An analysis of renewable energy stocks using VBA
Using excel to analyze crowdfunding data
Using D3, geoJson, Mapbox, API calls, and Leaflet to create a map plotting earthquakes over the last seven days from USGS data with magnitude.
Using R and statistics in RStudio to analyze different variables for review. Use the production data for insights that may help the manufacturing team. Perform multiple linear regression analysis to identify which variables in the dataset predict the mpg .
Using webscraping, this analysis visits the NASA Mars news. page and Space images pages to scrape data which is then displayed on a website about Mars.
Learning about model optimization, choosing good metrics, resampling, and hyperparameters.
This project uses exploratory data analysis or EDA to scrape movie data from the internet. The ETL process cleans files, merges dataframes, and publishes to a SQL database. This code creates functions, uses list comprehension, and utilizes regular expressions to clean the web data. After cleaning the data step by step, the code was refactored to take three files in a function order to automate the pipeline.
Exploring machine learning with nueral networks for a charity analysis. Adjusting the model to try and improve accuracy to predict which projects are likely to be successful.
Using SQL to analyze HR data and link tables to determine a process for company retirement planning and mentorship programs.
Plant Phenology study with IPF
This analysis uses Plotly to make interactive charts for a study on belly button biodiversity. The analysis uses
A portfolio with info about me and some of my projects
Use Matplotlib to visualize ride sharing data in different types of cities to determine trends for program improvements.
This analyzes and compares schools in a district for a school board to allocate funding and resources based on school performance
Using logistic regression and random forest classifier to compare models and try to create a span detection model.
The analysis uses python and pandas with SQLite and SQlalchemy to analysis how weather may impact the idea for a surf and ice cream shop in Hawaii.
Use D3 and javascript to create a website that allows users to filter for UFO criteria to find out about various UFO sightings. Using bootstrap for formatting.
Using API calls to GoogleMaps and OpenWeather, this analysis uses certain weather criteria to plan a vacation itinerary with hotel stops.