
harpproject2021's Introduction

Data Science Project for HARP Internship

All the visualizations are in the Visualizations folder. A video presentation is linked here, and a PowerPoint presentation can be accessed online here. If you want to read more about the process behind the project, I have detailed it on my portfolio website here.

Overview

This summer, I interned at Harassment and Reporting Platform, a non-profit organization whose goal is to increase awareness of assault and harassment. We aim to gather crowd-sourced contextual data, analyze it, and create a cohesive narrative that bridges the gap between technical research and public understanding. As a member of the data team, I could explore and propose a data science project on a topic of personal interest related to harassment and assault.

Goals

The main aim of the project is to gain insight into the social media representation of hate crime incidents against Asian Americans.

Methods

For our purposes, we chose the New York Times because it is reputable and has clear API documentation.

  • Request article data from New York Times using the NYT API

  • Parse the data into the format that we want and save it to a CSV file.

    • The Asians American NYT Dataset.csv file contains all the headlines for the tag, while the updated CSV file covers only the pandemic timeframe.

  • Download the US statistics on Covid cases from the New York Times repository

  • Explore the NYT dataset

  • Merge the two datasets

  • Create word clouds for all the headlines and for the headlines published during the pandemic

  • Create heatmap visualizations

  • Create visualizations of case percentage versus the count of each subject

  • Create subject counts visualization
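
The parse, save, and merge steps above can be sketched roughly as follows. This is a minimal illustration rather than the project's actual code: it assumes article records shaped like the `response.docs` entries returned by the NYT Article Search API (`headline.main`, `pub_date`, `keywords`), and it uses only the standard library, whereas the project may well have used pandas. The sample records, the case counts, and the function names `parse_docs`, `write_csv`, and `merge_with_cases` are all made up for the example.

```python
import csv
import io
from collections import Counter

# Hypothetical records imitating the shape of Article Search `response.docs`.
SAMPLE_DOCS = [
    {"headline": {"main": "Example headline A"},
     "pub_date": "2021-03-17T09:00:12+0000",
     "keywords": [{"name": "subject", "value": "Asian-Americans"},
                  {"name": "subject", "value": "Hate Crimes"}]},
    {"headline": {"main": "Example headline B"},
     "pub_date": "2021-03-17T15:30:00+0000",
     "keywords": [{"name": "glocations", "value": "New York City"}]},
    {"headline": {"main": "Example headline C"},
     "pub_date": "2021-03-18T08:00:00+0000",
     "keywords": []},
]


def parse_docs(docs):
    """Flatten article records into rows of date, headline, and subjects."""
    rows = []
    for doc in docs:
        # Keep only keywords tagged as subjects; other kinds are skipped.
        subjects = [k["value"] for k in doc.get("keywords", [])
                    if k.get("name") == "subject"]
        rows.append({
            "date": doc["pub_date"][:10],  # keep the YYYY-MM-DD part only
            "headline": doc["headline"]["main"],
            "subjects": "; ".join(subjects),
        })
    return rows


def write_csv(rows, handle):
    """Write the parsed rows to an open text handle as CSV."""
    writer = csv.DictWriter(handle, fieldnames=["date", "headline", "subjects"])
    writer.writeheader()
    writer.writerows(rows)


def merge_with_cases(rows, cases_by_date):
    """Join daily headline counts with case counts on the date key.

    Dates missing from either side are dropped (an inner join).
    """
    counts = Counter(row["date"] for row in rows)
    return [
        {"date": day, "headlines": counts[day], "cases": cases_by_date[day]}
        for day in sorted(counts)
        if day in cases_by_date
    ]


rows = parse_docs(SAMPLE_DOCS)
buffer = io.StringIO()  # stands in for a file opened with newline=""
write_csv(rows, buffer)

# Invented case counts keyed by date, standing in for the Covid statistics.
merged = merge_with_cases(rows, {"2021-03-17": 100, "2021-03-19": 300})
```

With a real file, `write_csv` would receive a handle from `open(path, "w", newline="")`. The inner join is a design choice made for this sketch; keeping unmatched dates would require filling the missing counts with a default value.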

If you have further questions, please feel free to contact me through GitHub or visit my personal website for more social media accounts. Thank you very much!
