Giter Club home page Giter Club logo

newsinequalitycheckup's Introduction

#Mapping news inequality

##How even legitimate news sources can paint incomplete and misleading pictures of places they cover.

The problem: Not all communities receive equal coverage, and what little coverage they do get can be skewed towards "crimes and fires".

Over time, this erodes community trust and confidence in the media’s accountability, transparency, and accuracy.

This project begins to build out ways of investigating, parsing, and visualizing newspaper coverage by geographic area, with the goal of exposing subtle bias in the areas of coverage.

View the website to see our visual analysis of 2 case studies:

  • How local blogs cover Williamsburg, Brooklyn vs. Marine Park, Brooklyn
  • How a Portland newspaper covers Southern Maine

Guiding questions

  • Spatial inequality in news coverage -- are communities getting covered equally? Are certain communities being overlooked?
  • Transparency/accountability by news orgs -- how can news orgs audit their coverage for gaps?

Solution: A set of tools to analyze equality in coverage

Using a measure of "articles per capita" and some basic word frequency analysis, we visualized the most attention-grabbing places, and the different types of words used in headlines about each.

We'd like to develop a tool for newsrooms and civic advocates to assess local news coverage and promote more equitable distribution of journalism resources.

Process notes/documentation

###What type of data you can use to visualize news coverage

  • Tags with place names, scoped to the area you want to look at (neighborhoods in a city, towns in a state, etc.)
    • We used Gothamist tag pages, since they tag each story by NYC neighborhood
    • On DNAinfo, used search results by keyword (e.g. "Marine Park" Brooklyn)
    • For Maine, used an exported list of towns from story tags, via Wordpress
  • Headlines/ledes associated with each tag, for word analysis
    • This will usually be included in a tag page or search results page

###Getting the data

  • Web scraper + HTML parser for scraping tag pages
    • Various tools would work here: Python + Beautifulsoup, R
  • Geographic tags from a WordPress site (sometimes you can export these)

###Outcomes

  • There were no clear correlations The 2 places (Maine/Brooklyn) were really different cases that looked different

    • Maine: pretty even per capita coverage across household income levels (so coverage was more or less proportionate with population)
    • Brooklyn: extremely uneven coverage, not related to population (Marine Park has more people than Williamsburg, but only gets a fraction of the coverage)
  • Context is important in interpreting the data The areas in Maine that received disproportionate coverage had specific reasons why (ongoing scandal increased coverage of a small town; major state employer based in a smaller town)

    • Would caution anyone doing this on a large scale to take care interpreting the results for this reason

###Hypotheses! Or what we want to explore next

  • who is any given media organization alienating?

  • it's harder to trust an organization if you don't see yourself or your community represented -- is trust lower in communities that get less coverage?

  • are communities that get fewer journalism resources more susceptible to disinformation?

###Types of data that could be used for comparison

  • 311 calls
  • Household income
  • Voting data
  • U.S. census rural/urban rating

newsinequalitycheckup's People

Contributors

vigorousnorth avatar cathydeng avatar hamdanazhar avatar leslie-s avatar

Stargazers

Phillip Smith avatar  avatar Mateo Salinas Clarke avatar  avatar Michael Morisy avatar

Watchers

Michael Morisy avatar James Cloos avatar  avatar  avatar  avatar  avatar

Forkers

cathydeng

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.