
memeanalyze's Introduction

Studying Web Culture: Acquisition of a Meme Corpus and Analysis of Textual Components

Abstract

Memes are a popular way to express oneself online in today’s internet culture. Crowd-sourced encyclopedias such as knowyourmeme.com exist solely to document the many different kinds of memes. In this paper, we present a novel quantitative approach to meme analysis. First, we gather a corpus from this encyclopedia and apply OCR to extract the textual data contained in the memes’ images. We then use common text mining metrics and techniques to explore the data set via sentiment analysis, POS tagging and topic modelling. Our results show that the language used in memes differs greatly from that of common literature. Although memes are used in many different contexts, they all follow certain syntactic or pragmatic design patterns.

Structure of this repository

The crawler source files for collecting the meme text data can be found in the nodeyourmeme directory.

The R folder contains all files used to explore the gathered corpus. This includes R files for:

  • word occurrence statistics and word clouds
  • sentiment analysis (standard SA and emotion analysis)
  • topic modeling
  • pre-processing files for text cleaning and stemming/lemmatisation

Also included in the R directory are:

  • a folder with the stop word lists that were used
  • an out folder for the generated CSV files that were used in the paper
  • an img folder containing all relevant images that were generated
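As a rough illustration of the word occurrence step listed above, the following Python sketch counts words after removing stop words. It is not the repository's R code: the stop word set, the tokenization pattern, the function name `word_counts` and the sample strings are all assumptions made up for this example.

```python
import re
from collections import Counter

# Placeholder stop word set; the repository ships its own lists in the R folder.
STOP_WORDS = {"a", "the", "is", "to", "of", "and"}


def word_counts(texts, stop_words=STOP_WORDS):
    """Count lower-cased word tokens across all texts, skipping stop words."""
    counts = Counter()
    for text in texts:
        # Simple tokenization: runs of letters and apostrophes.
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in stop_words)
    return counts


# Made-up sample strings standing in for OCR output.
sample = ["The cat is grumpy", "The cat is tired"]
counts = word_counts(sample)
print(counts.most_common(3))
```

A table like this could then be written to a CSV file or fed into a word cloud, which matches the kind of output the out and img folders are described as holding.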

memeanalyze's People

Contributors

dramsauer, mrroobot, phhartl


memeanalyze's Issues

Memeobject

Instead of passing every variable separately after a request has completed, a meme class should be created that holds all of the scraped values.
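One possible shape for such a class is sketched below in Python for illustration; the crawler's own language and data layout are not shown here. Every name in it (`Meme`, `parse_response`, the fields and the payload keys) is hypothetical.

```python
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class Meme:
    """Bundles everything scraped for one meme (all field names are hypothetical)."""
    name: str
    source_url: str
    image_urls: List[str] = field(default_factory=list)
    ocr_text: str = ""


def parse_response(payload: dict) -> Meme:
    """Build a Meme from an already-decoded response (payload keys are assumed)."""
    return Meme(
        name=payload["name"],
        source_url=payload["url"],
        image_urls=list(payload.get("images", [])),
        ocr_text=payload.get("text", ""),
    )


# Callers now pass around a single object instead of several loose variables.
meme = parse_response({"name": "Example Meme", "url": "https://example.org/meme"})
print(asdict(meme))
```

Keeping the values in one object means that a new field can be added without changing the signature of every function that handles a meme.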

Filenames can be invalid

Some meme names are formatted in a way that makes them invalid as file names, which causes problems when saving. Find a way to eliminate this problem, most likely by sanitizing the names with a regular expression.
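A minimal sketch of the regex approach, written in Python for illustration, could look like this. The function name `safe_filename`, the replacement character and the `"unnamed"` fallback are assumptions. The character set covers the characters Windows rejects in file names plus control characters, so it is stricter than some file systems need.

```python
import re

# Characters that are invalid in Windows file names, plus ASCII control characters.
_INVALID_CHARS = re.compile(r'[<>:"/\\|?*\x00-\x1f]')


def safe_filename(meme_name: str, replacement: str = "_") -> str:
    """Return a version of meme_name that is safe to use as a file name."""
    cleaned = _INVALID_CHARS.sub(replacement, meme_name)
    # Collapse whitespace runs, then drop leading/trailing spaces and dots.
    cleaned = re.sub(r"\s+", " ", cleaned).strip(" .")
    # Fall back to a fixed name if nothing usable is left (assumed policy).
    return cleaned or "unnamed"


print(safe_filename('What? / Why: "Meme"'))
```

Two different meme names can map to the same sanitized name, so a real crawler would probably also need a collision strategy, such as appending a numeric suffix.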
