Giter Club home page Giter Club logo

video-game-text-corpora's Introduction

video-game-text-corpora

Data and code for a paper about video game text corpora.

Datasets

  • Torchlight II quest texts: quest dialogue, main quest summaries and GUI text in CSV-format.
  • Star Wars: Knights of the Old republic: branching player and NPC dialogue in CSV-format.
  • The Elder Scrolls (Arena, Daggerfall, Morrowind, Oblivion, Skyrim and The Elder Scrolls: Online): in-game books in JSON-format.

Code

Each game-folder has a src/ folder that contains the code for creating the dataset. It should give some insight in how the data was extracted.

For TorchLight II and SW:KOTOR: before you can run the code, you should have access to the original game files from which the data was extracted.

Scientific paper

This repository is for the research paper Fantastic Strings and Where to Find Them: The Quest for High-Quality Video Game Text Corpora, to appear in the proceedings of INT 2020. Preprint version of the paper. If you use the data or code, please cite the following paper:

@inproceedings{vanstegeren2020fantastic,
    title = "{Fantastic Strings and Where to Find Them: The Quest for High-Quality Video Game Text Corpora}",
    author = {van Stegeren, Judith and Theune, Mari{\"e}t},
    booktitle = "Intelligent Narrative Technologies Workshop",
    month = oct,
    year = "2020",
    publisher = {AAAI Press},
}

Games

The corpora were extracted from three commercial video games. The games and the game assets are copyright the respective game publishers and game developers. If you use the datasets, don't forget to cite the games too!

@misc{game:starwarsknightsoftheoldrepublic,
title = {\emph{Star Wars: Knights of the Old Republic}},
year = {2003},
organization = {LucasArts},
publisher = {LucasArts},
author = {{BioWare}},
Howpublished = {Game [PC]},
Note = {LucasArts, San Francisco, US},
}

@misc{game:torchlight2,
title = {\emph{Torchlight II}},
year = {2012},
organization = {Runic Games},
publisher = {Runic Games},
author = {{Runic Games}},
Howpublished = {Game [PC]},
Note = {Runic Games, Seattle, Washington, US},
}

@misc{gamesseries:tes,
title = {\emph{The Elder Scrolls I-V} and \emph{The Elder Scrolls Online}},
date = {1994/2014},
year = {1994--2014},
organization = {Bethesda Softworks},
publisher = {Bethesda Softworks},
author = {{Bethesda Softworks}},
Howpublished = {Game series [PC]},
Note = {Bethesda Softworks, Rockville, Maryland, US},
}

video-game-text-corpora's People

Contributors

jd7h avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

video-game-text-corpora's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.