Giter Club home page Giter Club logo

ntunlplab / lifelog-vislife Goto Github PK

View Code? Open in Web Editor NEW
2.0 0.0 0.0 8 KB

Recently, people tend to record their daily life via filming Video Weblog (VLog), which contains visual and audio data. These large scale multimodal data can be used to support information recall service that enables users to query their past experiences. To this end, we construct a visual lifelogging dataset for investigating the issues of personal life event extraction from vlogs shared on YouTube and constructing a personal knowledge base (PKB) for individuals. There are 1,733 videos from three selected YouTubers ranging from 2016 to 2019. The videos we crawled are all about traveling.

lifelog-vislife's Introduction

VisLife [viz laɪv]: Visual Lifelogging Dataset

Personal Knowledge Base Construction from Multimodal Data

Introduction

Recently, people tend to record their daily life via filming Video Weblog (VLog), which contains visual and audio data. These large scale multimodal data can be used to support information recall service that enables users to query their past experiences. To this end, we construct a visual lifelogging dataset for investigating the issues of personal life event extraction from vlogs shared on YouTube and constructing a personal knowledge base (PKB) for individuals. There are 1,733 videos from three selected YouTubers ranging from 2016 to 2019. The videos we crawled are all about traveling.

Format

Each row of the csv files is consisted of video_id, list of action, list of video clip length and the path to the triple file.

The entry in the triple JSON format is composed of "subject", "verb", "object", "times", and "subtitle" of each life event.

The triple_mapping and verb_and_object_mapping provide the id of class (the id pair of verb and object).

Example

train.csv

    id, actions, length, path
    a2xDIcRUQ_I, c0000 13.2 53.2;c0001 206.33 246.33, PATH_TO_TRIPLE/CHANNEL_NAME/DATE/VIDEO_NAME/triples.json
    ...

triples.json

    {
        "subject": {
          "50": "i",
          ...
        },
        "verb": {
          "50": "finish",
          ...
        },
        "object": {
          "50": "pancakes",
          ...
        },
        "times": {
          "50": "569.53",
          ...
        },
        "subtitle": {
          "50": "I'M gonna fall asleep on the bikes down,
                and this is how I feel when I can't finish the best pancakes in the world,
                but I did get a doggie bag and also I have one final wish yo.",
          ...
        }
      }

triple_mapping.json

    
    {
        "c0420": ["v0019", "o0115"],
        ...
    }

verb_and_object_mapping.json

   
    {
        "c0420": ["get", "food"],
        ...
    }

Download

Please write us an email with the agreement. Click here to download the agreement of VisLife.

Email Address: [email protected]

How to Cite the Corpus

Please cite the following papers when referring to the VisLife in academic publications and papers.

An-Zi Yen, Chia-Chun Chang, Hen-Hsen Huang, and Hsin-Hsi Chen (2021). “Personal Knowledge Base Construction from Multimodal Data.” In Proceedings of ACM International Conference on Multimedia Retrieval 2021 (ICMR 2021), August 21-24, 2021, Taipei, Taiwan. DOI:

License

VisLife is made available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

lifelog-vislife's People

Contributors

pcchen-ntunlp avatar

Stargazers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.