Giter Club home page Giter Club logo

pennaction's Introduction

============================================================
                     Penn Action Dataset
============================================================
Penn Action Dataset contains 2326 video sequences of 15 
different actions and human joint annotations for each 
sequence. The dataset is available for download via the
following link:

   https://upenn.box.com/PennAction

------------------------------------------------------------
                          Reference
------------------------------------------------------------
If you use our dataset, please cite the following paper:

Weiyu Zhang, Menglong Zhu and Konstantinos Derpanis,  "From 
Actemes to Action: A Strongly-supervised Representation for
Detailed Action Understanding" International Conference on 
Computer Vision (ICCV). Dec 2013.

------------------------------------------------------------
                       Dataset Content
------------------------------------------------------------
The dataset is organized in the following format:

    /frames  ( all image sequences )
       /0001 
          000001.jpg
          000002.jpg
          ...
       /0002
        ...
    /labels  ( all annotations )
        0001.mat
        0002.mat
        ...
    /tools   ( visualization scripts )
        visualize.m
        ...

The image frames are located in the /frames folder.  All 
frames are in RGB. The resolution of the frames are within
the size of 640x480.

The annotations are in the /labels folder. The sequence
annotations include class label, coarse viewpoint, human 
body joints (2D locations and visibility), 2D bounding 
boxes, and training/testing label. Each annotation is a 
separate MATLAB .mat file under /labels.


An example annotation looks as follows in MATLAB:

    annotation = 

            action: 'tennis_serve'
              pose: 'back'
                 x: [46x13 double]
                 y: [46x13 double]
        visibility: [46x13 logical]
             train: 1
              bbox: [46x4 double]
        dimensions: [272 481 46]
           nframes: 46

------------------------------------------------------------
                       List of Actions
------------------------------------------------------------
baseball_pitch  clean_and_jerk  pull_ups  strumming_guitar  
baseball_swing  golf_swing      push_ups  tennis_forehand   
bench_press     jumping_jacks   sit_ups   tennis_serve
bowling         jump_rope       squats    

------------------------------------------------------------
                      Annotation Tools
------------------------------------------------------------
The annotation tool used in creating this dataset is also
available. Please refer to http://dreamdragon.github.io/vatic/
for more details.

------------------------------------------------------------
                           Contact
------------------------------------------------------------
Please direct any questions regarding the dataset to

Menglong Zhu <[email protected]>

http://cis.upenn.edu/~menglong

pennaction's People

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.