Giter Club home page Giter Club logo

cs231_project's Introduction

CS231_Project

Project Organization:

/code: mostly code for preprocessing the data. * shotdetect (dependencies not included) * tubelets (dependencies not included) * optical flow /data: * directories where we reformat the video to be a collection of frames and create lists of frame to class_id. [video_name/%08d] [class_id] * allFrames is the only directory used for the final experiments.

/examples: * directories that contain Caffe files. Usually each directory contains its necessary Caffe .prototxt files

. * random assortment of files and notebooks that are not well organized. * optFlow0.py: files used to compute the optical flow at some point.

Data:

(Action Recognition)

  1. Training: entire UCF action data set. It consists of 101 human action categories with 13,320 videos in total. Each category has more than 100 video clips, all of which are temporally trimmed.

  2. Validation data: There are 1,000 videos in total provided for 101 action classes as validation data (each class has exactly 10 videos). In general, there is one primary action class shown in each video; however, some videos may include one or more instances from other action classes.

  3. Background data: 2,500 videos, which are verified to make sure they do not include an instance of any of the 101 action classes, are provided as background. Each video is relevant to one of the action classes. For instance, the background videos related to the ac-tion class “Basketball Dunk” may show the basketball court when the game is not being played.

  4. 1,574 temporally untrimmed videos are provided as the test data. Some videos may contain one or multiple instances from one or multiple action classes, and some videos may not include any actions from the 101 classes. A significant portion of the video may not include any particular action, and multiple instances may occur at differnet time stamps within the video.

(Action Detection)

  1. Training: A subset of UCF101 action dataset with 20 action classes is used for training. The remaining action classes in UCF101 may also be used for training, if desired.

  2. 200 videos from 20 action classes are provided as the validation set. These videos have the same properties as the validation videos in the recognition task. The validation set could be added to the training data, if desired.

  3. Background: same as before.

  4. Same format as validation.

Result format: (Action Recognition) [test_video_name_1] [confidence_score_class_1] [confidence_score_class_2] … [confidence_score_class_m]

for each of the m=101 test classes.

(Action Detection) [video_name] [starting_time] [ending_time] [class_label] [confidence_score]

A detector can fire multiple times in a test video (there may be multiple rows with the same video name)

Project goal: The project hopes to achieve two goals: Action Recognition and Temporal Action Detection.

Use a CNN trained on the training and validation data to output a real valued score indicating the confidence of the precense of each action class in a test video.

cs231_project's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.