
alphaction's Introduction

AlphAction

AlphAction aims to detect the actions of multiple persons in videos. It is the first open-source project that achieves 30+ mAP (32.4 mAP) with a single model on the AVA dataset.

This project is the official implementation of paper Asynchronous Interaction Aggregation for Action Detection (ECCV 2020), authored by Jiajun Tang*, Jin Xia* (equal contribution), Xinzhi Mu, Bo Pang, Cewu Lu (corresponding author).



Demo Video

AlphAction demo video [YouTube] [BiliBili]

Installation

You first need to install this project; please check INSTALL.md for instructions.

Data Preparation

To do training or inference on the AVA dataset, please check DATA.md for data preparation instructions. If you have difficulty accessing Google Drive, you can instead find most files (including models) on Baidu NetDisk ([link], code: smti).

Model Zoo

Please see MODEL_ZOO.md for downloading models.

Training and Inference

To do training or inference with AlphAction, please refer to GETTING_STARTED.md.

Demo Program

To run the demo program on a video or webcam, please check the demo folder. We select 15 common categories from the 80 action categories of AVA and provide a practical model which achieves high accuracy (about 70 mAP) on these categories.
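
For example, a typical invocation (the input/output paths here are placeholders; the flags mirror the commands users post in the issues further down this page) looks like:

python demo.py --video-path input.mp4 --output-path output.mp4 --cfg-path ../config_files/resnet101_8x8f_denseserial.yaml --weight-path ../data/models/aia_models/resnet101_8x8f_denseserial.pth

Add --common-cate to use the 15-category model, or replace --video-path with --webcam to read from a camera instead of a video file.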

Acknowledgement

We thankfully acknowledge the computing resource support of Huawei Corporation for this project.

Citation

If this project helps you in your research or project, please cite this paper:

@inproceedings{tang2020asynchronous,
  title={Asynchronous Interaction Aggregation for Action Detection},
  author={Tang, Jiajun and Xia, Jin and Mu, Xinzhi and Pang, Bo and Lu, Cewu},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2020}
}

alphaction's People

Contributors

yelantf


alphaction's Issues

ImportError: libtorch_cpu.so

Thanks for sharing!
I installed as prompted by the installation script.
But when I run demo.py, I got the following error:

import AlphAction.custom_ext as _C
ImportError: libtorch_cpu.so: cannot open shared object file: No such file or directory

Can you help me?

Training code

Great work! May I ask when the authors plan to open-source the training code?

customize dataset

Hello, I have a few questions.

  1. I'd like to know why we don't use 'person_bbox' during training.

  2. I don't know what "# disable box_file when train, use only gt to train" means. Isn't the box the ground truth?

  3. I want to customize the dataset. My action instances usually last about 3 seconds with varying lengths, but the AVA clips are one second long. This is also the first time I have come across the concept of keyframes: why should I take the first frame as the keyframe, and can it represent the action classification of the whole clip? I am very confused. Can you give me the general idea?

Looking forward to your reply.

Fine-Tuning

Hi @Fang-Haoshu ,

Thank you for sharing your amazing work. I wanted to ask whether you will be making fine-tuning code available soon? It would be really helpful for my project as well! Thanks

Question about the config files

Hi, thanks again for sharing the great work. I'm running the training code and found that the schedule in the config files is slightly different from the one on the GETTING_STARTED page (https://github.com/MVIG-SJTU/AlphAction/blob/master/GETTING_STARTED.md#training). The base learning rate and training iterations are not consistent (even after adjusting the schedule according to the linear scaling rule). I'm just wondering which version is correct for reproducing the results with the current codebase? Thanks!
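
For readers unfamiliar with the linear scaling rule mentioned above: the usual convention is that when the total batch size is divided by a factor k, the base learning rate is also divided by k and the step/iteration counts are multiplied by k. The sketch below is only a generic illustration (the reference schedule of batch size 16 / LR 0.001 is hypothetical, not the project's official numbers); the rescaled values happen to match the example training command quoted in a later issue on this page.

# Generic illustration of the linear scaling rule; the reference numbers are
# hypothetical, not the official AlphAction schedule.
def rescale_schedule(base_lr, steps, max_iter, ref_batch, new_batch):
    """Scale the LR linearly with batch size and stretch the schedule inversely."""
    k = new_batch / ref_batch
    return base_lr * k, tuple(int(s / k) for s in steps), int(max_iter / k)

lr, steps, max_iter = rescale_schedule(
    base_lr=0.001, steps=(70000, 90000), max_iter=110000,
    ref_batch=16, new_batch=2)
print(lr, steps, max_iter)  # -> 0.000125 (560000, 720000) 880000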

install setup.py ERROR

Hi! Thank you for the project. When I install with setup.py I get an error, shown below. I hope you can help me.

g++: error: /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/ROIAlign3d_cuda.o: No such file or directory
error: command 'g++' failed with exit status 1
(alphaction) zhangzhenbo@zhangzhenbo-TUF-Gaming-FX505GM-FX86FM:~/AlphAction$ g++ --version
g++ (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0
Copyright (C) 2017 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

(alphaction) zhangzhenbo@zhangzhenbo-TUF-Gaming-FX505GM-FX86FM:~/AlphAction$ python setup.py install
running install
running bdist_egg
running egg_info
writing alphaction.egg-info/PKG-INFO
writing dependency_links to alphaction.egg-info/dependency_links.txt
writing requirements to alphaction.egg-info/requires.txt
writing top-level names to alphaction.egg-info/top_level.txt
reading manifest file 'alphaction.egg-info/SOURCES.txt'
writing manifest file 'alphaction.egg-info/SOURCES.txt'
installing library code to build/bdist.linux-x86_64/egg
running install_lib
running build_py
running build_ext
building 'alphaction._custom_cuda_ext' extension
Emitting ninja build file /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
1.10.1
g++ -pthread -shared -B /home/zhangzhenbo/anaconda3/envs/alphaction/compiler_compat -L/home/zhangzhenbo/anaconda3/envs/alphaction/lib -Wl,-rpath=/home/zhangzhenbo/anaconda3/envs/alphaction/lib -Wl,--no-as-needed -Wl,--sysroot=/ /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/vision.o /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/SoftmaxFocalLoss_cuda.o /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.o /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/ROIPool3d_cuda.o /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/ROIAlign3d_cuda.o -L/home/zhangzhenbo/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/lib -L/usr/local/cuda-10.1/lib64 -lc10 -ltorch -ltorch_cpu -ltorch_python -lcudart -lc10_cuda -ltorch_cuda -o build/lib.linux-x86_64-3.7/alphaction/_custom_cuda_ext.cpython-37m-x86_64-linux-gnu.so
g++: error: /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/SoftmaxFocalLoss_cuda.o: No such file or directory
g++: error: /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.o: No such file or directory
g++: error: /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/ROIPool3d_cuda.o: No such file or directory
g++: error: /home/zhangzhenbo/AlphAction/build/temp.linux-x86_64-3.7/home/zhangzhenbo/AlphAction/alphaction/csrc/cuda/ROIAlign3d_cuda.o: No such file or directory
error: command 'g++' failed with exit status 1

How to track target people?

Hi,
Thank you for providing this very nice method.

I have one question about AIA: how do you associate a person in one frame with the same person in the next frame?
This method seems to learn temporal information about each person in the video. If you can, please tell me how people are tracked and where this is implemented.

/SigmoidFocalLoss_cuda.cu error

When I run pip install -e ., I get an error. How can I solve it?

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(132): error: a pointer to a bound function may only be used to call the function

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(132): error: type name is not allowed

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(132): error: expected an expression

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(178): error: a pointer to a bound function may only be used to call the function

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(178): error: type name is not allowed

/home/action_detection/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(178): error: expected an expression

42 errors detected in the compilation of "/tmp/tmpxft_000017b7_00000000-6_SigmoidFocalLoss_cuda.cpp1.ii".
error: command '/usr/local/cuda/bin/nvcc' failed with exit status 1

Problem in training with custom dataset

I'm trying to train the AlphAction model with a custom dataset.
To train, I'm running the following code:

python train_net.py --config-file "path/to/config/file.yaml" \
    --transfer --no-head --use-tfboard \
    SOLVER.BASE_LR 0.000125 \
    SOLVER.STEPS '(560000, 720000)' \
    SOLVER.MAX_ITER 880000 \
    SOLVER.VIDEOS_PER_BATCH 2 \
    TEST.VIDEOS_PER_BATCH 2

I'm getting the error:

loading annotations into memory...
Done (t=0.00s)
Loading box file into memory...
Done (t=0.00s)
loading annotations into memory...
Done (t=0.00s)
Loading box file into memory...
Done (t=0.00s)
Loading box file into memory...
Done (t=0.00s)
2020-11-04 19:54:09,398 alphaction.trainer INFO: Start training
Traceback (most recent call last):
File "./AlphAction/train_net.py", line 245, in
main()
File "./AlphAction/train_net.py", line 234, in main
model = train(cfg, args.local_rank, args.distributed, tblogger, args.transfer_weight, args.adjust_lr, args.skip_val,
File "./AlphAction/train_net.py", line 84, in train
do_train(
File "./AlphAction/alphaction/engine/trainer.py", line 40, in do_train
for iteration, (slow_video, fast_video, boxes, objects, extras, _) in enumerate(data_loader, start_iter):
File "./AlphAction/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 435, in next
data = self._next_data()
File "./AlphAction/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1085, in _next_data
return self._process_data(data)
File "./AlphAction/venv/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1111, in _process_data
data.reraise()
File "./AlphAction/venv/lib/python3.8/site-packages/torch/_utils.py", line 428, in reraise
raise self.exc_type(msg)
File "av/utils.pyx", line 27, in av.utils.AVError.init
TypeError: init() takes at least 3 positional arguments (2 given)

I've made ~130 action records in my dataset (I know that's not many); I have annotated custom videos and gone through all the steps described in DATA.md.

I've got the following dataset directory structure, which is pretty much the same as AVA, so that I don't have to change the code written for AVA.

data/AVA
├── annotations
│   ├── ava_action_list_v2.2_for_activitynet_2019.pbtxt
│   ├── ava_action_list_v2.2.pbtxt
│   ├── ava_file_names_trainval_v2.1.txt
│   ├── ava_include_timestamps_v2.2.txt
│   ├── ava_train_excluded_timestamps_v2.2.csv
│   ├── ava_train_v2.2.csv
│   ├── ava_train_v2.2.json
│   ├── ava_train_v2.2_min.json
│   ├── ava_val_excluded_timestamps_v2.2.csv
│   ├── ava_val_v2.2.csv
│   ├── ava_val_v2.2.json
│   └── ava_val_v2.2_min.json
├── boxes
│   ├── ava_train_det_object_bbox.json
│   ├── ava_val_det_object_bbox.json
│   └── ava_val_det_person_bbox.json
├── clips
│   └── trainval_old
│   ├── conv_1-1-56-576 [46 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-24_12-23-17 [108 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-34-08 [92 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-37-09 [88 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-39-56 [91 entries exceeds filelimit, not opening dir]
│   └── conv_cam1_2020-10-25_15-41-48 [124 entries exceeds filelimit, not opening dir]
├── keyframes
│   └── trainval
│   ├── conv_1-1-56-576 [46 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-24_12-23-17 [108 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-34-08 [92 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-37-09 [88 entries exceeds filelimit, not opening dir]
│   ├── conv_cam1_2020-10-25_15-39-56 [91 entries exceeds filelimit, not opening dir]
│   └── conv_cam1_2020-10-25_15-41-48 [124 entries exceeds filelimit, not opening dir]
└── movies
└── trainval
├── conv_1-1-56-576.mp4
├── conv_cam1_2020-10-24_12-23-17.mp4
├── conv_cam1_2020-10-25_15-34-08.mp4
├── conv_cam1_2020-10-25_15-37-09.mp4
├── conv_cam1_2020-10-25_15-39-56.mp4
└── conv_cam1_2020-10-25_15-41-48.mp4

Thanks in advance for your reply.
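
The AVError raised inside the data loader above suggests that PyAV failed to decode one of the clips. A quick sanity check (a rough sketch; the clip directory below is a placeholder for your own layout) is to try opening every clip with PyAV directly and list the files that fail:

# Hypothetical sanity check: try to decode the first frame of every clip with PyAV
# (the same library the data loader uses) and report the files that fail.
import glob
import av

bad_files = []
for path in sorted(glob.glob("data/AVA/clips/trainval/*/*")):
    try:
        container = av.open(path)
        next(container.decode(video=0))  # decode the first video frame
        container.close()
    except Exception as exc:
        bad_files.append((path, repr(exc)))

for path, err in bad_files:
    print(path, err)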

Question about the mem_active flag in the memory pool

Thank you very much for open-sourcing this work. I have two questions about the code and hope to hear back from you.
Problem description: in trainer.py I found that if mem_active is False, the memory update is never performed. When I set the IA_STRUCTURE parameter to True in the cfg file, I get the following error:
Traceback (most recent call last):
File "train_net.py", line 246, in
main()
File "train_net.py", line 235, in main
model = train(cfg, args.local_rank, args.distributed, tblogger, args.transfer_weight, args.adjust_lr, args.skip_val,args.no_head)
File "train_net.py", line 58, in train
extra_checkpoint_data = checkpointer.load(cfg.MODEL.WEIGHT, model_weight_only=transfer_weight,adjust_scheduler=adjust_lr, no_head=no_head)
File "/home/mmsys8/disk/CH/Alphaction/AlphAction-master/alphaction/utils/checkpoint.py", line 61, in load
self._load_model(checkpoint, no_head)
File "/home/mmsys8/disk/CH/Alphaction/AlphAction-master/alphaction/utils/checkpoint.py", line 110, in _load_model
load_state_dict(self.model, checkpoint.pop("model"), no_head)
File "/home/mmsys8/disk/CH/Alphaction/AlphAction-master/alphaction/utils/model_serialization.py", line 83, in load_state_dict
model.load_state_dict(model_state_dict)
File "/home/mmsys8/anaconda3/envs/alphaction/lib/python3.5/site-packages/torch/nn/modules/module.py", line 830, in load_state_dict
self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for ActionDetector:
size mismatch for roi_heads.action.feature_extractor.fc1.weight: copying a param with shape torch.Size([1024, 2304]) from checkpoint, the shape in current model is torch.Size([1024, 2816]).
Questions:
1. If the mem_active flag is False, does that mean the AMU algorithm from the paper cannot be used?
2. How can I make sure the AMU algorithm is actually used? How should the parameters be set so that the above error is fixed?

Thanks again for your work; I look forward to your reply.
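
The size mismatch above presumably happens because enabling IA_STRUCTURE changes the input size of roi_heads.action.feature_extractor.fc1 (2816 vs. 2304), so the released checkpoint no longer matches the model. I am not sure how the official checkpointer is meant to handle this, but a generic PyTorch workaround, sketched below with a hypothetical helper, is to drop the mismatched entries and load the rest with strict=False:

# Generic PyTorch sketch (not the project's own checkpoint loader): load a
# checkpoint while skipping parameters whose shapes no longer match the model.
import torch

def load_partial_state_dict(model, ckpt_path):
    checkpoint = torch.load(ckpt_path, map_location="cpu")
    state_dict = checkpoint.get("model", checkpoint)
    model_state = model.state_dict()
    filtered = {k: v for k, v in state_dict.items()
                if k in model_state and v.shape == model_state[k].shape}
    missing, unexpected = model.load_state_dict(filtered, strict=False)
    print("skipped (shape mismatch or unknown):", sorted(set(state_dict) - set(filtered)))
    print("left at random init:", missing)

Layers skipped this way (such as fc1 here) would start from random initialization, so some fine-tuning would still be needed.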

Installation error

python: 3.7.1
ninja: 1.7.2
pytorch: 1.5.0
cudatoolkit: 10.2.89
torchvision: 0.6.0

When running 'pip install -e .', I get an error.

Requirement already satisfied: PyYAML in c:\programdata\anaconda3\envs\py37action\lib\site-packages (from yacs->alphaction==0.0.0) (5.3.1)
Installing collected packages: alphaction
Running setup.py develop for alphaction
ERROR: Command errored out with exit status 1:
command: 'C:\ProgramData\Anaconda3\envs\py37action\python.exe' -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'D:\IMAGEprocess\program\AlphAction-master\setup.py'"'"'; file='"'"'D:\IMAGEprocess\program\AlphAction-master\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' develop --no-deps
cwd: D:\IMAGEprocess\program\AlphAction-master\

39 errors detected in the compilation of "C:/Users/LINGJU~1/AppData/Local/Temp/tmpxft_0000395c_00000000-10_SigmoidFocalLoss_cuda.cpp1.ii".
SigmoidFocalLoss_cuda.cu
ninja: build stopped: subcommand failed.
Traceback (most recent call last):
File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\torch\utils\cpp_extension.py", line 1400, in _run_ninja_build
check=True)
File "C:\ProgramData\Anaconda3\envs\py37action\lib\subprocess.py", line 481, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "D:\IMAGEprocess\program\AlphAction-master\setup.py", line 121, in <module>
    cmdclass={"build_ext": torch.utils.cpp_extension.BuildExtension},
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\setuptools\__init__.py", line 153, in setup
    return distutils.core.setup(**attrs)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\core.py", line 148, in setup
    dist.run_commands()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\dist.py", line 966, in run_commands
    self.run_command(cmd)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\dist.py", line 985, in run_command
    cmd_obj.run()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\setuptools\command\develop.py", line 34, in run
    self.install_for_development()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\setuptools\command\develop.py", line 136, in install_for_development
    self.run_command('build_ext')
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\cmd.py", line 313, in run_command
    self.distribution.run_command(command)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\dist.py", line 985, in run_command
    cmd_obj.run()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\setuptools\command\build_ext.py", line 79, in run
    _build_ext.run(self)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\Cython\Distutils\old_build_ext.py", line 186, in run
    _build_ext.build_ext.run(self)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\command\build_ext.py", line 339, in run
    self.build_extensions()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\torch\utils\cpp_extension.py", line 580, in build_extensions
    build_ext.build_extensions(self)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\Cython\Distutils\old_build_ext.py", line 195, in build_extensions
    _build_ext.build_ext.build_extensions(self)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\command\build_ext.py", line 448, in build_extensions
    self._build_extensions_serial()
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\command\build_ext.py", line 473, in _build_extensions_serial
    self.build_extension(ext)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\setuptools\command\build_ext.py", line 196, in build_extension
    _build_ext.build_extension(self, ext)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\distutils\command\build_ext.py", line 533, in build_extension
    depends=ext.depends)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\torch\utils\cpp_extension.py", line 562, in win_wrap_ninja_compile
    with_cuda=with_cuda)
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\torch\utils\cpp_extension.py", line 1140, in _write_ninja_file_and_compile_objects
    error_prefix='Error compiling objects for extension')
  File "C:\ProgramData\Anaconda3\envs\py37action\lib\site-packages\torch\utils\cpp_extension.py", line 1413, in _run_ninja_build
    raise RuntimeError(message)
RuntimeError: Error compiling objects for extension

failed to Install

The following is the error message when I type python setup.py build develop.
detector/nms/src/nms_cuda.cpp:4:80: error: ‘AT_CHECK’ was not declared in this scope
#define CHECK_CUDA(x) AT_CHECK(x.type().is_cuda(), #x, " must be a CUDAtensor ")
^
detector/nms/src/nms_cuda.cpp:9:3: note: in expansion of macro ‘CHECK_CUDA’
CHECK_CUDA(dets);
^
error: command 'gcc' failed with exit status 1

Problem: Excessive graphics card memory usage

My operating environment: RTX 3090, CUDA 11.0, PyTorch 1.7.1.
When I run AlphAction, my graphics card uses up to 8 GB of memory.
Is it possible to modify resnet101_8x8f_denseserial.yaml to reduce this?
If so, which configuration items would have an effect?

Good morning, author! When I use the webcam it runs for a moment and then throws an error

(alphaction) zhangzhenbo@zhangzhenbo-TUF-Gaming-FX505GM-FX86FM:~/AlphAction/demo$ python demo.py --webcam --output-path ../output/Joe_Biden.avi --cfg-path ../config_files/resnet101_8x8f_denseserial.yaml --weight-path ../data/models/aia_models/resnet101_8x8f_denseserial.pth
Starting webcam demo, press Ctrl + C to terminate...
Loading action model weight from ../data/models/aia_models/resnet101_8x8f_denseserial.pth.
Action model weight successfully loaded.
Loading YOLO model..
Network successfully loaded
Loading tracking model..
Network successfully loaded
Showing tracking progress bar (in fps). Other processes are running in the background.
Tracker Progress: 148 frame [00:27, 5.16 frame/s]Process Process-2:
Tracker Progress: 149 frame [00:27, 5.55 frame/s]Traceback (most recent call last):
File "/home/zhangzhenbo/anaconda3/envs/alphaction/lib/python3.7/multiprocessing/process.py", line 297, in _bootstrap
self.run()
File "/home/zhangzhenbo/anaconda3/envs/alphaction/lib/python3.7/multiprocessing/process.py", line 99, in run
self._target(*self._args, **self._kwargs)
File "/home/zhangzhenbo/AlphAction/demo/action_predictor.py", line 427, in _compute_prediction
transform_randoms)
File "/home/zhangzhenbo/AlphAction/demo/action_predictor.py", line 118, in update_feature
assert timestamp > self.mem_timestamps[-1], "features are expected to be updated in order."
AssertionError: features are expected to be updated in order.
Tracker Progress: 636 frame [01:35, 7.16 frame/s]

Does this mean I can't run with the webcam for a long time?

Re-train with set of different labels

Thanks so much for sharing.

Would it be possible, using the training procedure along with custom videos and custom annotations, to re-train the model to detect different actions like shooting, running, and tackling?

Cheers

How to install AlphAction without cuda?

I want to use AlphAction on my MacBook Pro, and I get errors when running the command 'pip install -e .'.
The error is as follows:

Obtaining file:///Users/u/project/AlphAction
    ERROR: Complete output from command python setup.py egg_info:
    ERROR: Compiling detector/nms/src/soft_nms_cpu.pyx because it changed.
    [1/1] Cythonizing detector/nms/src/soft_nms_cpu.pyx
    Traceback (most recent call last):
      File "<string>", line 1, in <module>
      File "/Users/u/project/AlphAction/setup.py", line 107, in <module>
        ext_modules=get_extensions(),
      File "/Users/u/project/AlphAction/setup.py", line 92, in get_extensions
        sources=['src/nms_cpu.cpp']),
      File "/Users/u/project/AlphAction/setup.py", line 42, in make_cuda_ext
        '-D__CUDA_NO_HALF2_OPERATORS__',
      File "/Users/u/miniconda3/lib/python3.7/site-packages/torch/utils/cpp_extension.py", line 779, in CUDAExtension
        library_dirs += library_paths(cuda=True)
      File "/Users/u/miniconda3/lib/python3.7/site-packages/torch/utils/cpp_extension.py", line 869, in library_paths
        if (not os.path.exists(_join_cuda_home(lib_dir)) and
      File "/Users/u/miniconda3/lib/python3.7/site-packages/torch/utils/cpp_extension.py", line 1783, in _join_cuda_home
        raise EnvironmentError('CUDA_HOME environment variable is not set. '
    OSError: CUDA_HOME environment variable is not set. Please set it to your CUDA install root.
    ----------------------------------------
ERROR: Command "python setup.py egg_info" failed with error code 1 in /Users/u/project/AlphAction/

Too slow on test Stage 2

Thank you for sharing!

I tried to run testing under the 'serial' configuration, but it seems quite slow at stage 2: over ten minutes for each batch. I'm confused about this.

I use 2 GPUs and set VIDEOS_PER_BATCH to 16.

I wonder whether the VIDEOS_PER_BATCH value is too big for my setup? Does anything else influence the inference time?

Video inference is not satisfactory

Thanks, this is a very good project. I used demo.py on a single GPU to run inference on a video (output at output.mp4). However, the person detection boxes are very jittery and inaccurate in many places. How can I modify the code to get results like your publicity video?

No GPU?

Can this project run on low-compute devices with no GPU? I want to run this on a Raspberry Pi 4B; is it possible? What are the minimum system requirements and expected performance?

A question about 'ava_train_det_person_bbox.json'

Hello author, I'm trying to train with a custom dataset.
I built my dataset with the same directory structure as the AVA dataset. But as a rookie I am confused about whether I need to replace 'ava_train_det_person_bbox.json'.
Thanks in advance for your reply.

What are the selected common categories?

Great job. For the demo, may I know what the selected common categories are? Thanks a lot.
" We select 15 common categories from the 80 action categories of AVA, and provide a practical model which achieves high accuracy (about 70 mAP) on these categories."

When I test the model, I got this error.

Traceback (most recent call last):
File "test_net.py", line 8, in
from alphaction.modeling.detector import build_detection_model
File "/home/leo/AlphAction/alphaction/modeling/detector/init.py", line 1, in
from .action_detector import build_detection_model
File "/home/leo/AlphAction/alphaction/modeling/detector/action_detector.py", line 3, in
from ..backbone import build_backbone
File "/home/leo/AlphAction/alphaction/modeling/backbone/init.py", line 1, in
from .backbone import build_backbone
File "/home/leo/AlphAction/alphaction/modeling/backbone/backbone.py", line 2, in
from . import slowfast, i3d
File "/home/leo/AlphAction/alphaction/modeling/backbone/slowfast.py", line 6, in
from alphaction.modeling.common_blocks import ResNLBlock
File "/home/leo/AlphAction/alphaction/modeling/common_blocks.py", line 2, in
from alphaction.modeling.nonlocal_block import NLBlock
File "/home/leo/AlphAction/alphaction/modeling/nonlocal_block.py", line 6, in
from alphaction.layers import FrozenBatchNorm3d
File "/home/leo/AlphAction/alphaction/layers/init.py", line 3, in
from .roi_align_3d import ROIAlign3d
File "/home/leo/AlphAction/alphaction/layers/roi_align_3d.py", line 7, in
import alphaction._custom_cuda_ext as _C
ModuleNotFoundError: No module named 'alphaction._custom_cuda_ext'
Traceback (most recent call last):
File "/home/leo/anaconda3/envs/mmaction2/lib/python3.6/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/home/leo/anaconda3/envs/mmaction2/lib/python3.6/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/home/leo/anaconda3/envs/mmaction2/lib/python3.6/site-packages/torch/distributed/launch.py", line 263, in
main()
File "/home/leo/anaconda3/envs/mmaction2/lib/python3.6/site-packages/torch/distributed/launch.py", line 259, in main
cmd=cmd)
subprocess.CalledProcessError: Command '['/home/leo/anaconda3/envs/mmaction2/bin/python', '-u', 'test_net.py', '--local_rank=0', '--config-file', 'config_files/resnet50_4x16f_baseline.yaml', 'MODEL.WEIGHT', 'Models/resnet50_4x16f_baseline.pth']' returned non-zero exit status 1.

real time test on video

Hi, the video processing time depends not only on the "detect_rate" parameter but also on the video size and the number of persons detected in the video. Would you please share the video you tested with me? Thanks a lot.

Failed to run demo.py after loading the tracking model network

Thank you for your great work! I succeeded in downloading and installing it; however, after starting the command to run demo.py:
python demo.py --video-path test_F.mp4 --output-path ../data/output/output.mp4 --cfg-path ../data/config/resnet101_8x8f_denseserial.yaml --weight-path ../data/models/common_15cat_res101.pth --common-cate

The program fails to run further and the terminal output is shown below:
Starting video demo, video path: test_F.mp4
Loading YOLO model..
Network successfully loaded
Loading tracking model..
0it [00:00, ?it/s]Network successfully loaded
2331it [09:27, 3.20it/s]Traceback (most recent call last):
File "demo.py", line 173, in
main()
File "demo.py", line 136, in main
(orig_img, boxes, scores, ids) = ava_predictor_worker.read_track()
File "/home/user/AlphAction/demo/action_predictor.py", line 260, in read_track
return self.track_queue.get()
File "/home/user/anaconda3/envs/action_det/lib/python3.7/multiprocessing/queues.py", line 113, in get
return _ForkingPickler.loads(res)
File "/home/user/anaconda3/envs/action_det/lib/python3.7/site-packages/torch/multiprocessing/reductions.py", line 294, in rebuild_storage_fd
fd = df.detach()
File "/home/user/anaconda3/envs/action_det/lib/python3.7/multiprocessing/resource_sharer.py", line 58, in detach
return reduction.recv_handle(conn)
File "/home/user/anaconda3/envs/action_det/lib/python3.7/multiprocessing/reduction.py", line 185, in recv_handle
return recvfds(s, 1)[0]
File "/home/user/anaconda3/envs/action_det/lib/python3.7/multiprocessing/reduction.py", line 161, in recvfds
len(ancdata))
RuntimeError: received 0 items of ancdata
2331it [09:28, 4.10it/s]

Can you give me some direction on how to solve this? Many thanks!
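
'RuntimeError: received 0 items of ancdata' is a generic PyTorch multiprocessing symptom, usually hit when the process runs out of open file descriptors for tensors shared between workers. Two standard mitigations (general PyTorch advice, not something specific to this repository) are raising the open-file limit (ulimit -n) or switching the tensor sharing strategy, e.g.:

# General PyTorch workaround (not specific to AlphAction): share tensors through
# the file system instead of file descriptors, avoiding fd exhaustion.
import torch.multiprocessing as mp

if __name__ == "__main__":
    mp.set_sharing_strategy("file_system")
    # ... start the demo / worker processes as usual ...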

A question about person_box and action_predictor

Wonderful job; as a researcher in the same field, I would like to express my appreciation to the authors.
I have one question about the "compute_prediction" part of "action_predictor.py". The inputs to this calculation are the most recent frames (something like self.frame_stack = self.frame_stack[-self.frame_buffer_numbers:]) and only the box of the center frame; is it assumed that the pedestrian has little displacement across the input frames? I wonder whether adding the exact box of each frame (which can be extracted from the tracking results) would make the final result better (for motions with a large range of movement, like hitting or fighting), or have I misunderstood the process?

how is the inference speed?

Hi, I'm running the demo and found that the speed is about 5 fps; is that correct? I'm running the demo on one RTX 2070 GPU. Thanks.

113it [00:19, 4.20it/s]

about feature extractor

Thank you for sharing! I want to get the person and object features after RoIAlign (and save them to disk), but I don't know how to do this. Should I set part_forward to 0? Can you give me some hints? Thanks!
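
I'm not certain what part_forward controls in this codebase, but a generic way to dump the output of any intermediate module (for example the RoIAlign / feature-extractor layer) to disk is a forward hook. A sketch with a hypothetical module path and forward signature:

# Generic PyTorch sketch: save a named submodule's output to disk via a forward hook.
import torch

def dump_module_output(model, module_name, inputs, out_path):
    """Run one forward pass and save the named submodule's output (assumed to be a tensor)."""
    captured = []
    module = dict(model.named_modules())[module_name]
    handle = module.register_forward_hook(
        lambda mod, inp, out: captured.append(out.detach().cpu()))
    with torch.no_grad():
        model(*inputs)
    handle.remove()
    torch.save(captured, out_path)

# usage (hypothetical module path and inputs):
# dump_module_output(model, "roi_heads.action.feature_extractor",
#                    (slow_video, fast_video, boxes, objects, extras), "roi_features.pt")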

training code

Thank you for sharing your amazing work. I would like to know if there is a specific release date for the training code? I've been looking forward to it for a long time.

result of AVA and visualizer seems wrong

Hi, I followed the debug steps in issue #8 and found that the results of the prediction and the visualizer do not seem correct:


Loading tracking model..
0it [00:00, ?it/s]Network successfully loaded
644it [00:58, 11.79it/s]Wait for feature preprocess
The input queue is empty. Start working on prediction
0%| | 0/20 [00:00<?, ?it/s]>> predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)

predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
0%| | 2/21673 [00:00<18:05, 19.96it/s]>> predictions:
BoxList(num_boxes=2, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
25%|██████████████████████████████████████▊ | 5/20 [00:00<00:00, 40.95it/s]>> predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 21673/21673 [00:00<00:00, 142009.99it/s]
End of video loader
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=2, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
predictions:
BoxList(num_boxes=2, image_width=960, image_height=448, mode=xyxy)
50%|█████████████████████████████████████████████████████████████████████████████ | 10/20 [00:00<00:00, 42.29it/s]{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
75%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████▌ | 15/20 [00:00<00:00, 43.73it/s]visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
visualizer
{} None tensor([[1.],
[2.]])
predictions:
BoxList(num_boxes=1, image_width=960, image_height=448, mode=xyxy)
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 20/20 [00:00<00:00, 44.40it/s]
Prediction is done.
visualizer
Wait for writer process to finish...
{} None tensor([[1.],
[2.]])
visualizer | 11/645 [00:00<?, ?it/s]
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer | 15/645 [00:00<00:18, 33.53it/s]
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer | 19/645 [00:00<00:18, 33.00it/s]
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer
{} None tensor([[1.],
[2.]])
visualizer▍ | 23/645 [00:00<00:18, 32.92it/s]
{} None tensor([[1.]])
visualizer
{} None tensor([[1.]])
visualizer
{} None tensor([[1.]])
visualizer
{} None tensor([[1.]])
visualizer█▍ | 27/645 [00:00<00:18, 32.63it/s]
{} None tensor([[1.]])
visualizer
{} None tensor([[1.]])
visualizer██▎ | 31/645 [00:00<00:19, 31.82it/s]
{} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} tensor([[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan]]) tensor([1.])
visualizer███████▊ | 54/645 [00:01<00:16, 36.27it/s]
{1: {'captions': [], 'bg_colors': []}} tensor([[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan]]) tensor([1.])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer████████▊ | 58/645 [00:01<00:16, 35.29it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer██████████▉ | 67/645 [00:01<00:15, 37.14it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer███████████▊ | 71/645 [00:01<00:15, 37.45it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer████████████████ | 89/645 [00:02<00:13, 40.48it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer█████████████████▎ | 94/645 [00:02<00:13, 39.95it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer███████████████████████ | 119/645 [00:02<00:13, 39.85it/s]
{1: {'captions': [], 'bg_colors': []}} tensor([[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan]]) tensor([1.])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer██████████████████████████████ | 149/645 [00:03<00:11, 44.51it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer███████████████████████████████▎ | 154/645 [00:03<00:11, 41.80it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer████████████████████████████████▍ | 159/645 [00:03<00:11, 41.57it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer█████████████████████████████████▋ | 164/645 [00:03<00:11, 41.41it/s]
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.],
[3.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} tensor([[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan],
[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan]]) tensor([1., 4.])
visualizer██████████████████████████████████▊ | 169/645 [00:04<00:11, 39.88it/s]
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[1.],
[4.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[1.],
[4.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[4.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[4.]])
visualizer████████████████████████████████████ | 174/645 [00:04<00:12, 39.15it/s]
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} None tensor([[1.]])
visualizer█████████████████████████████████████████▉ | 199/645 [00:04<00:10, 44.47it/s]
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}} tensor([[nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
nan, nan, nan, nan, nan, nan, nan, nan]]) tensor([1.])
...
...
...
visualizer███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▉ | 615/645 [00:16<00:00, 36.55it/s]
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}, 6: {'captions': [], 'bg_colors': []}, 8: {'captions': [], 'bg_colors': []}, 11: {'captions': [], 'bg_colors': []}} None tensor([[11.]])
visualizer
{1: {'captions': [], 'bg_colors': []}, 4: {'captions': [], 'bg_colors': []}, 6: {'captions': [], 'bg_colors': []}, 8: {'captions': [], 'bg_colors': []}, 11: {'captions': [], 'bg_colors': []}} None tensor([[11.]])
load frame closed████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 645/645 [00:16<00:00, 38.23it/s]
write frame closed
Avaworker stopped

You can download the source video I used from this link:

https://pan.baidu.com/s/17stN1qdQA4g36RhyG-_H1w code: p5wp


When I run it on a video it's not so good!!

Hi author, I ran this demo but it doesn't look as good as your demo!
1. I used resnet101_8x8f_denseserial.yaml and resnet101_8x8f_denseserial.pth.
2. When I used common_15cat_res101.pth it was even worse.

So is the reason that I have to pre-train it myself? I would also like to know the picture size used in your demo.

Segmentation fault (core dumped)

Hi, when I run the training command on a single GPU, it shows Segmentation fault (core dumped). I have modified the multiprocess setting to 0, but it still remains the same. Could you please help to address the problem?
And when I run the training command on multiple GPUs, it shows another fault:
subprocess.CalledProcessError: Command '***(the command)' died with <Signals.SIGSEGV: 11>.
My environment is:
Python 3.7.6
PyTorch 1.3.1 built for Cuda 10.0
Cuda runtime version 10.0.
Thanks.

Besides, when I run demo.py, it stays in the state 'Tracker Progress: 1004 frame [02:42, 6.14 frame/s]' for a long time; is that normal?

A problem when running demo.py

Hi,
Thanks a lot for your open-source code. When I run demo.py, I encounter a problem.

(alphaction) [zhuxt@localhost demo]$ python demo.py --video-path "/home/zhuxt/workspace/actlyzer-app/exec/videos/10_persons_v2.mp4" --output-path output.mp4 --cfg-path /home/zhuxt/workspace/AlphAction/config_files/resnet101_8x8f_denseserial.yaml --weight-path "/home/zhuxt/workspace/AlphAction-master/author_models/common_15cat_res101.pth" --common-cate
Starting video demo, video path: /home/zhuxt/workspace/actlyzer-app/exec/videos/10_persons_v2.mp4
Loading action model weight from /home/zhuxt/workspace/AlphAction-master/author_models/common_15cat_res101.pth.
Action model weight successfully loaded.
Loading YOLO model..
Network successfully loaded
Loading tracking model..
Network successfully loaded
Showing tracking progress bar (in fps). Other processes are running in the background.
Tracker Progress: 1624 frame [03:05, 7.75 frame/s]Process Process-4:
Traceback (most recent call last):
File "/home/zhuxt/anaconda3/envs/alphaction/lib/python3.7/multiprocessing/process.py", line 297, in _bootstrap
self.run()
File "/home/zhuxt/anaconda3/envs/alphaction/lib/python3.7/multiprocessing/process.py", line 99, in run
self._target(*self._args, **self._kwargs)
File "/home/zhuxt/workspace/AlphAction/demo/action_predictor.py", line 434, in _compute_prediction
center_timestamp, video_size, ids = self.timestamps[timestamp_idx]
IndexError: list index out of range

As you can see from the log, it runs fine on the earlier frames, but at the 1624th frame the index goes out of range.
I have tested with several videos, and the same issue arises.

Installation error

I initialized everything according to INSTALL.md. When running 'pip install -e .', there is an error.
Installing collected packages: alphaction
Attempting uninstall: alphaction
Found existing installation: alphaction 0.0.0
Can't uninstall 'alphaction'. No files were found to uninstall.
Running setup.py develop for alphaction
ERROR: Command errored out with exit status 1:
command: /home/xianjin/anaconda3/envs/alphaction/bin/python -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/data/action_recognition/AlphAction/setup.py'"'"'; file='"'"'/data/action_recognition/AlphAction/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' develop --no-deps
cwd: /data/action_recognition/AlphAction/
Complete output (107 lines):
running develop
running egg_info
writing alphaction.egg-info/PKG-INFO
writing dependency_links to alphaction.egg-info/dependency_links.txt
writing requirements to alphaction.egg-info/requires.txt
writing top-level names to alphaction.egg-info/top_level.txt
reading manifest file 'alphaction.egg-info/SOURCES.txt'
writing manifest file 'alphaction.egg-info/SOURCES.txt'
running build_ext
building 'alphaction.custom_cuda_ext' extension
gcc -pthread -B /home/xianjin/anaconda3/envs/alphaction/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -DWITH_CUDA -I/data/action_recognition/AlphAction/alphaction/csrc -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/TH -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/xianjin/anaconda3/envs/alphaction/include/python3.7m -c /data/action_recognition/AlphAction/alphaction/csrc/vision.cpp -o build/temp.linux-x86_64-3.7/data/action_recognition/AlphAction/alphaction/csrc/vision.o -DTORCH_API_INCLUDE_EXTENSION_H -DTORCH_EXTENSION_NAME=custom_cuda_ext -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++11
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
/usr/local/cuda/bin/nvcc -DWITH_CUDA -I/data/action_recognition/AlphAction/alphaction/csrc -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/TH -I/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda/include -I/home/xianjin/anaconda3/envs/alphaction/include/python3.7m -c /data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu -o build/temp.linux-x86_64-3.7/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.o -D__CUDA_NO_HALF_OPERATORS
-D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --compiler-options '-fPIC' -O3 -DCUDA_HAS_FP16=1 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -DTORCH_API_INCLUDE_EXTENSION_H -DTORCH_EXTENSION_NAME=_custom_cuda_ext -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++11
/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/ATen/cuda/NumericLimits.cuh(83): warning: calling a constexpr __host__ function("from_bits") from a __host__ __device__ function("lowest") is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this.

/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/ATen/cuda/NumericLimits.cuh(84): warning: calling a constexpr __host__ function("from_bits") from a __host__ __device__ function("max") is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this.

/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/ATen/cuda/NumericLimits.cuh(85): warning: calling a constexpr __host__ function("from_bits") from a __host__ __device__ function("lower_bound") is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this.

/home/xianjin/anaconda3/envs/alphaction/lib/python3.7/site-packages/torch/include/ATen/cuda/NumericLimits.cuh(86): warning: calling a constexpr __host__ function("from_bits") from a __host__ __device__ function("upper_bound") is not allowed. The experimental flag '--expt-relaxed-constexpr' can be used to allow this.

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(128): error: a pointer to a bound function may only be used to call the function

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(128): error: type name is not allowed

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(128): error: expected an expression

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(173): error: a pointer to a bound function may only be used to call the function

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(173): error: type name is not allowed

/data/action_recognition/AlphAction/alphaction/csrc/cuda/SigmoidFocalLoss_cuda.cu(173): error: expected an expression

42 errors detected in the compilation of "/tmp/tmpxft_00004211_00000000-6_SigmoidFocalLoss_cuda.cpp1.ii".
error: command '/usr/local/cuda/bin/nvcc' failed with exit status 1
----------------------------------------

ERROR: Can't roll back alphaction; was not uninstalled
ERROR: Command errored out with exit status 1: /home/xianjin/anaconda3/envs/alphaction/bin/python -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/data/action_recognition/AlphAction/setup.py'"'"'; file='"'"'/data/action_recognition/AlphAction/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(file);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, file, '"'"'exec'"'"'))' develop --no-deps Check the logs for full command output.
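
Not a definitive diagnosis, but nvcc failures like the ones above often trace back to a mismatch between the CUDA toolkit that nvcc uses and the CUDA version PyTorch was built against (or an unsupported host compiler). A quick sanity check of the environment, assuming nothing about the specific cause here:

import torch

# Compare the CUDA version PyTorch was built with against the toolkit under
# /usr/local/cuda used by nvcc; a mismatch is a common cause of extension
# build failures.
print("torch version:      ", torch.__version__)
print("built against CUDA: ", torch.version.cuda)
print("cuDNN version:      ", torch.backends.cudnn.version())
print("GPU available:      ", torch.cuda.is_available())

Run nvcc --version separately and make sure the two CUDA versions agree before rebuilding the extension.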

cuda error?

"cuda runtime error(48):no kernel image is available for execution on the device at ./alphaction/csrc/cuda/ROIAlign3d_cuda.cu:300"
Please explain how to solve this error, thanks.
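
A hedged note rather than an official answer: "no kernel image is available for execution on the device" usually means the extension was compiled for GPU architectures that do not include the card it runs on. Checking the device's compute capability makes it easy to rebuild with a matching TORCH_CUDA_ARCH_LIST:

import torch

# Print the compute capability of every visible GPU.  The CUDA extension can
# then be rebuilt with a matching TORCH_CUDA_ARCH_LIST (for example "6.1" or
# "7.5") before running setup.py again.
for i in range(torch.cuda.device_count()):
    major, minor = torch.cuda.get_device_capability(i)
    name = torch.cuda.get_device_name(i)
    print(f"GPU {i}: {name} -> compute capability {major}.{minor} (sm_{major}{minor})")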

How to get the pretrained model of ResNet-34

Thanks for your interesting work!
I want a lighter AlphAction model, so I would like to change the backbone from ResNet-50 to ResNet-34.
However, I found that training the AlphAction model from scratch is hard to converge.
So I think I need a pretrained ResNet-34 model; I have already downloaded the Kinetics-700 dataset.
Could you share some information on how to obtain the pretrained model on Kinetics-700?
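
Not the authors' procedure, but one common bootstrap when no 3D ResNet-34 checkpoint is at hand is to inflate 2D ImageNet ResNet-34 weights along the temporal axis (the I3D trick) and then pre-train on Kinetics-700 before AVA fine-tuning. A minimal sketch, assuming torchvision is installed; the resulting keys still use torchvision naming and would have to be remapped to this project's backbone parameters:

import torch
import torchvision

def inflate_resnet34(time_dim: int = 3) -> dict:
    """Inflate 2D ImageNet ResNet-34 weights into a 3D-conv state dict.

    Each (out, in, h, w) conv kernel is repeated `time_dim` times along a new
    temporal dimension and divided by `time_dim` so activations keep roughly
    the same scale.  Keys still follow torchvision naming and must be remapped
    to the 3D backbone's parameter names before loading.
    """
    # Newer torchvision uses the `weights=` argument instead of `pretrained=True`.
    state_2d = torchvision.models.resnet34(pretrained=True).state_dict()
    state_3d = {}
    for name, w in state_2d.items():
        if w.dim() == 4:  # 2D conv weight
            w = w.unsqueeze(2).repeat(1, 1, time_dim, 1, 1) / time_dim
        state_3d[name] = w
    return state_3d

if __name__ == "__main__":
    torch.save(inflate_resnet34(), "resnet34_inflated_imagenet.pth")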

Questions about the training schedule

Thanks for sharing the great work! I'm trying to reproduce the baseline result (ResNet-50) in PyTorch following the schedule you provided. However, I only get 21.4% mAP, much lower than the 26.5% reported in the paper. I have a few questions as follows.

  1. As the training schedule is reported as "iterations" in paper and the codebase, do you have any idea how many "epochs" it is roughly equivalent to? I used 10 epochs in my experiments.

  2. The learning rate used in this paper (0.004 for clip_size 64) is quite small compared with other papers (e.g., in LFB, 0.04 for clip_size 16). It seems the model is not sufficiently trained with this small learning rate after 10 epochs. I'm wondering whether I've misunderstood something here. I tried using base_lr=0.008 and got 23.2% mAP.

Again, thanks for your work and it'll be great if you could help me with this problem. My training schedule is summarized here: (Max Epochs: 10, Base_lr: 0.004, Batch_size: 64, Lr_decay: at 6 / 8 epochs)
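
For the first question, a back-of-the-envelope conversion between iterations and epochs only needs the training-set size and the batch size. The clip count below is an assumption to be replaced with the actual number of keyframes in the split, and the iteration totals are placeholders to be read from the config actually used:

# Rough iterations <-> epochs conversion for question 1.
train_clips = 211_000   # assumption: number of training keyframe clips; replace with your split size
batch_size = 64         # clips per iteration, as in the schedule above
iters_per_epoch = train_clips / batch_size

for total_iters in (90_000, 120_000):  # placeholder schedules; use the MAX_ITER value from your config
    print(f"{total_iters:>7d} iterations ~= {total_iters / iters_per_epoch:.1f} epochs")

# For question 2, the usual linear-scaling heuristic applies: if the batch
# size changes, scale the base learning rate proportionally,
# e.g. lr = 0.004 * (new_batch_size / 64).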

A Problem when running demo

Hi, I met a problem when trying to run demo.py; it seems that something is missing. Could you please provide a solution? Thanks.

import AlphAction.custom_ext as _C
ModuleNotFoundError: No module named 'AlphAction.custom_ext'
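
A hedged check rather than a fix: this error usually means the compiled extension was never built, or was built in a different environment. The module name below is copied from the traceback; if the import fails, re-running the build step from INSTALL.md (python setup.py build develop) in the active environment is the usual remedy:

import importlib

module_name = "AlphAction.custom_ext"  # name taken from the traceback above
try:
    ext = importlib.import_module(module_name)
    print(f"{module_name} found at {ext.__file__}")
except ModuleNotFoundError as exc:
    # The extension is missing: rebuild it with `python setup.py build develop`
    # inside the same conda environment used to run demo.py.
    print(f"Extension not installed: {exc}")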

pytorch 1.5+ support

This line of code caused an error:

error: ‘AT_CHECK’ was not declared in this scope

It should be updated to TORCH_CHECK.
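
A small sketch of that update, assuming the extension sources live under alphaction/csrc as in the build logs above; it rewrites the removed AT_CHECK macro to TORCH_CHECK across the C++/CUDA files:

from pathlib import Path

# Replace the AT_CHECK macro (removed in PyTorch 1.5) with TORCH_CHECK in all
# C++/CUDA sources of the extension.
csrc_dir = Path("alphaction/csrc")  # adjust if your checkout is laid out differently
for pattern in ("*.cu", "*.cuh", "*.cpp", "*.h"):
    for src in csrc_dir.rglob(pattern):
        text = src.read_text()
        if "AT_CHECK" in text:
            src.write_text(text.replace("AT_CHECK", "TORCH_CHECK"))
            print(f"patched {src}")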

multi-thread problem

Hello, I encountered a problem when I executed demo.py.

The error log is as follows:

Starting video demo, video path: ../kf001.mp4
after Initialise Visualizer @@
multiprocessing.set_start_method @@
torch.multiprocessing.set_sharing_strategy @@
count() @@ count(0)
Loading YOLO model..
yolo self.model_cfg ../detector/yolo/cfg/yolov3-spp.cfg
yolo self.model_weights ../data/models/detector_models/yolov3-spp.weights
self.model.net_info-height, 608
Network successfully loaded
args.gpus [0] <class 'list'>
args.device cuda <class 'torch.device'>

model_weight_url @@ ../data/models/aia_models/resnet101_8x8f_denseserial.pth
Loading tracking model..
after AVAPredictorWorker @@
0it [00:00, ?it/s]Network successfully loaded
644it [00:56, 11.33it/s]Wait for feature preprocess
The input queue is empty. Start working on prediction
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 21673/21673 [00:00<00:00, 145254.70it/s]
End of video loader
50%|█████████████████████████████████████████████████████████████████████████████ | 10/20 [00:00<00:00, 43.17it/s]/home/xa/miniconda3/lib/python3.6/multiprocessing/semaphore_tracker.py:143: UserWarning: semaphore_tracker: There appear to be 1 leaked semaphores to clean up at shutdown
len(cache))
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 20/20 [00:00<00:00, 43.97it/s]
Prediction is done.
Wait for writer process to finish...

Exception in thread Thread-1: | 92/645 [00:02<00:16, 32.77it/s]
Traceback (most recent call last):
File "/home/xa/miniconda3/lib/python3.6/threading.py", line 916, in _bootstrap_inner
self.run()
File "/home/xa/.local/lib/python3.6/site-packages/tqdm/_monitor.py", line 62, in run
for instance in self.tqdm_cls._instances:
File "/home/xa/miniconda3/lib/python3.6/_weakrefset.py", line 60, in iter
for itemref in self.data:
RuntimeError: Set changed size during iteration

100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 645/645 [00:18<00:00, 34.42it/s]
write frame closed
load frame closed
Avaworker stopped


Would you please tell me how to fix it? Thank you!
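
A hedged observation: the traceback points at tqdm's background monitor thread ("Set changed size during iteration"), not at the demo pipeline itself, and the output video is usually written despite it. Upgrading tqdm, or disabling the monitor thread before any progress bar is created, is the commonly suggested workaround:

from tqdm import tqdm

# Disable tqdm's monitor thread, which is what iterates over tqdm._instances
# and races with progress bars created in other processes/threads.
tqdm.monitor_interval = 0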
