pomonam / attentioncluster Goto Github PK

TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".

License: Apache License 2.0

Python 100.00%

attention-clusters attention video-classification

attentioncluster's People

Contributors

Stargazers

Watchers

Forkers

kwangjinoh xiaoyu5301 maikuraky ammieqi hnn123 cvsaber dreadlord1984 subhankar21

attentioncluster's Issues

Train on other video datasets

Hi,

Thanks a lot for your code! Could you provide some general guidelines on how extracted video/audio features from any dataset should be prepared to train the frame-level AttentionClusterModule? Thanks. :)

Is there only one attention map for a single modality?

Hello Juhan Bae,

This repo for Attention Cluster is nice, after reading the corresponding paper I have a question about the number of clusters. It seems that you implement the attention cluster with only one attention map for each modality, while in the official paper there are many clusters(many attention maps) for each modality (rgb & audio), so maybe a loop should be introduced in class AttentionClusterModel() in frame_level_model.py ? :- )

Best wishes,
Skye

Redundant normalisation

Hi,

I believe that the following line is unnecessary:

AttentionCluster/frame_level_models.py

Line 87 in 7dd11bc

transformed_activation = tf.nn.l2_normalize(transformed_activation, 1)

Maybe L101 is unnecessary as well. In my understanding, the paper proposes normalisation only right after the shifting operation. So, I believe that only the normalisation in L99 is useful.

I can understand the normalisation in L101 as is common to normalise the whole vector and not only within the cluster (similarly to VLAD), but L87 seems pretty redundant.

Let me know what you think. Thanks.

Questions about this paper.

The article mentions that "when we train the model, we can randomly sample a part of the features from the local feature set, but use all the features during testing." Why can we use only a part of the features for training, but use all the features during testing?

Thank you!

there is an audio feature issue

I run your code and download dataset from kaggle manually but I got this error :

tensorflow.python.framework.errors_impl.InvalidArgumentError: Name: , Feature list 'audio' is required but could not be found.  Did you mean to include it in feature_list_dense_missing_assumed_empty or feature_list_dense_defaults?
	 [[{{node train_input/ParseSingleSequenceExample_1/ParseSingleSequenceExample}}]]

How can I fix this issue?

pomonam / attentioncluster Goto Github PK

attentioncluster's People

Contributors

Stargazers

Watchers

Forkers

attentioncluster's Issues

Train on other video datasets

Is there only one attention map for a single modality?

Redundant normalisation

Questions about this paper.

there is an audio feature issue

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent