Giter Club home page Giter Club logo

msaf's Introduction

MSAF: Multimodal Split Attention Fusion

arXiv PWC

Code for the paper MSAF: Multimodal Split Attention Fusion. This is our implementation of the MSAF module and the three MSAF-powered multimodal networks.

If you use this code, please cite our paper:

@misc{su2020msaf,
      title={MSAF: Multimodal Split Attention Fusion}, 
      author={Lang Su and Chuqing Hu and Guofa Li and Dongpu Cao},
      year={2020},
      eprint={2012.07175},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Installation

Clone this repo along with submodules

git clone --recurse-submodules https://github.com/anita-hu/MSAF.git

Install dependencies

Method 1: Using environment.yml (installs dependencies for all three datasets)

With this method, you can skip dependency installation steps from the dataset specific README files

conda env create -f environment.yml

Method 2: Without environment.yml

This code was developed with Python 3.6, PyTorch 1.7.1 in Ubuntu 20.04.

  • Basic dependencies (needed for all datasets): Pytorch, Tensorboard
  • Dataset specific dependencies: see README file in each dataset folder

Usage

  • The MSAF module is implemented in MSAF.py
  • The README file in each dataset folder has details on data preprocessing, training and evaluation (pretrained weights are available)

msaf's People

Contributors

anita-hu avatar kevinsu628 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

msaf's Issues

need 2 optimizers?

Hello.
There are 2 optimizers in the training phase, and it seems to learn the pose stream and apperance stream alternately.
Is there a reason why you didn't learn to cross-entropy loss using only one optimizer after combining the results of the pose stream and appearance stream?
Thank you for always.

not run

CMU_MOSEI not run.error:ModuleNotFoundError: No module named ‘mmsdk‘

problem

When i run "ravdess",the error is "Expected object of scalar type Double but got scalar type Float for argument #3 'mat1' in call to th_addmm"

About code issues

Hello, can the code in Figure 3 in the paper be published? I want to see a section of that code. Thank you

Eval accuracy slightly different

Hi. Thank you so much for providing useful code.

When I tested NTU-RGBD datset through the command provided
'python main_msaf.py --datadir <path/to/NTU>
--checkpointdir checkpoints
--test_cp msaf_ntu_epoch12_92.24.checkpoint
--no_bad_skel'

, the performance was slightly different and 91.38 instead of 92.24.
Am I missing something? Thank you.

about code

Hello, can you give me a copy of the complete code of Resnext,thank you

error

With this address,I have the problem:
Traceback (most recent call last):
File "dataset_prep.py", line 130, in
dataset = md.mmdataset(recipe)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/dataset/dataset.py", line 28, in init
self.computational_sequences[entry]=computational_sequence(address,destination)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/computational_sequence/computational_sequence.py", line 32, in init
h5handle,data,metadata=self.initialize(resource,destination)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/computational_sequence/computational_sequence.py", line 92, in initialize
return readCSD(self.mainFile)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/computational_sequence/file_ops.py", line 14, in readCSD
log.error("%s file not found, please check the path ..."%resource,error=True)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/log/log.py", line 18, in error
action("%s%s"%(msgstart,msgstring))
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/log/log.py", line 17, in
action,msgstart=(lambda x:( for _ in ()).throw(errorType(x)),": ") if error else (lambda x:sys.stdout.write(str(x)+'\n'),bcolors.WARNING+bcolors.BOLD+": "+bcolors.ENDC)
File "/root/autodl-tmp/MSAF-master/cmu_mosei/mmsdk/mmdatasdk/log/log.py", line 17, in
action,msgstart=(lambda x:( for _ in ()).throw(errorType(x)),": ") if error else (lambda x:sys.stdout.write(str(x)+'\n'),bcolors.WARNING+bcolors.BOLD+": "+bcolors.ENDC)
RuntimeError: : ./CMU_MOSEI/csd/CMU_MOSEI_VisualFacet42.csd file not found, please check the path ...

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.