HighMMT

HighMMT is a general-purpose model for high-modality (large number of modalities beyond the prototypical language, visual, and acoustic modalities) and partially-observable (across many tasks, where each task is defined only over a small subset of all modalities we are interested in modeling) scenarios.

HighMMT uses multitask learning with shared unimodal and multimodal layers to enable stable parameter counts (addressing scalability) and cross-modal transfer learning to enable information sharing across modalities and tasks (addressing partial observability).

The same HighMMT model (architecture and parameters) is able to simultaneously encode joint representations between different subsets spanning images, text, audio, sets, time-series, and graphs.

Paper

High-Modality Multimodal Transformer: Quantifying Modality & Interaction Heterogeneity for High-Modality Representation Learning
Paul Pu Liang, Yiwei Lyu, Xiang Fan, Shentong Mo, Dani Yogatama, Louis-Philippe Morency, Ruslan Salakhutdinov
TMLR 2022.

If you find this repository useful, please cite our paper:

@article{liang2022high,
  title={High-Modality Multimodal Transformer: Quantifying Modality \& Interaction Heterogeneity for High-Modality Representation Learning},
  author={Liang, Paul Pu and Lyu, Yiwei and Fan, Xiang and Tsaw, Jeffrey and Liu, Yudong and Mo, Shentong and Yogatama, Dani and Morency, Louis-Philippe and Salakhutdinov, Russ},
  journal={Transactions on Machine Learning Research},
  year={2022}
}

Contributors

Correspondence to:

Usage

Environment Setup Using Conda

conda env create -f env_HighMMT.yml

Quick Start

The instructions for running the code and retrieving the data can be found by typing

./run.sh help

You can also find detailed instructions below.

Data Download

Three datasets (robotics, enrico, and RTFM) can be set up directly using the script ./download_datasets.sh. Run

./download_datasets.sh help

for instructions. To set up each dataset, run ./download_datasets.sh followed by the dataset name. For example,

./download_datasets.sh robotics

downloads the robotics dataset to the directory datasets/robotics.

This repo is built on top of the MultiBench repository, so to download the datasets, follow the same instructions as in MultiBench (https://github.com/pliang279/MultiBench.git).

Easy setting experiment code

From the root of this repo, run

python private_test_scripts/perceivers/roboticstasks.py model.pt

The model will be saved to model.pt.

Medium setting experiment code

To run medium tasks, please run

python private_test_scripts/perceivers/medium_tasks.py

Hard setting experiment code

To run multitask training on 1/2/3/4 tasks, please run

python private_test_scripts/perceivers/singletask.py
python private_test_scripts/perceivers/twomultitask.py
python private_test_scripts/perceivers/threemultitask.py
python private_test_scripts/perceivers/fourmultitask.py
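
The four commands above can also be launched in sequence from a small driver. The following is a minimal sketch, not part of the repository: the helper names build_commands and run_all are hypothetical, and it assumes the scripts take no command-line arguments (none are shown above) and that it is called from the root of this repo.

```python
import subprocess
import sys
from pathlib import Path

# Script paths exactly as listed in this README, ordered by number of tasks.
HARD_SETTING_SCRIPTS = [
    "private_test_scripts/perceivers/singletask.py",
    "private_test_scripts/perceivers/twomultitask.py",
    "private_test_scripts/perceivers/threemultitask.py",
    "private_test_scripts/perceivers/fourmultitask.py",
]


def build_commands(python=sys.executable):
    """Return one argv list per script, using the given interpreter."""
    return [[python, script] for script in HARD_SETTING_SCRIPTS]


def run_all(repo_root="."):
    """Run each script from the repo root, stopping at the first failure."""
    root = Path(repo_root)
    for cmd in build_commands():
        # Fail early with a clear message if a script path does not exist.
        if not (root / cmd[1]).is_file():
            raise FileNotFoundError(f"{cmd[1]} not found under {root.resolve()}")
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(cmd, cwd=root, check=True)
```

Calling run_all() from the repo root runs the single-task script first and the four-task script last. Because check=True aborts on the first non-zero exit code, a failure in an early run is not hidden by later ones.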

Parameter Sharing Experiments

To run the parameter sharing experiments, please run

python private_test_scripts/perceivers/shared_fourmulti.py

A baseline can be trained as a starting point for finetuning by running fourmultitask.py as described above. You can specify the baseline in shared_fourmulti.py.

Parameter groupings can also be specified in the shared_fourmulti.py file.

Heterogeneity Matrix

To get the heterogeneity matrix between individual modalities and pairs of modalities, please run

python private_test_scripts/perceivers/tasksim.py

highmmt's Issues

Nonexistent perceiver_pytorch.

Hello! When I run python private_test_scripts/perceivers/singletask.py, I get the error

Traceback (most recent call last):
  File "private_test_scripts/perceivers/singletask.py", line 5, in <module>
    from private_test_scripts.perceivers.crossattnperceiver import MultiModalityPerceiver, InputModality
  File "/run/determined/workdir/irad_users/smithk/nlp/HighMMT/HighMMT/private_test_scripts/perceivers/crossattnperceiver.py", line 9, in <module>
    from perceiver_pytorch.caching import cache_by_name_fn
ModuleNotFoundError: No module named 'perceiver_pytorch.caching'

Could you push the perceiver_pytorch module?
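
A quick way to narrow down this kind of error is to ask the import system which of the two names it can locate. This is a minimal sketch using only the standard library; module_available is a hypothetical helper name, and the script does not say where the missing module is supposed to come from.

```python
import importlib.util


def module_available(name):
    """Return True if the import system can locate `name`."""
    try:
        return importlib.util.find_spec(name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (or one of the parent's own imports)
        # cannot be found while resolving a dotted name.
        return False


# Names taken from the traceback in this issue.
for name in ("perceiver_pytorch", "perceiver_pytorch.caching"):
    status = "found" if module_available(name) else "missing"
    print(f"{name}: {status}")
```

If the package is reported as found but the caching submodule is missing, the installed perceiver_pytorch probably does not contain that submodule. If both are missing, the package is not visible to the active environment at all.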

Problems while Running the code

As given in the documentation, I installed the dependencies and tried to run the code:

$ python private_test_scripts/perceivers/roboticstasks.py model.pt

Running that gives me a fannypack-related error. The full error is:

python private_test_scripts/perceivers/roboticstasks.py model.pt
Output will be model.pt
Traceback (most recent call last):
  File "/media/4TB_hardisk/sangam/HighMMT/private_test_scripts/perceivers/roboticstasks.py", line 31, in <module>
    trains3, valid3, test3 = PushTask.get_dataloader(
  File "/media/4TB_hardisk/sangam/HighMMT/datasets/gentle_push/data_loader.py", line 84, in get_dataloader
    train_trajectories = cls.get_train_trajectories(**dataset_args)
  File "/media/4TB_hardisk/sangam/HighMMT/datasets/gentle_push/data_loader.py", line 134, in get_train_trajectories
    return _load_trajectories("gentle_push_1000.hdf5", **dataset_args)
  File "/media/4TB_hardisk/sangam/HighMMT/datasets/gentle_push/data_loader.py", line 247, in _load_trajectories
    with fannypack.data.TrajectoriesFile(
  File "/home/sangam/anaconda3/envs/jtsaw/lib/python3.10/site-packages/fannypack/data/_trajectories_file.py", line 77, in __init__
    with self._h5py_file() as f:
  File "/home/sangam/anaconda3/envs/jtsaw/lib/python3.10/site-packages/fannypack/data/_trajectories_file.py", line 354, in _h5py_file
    return h5py.File(self._path, mode=mode, libver="latest")
  File "/home/sangam/anaconda3/envs/jtsaw/lib/python3.10/site-packages/h5py/_hl/files.py", line 562, in __init__
    fid = make_fid(name, mode, userblock_size, fapl, fcpl, swmr=swmr)
  File "/home/sangam/anaconda3/envs/jtsaw/lib/python3.10/site-packages/h5py/_hl/files.py", line 235, in make_fid
    fid = h5f.open(name, flags, fapl=fapl)
  File "h5py/_objects.pyx", line 54, in h5py._objects.with_phil.wrapper
  File "h5py/_objects.pyx", line 55, in h5py._objects.with_phil.wrapper
  File "h5py/h5f.pyx", line 102, in h5py.h5f.open
OSError: Unable to synchronously open file (file signature not found)
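
The final OSError says that h5py did not find an HDF5 signature in the file, which usually means the file on disk is not a complete HDF5 file (for example, a partial download or a saved error page). The sketch below checks the leading bytes directly. inspect_file is a hypothetical helper, and it only covers files without a user block, where the signature sits at offset 0.

```python
from pathlib import Path

# The 8-byte signature that begins an HDF5 file with no user block.
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def inspect_file(path):
    """Return (size_in_bytes, starts_with_hdf5_signature, first_bytes)."""
    p = Path(path)
    with p.open("rb") as f:
        # A short prefix is enough to tell HDF5 data from text or HTML.
        head = f.read(64)
    return p.stat().st_size, head.startswith(HDF5_SIGNATURE), head
```

Calling inspect_file on the local copy of gentle_push_1000.hdf5 (its location depends on the dataset setup and is not shown in the traceback) returns the file size, whether the signature is present, and the first bytes. A very small size or readable text in those bytes suggests that the download should be repeated.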
