facebookresearch / mmbt Goto Github PK

Supervised Multimodal Bitransformers for Classifying Images and Text

License: Other

Python 100.00%

mmbt's Introduction

MultiModal BiTransformers (MMBT)

Introduction

MMBT is the accompanying code repository for the paper titled, "Supervised Multimodal Bitransformers for Classifying Images and Text" by Douwe Kiela, Suvrat Bhooshan, Hamed Firooz, Ethan Perez and Davide Testuggine.

The goal of the repository is to provide an implementation of the MMBT model and replicate the experiments in the paper.

Paper Link: https://arxiv.org/abs/1909.02950

Getting Started

Setup Enviroment

PyTorch version >= 1.0.0
Python version >= 3.6
pip install torch torchvision sklearn pytorch-pretrained-bert numpy tqdm matplotlib

Model Training

train.py provides the common training pipeline for all datasets.

task: mmimdb, food101, vsnli
model: bow, img, concatbow, bert, concatbert, mmbt

The following paths need to be set to start training.

data_path: Assumes a subfolder for each dataset.
savedir: Location to save model checkpoints.
glove_path: Path to glove embeds file. Needed for bow, concatbow models.

Example command:

python mmbt/train.py --batch_sz 4 --gradient_accumulation_steps 40 \
 --savedir /path/to/savedir/ --name mmbt_model_run \
 --data_path /path/to/datasets/ \
 --task food101 --task_type classification \
 --model mmbt --num_image_embeds 3 --freeze_txt 5 --freeze_img 3  \
 --patience 5 --dropout 0.1 --lr 5e-05 --warmup 0.1 --max_epochs 100 --seed 1

MMBT in Transformers

MMBT is also available in HuggingFace Transformers. See https://github.com/huggingface/transformers/tree/master/examples/research_projects/mm-imdb for an example that shows how easy it is to run MMBT in that framework.

License

MMBT is licensed under Creative Commons-Non Commercial 4.0. See the LICENSE file for details.

Citation

Please cite it as follows

@article{kiela2019supervised,
  title={Supervised Multimodal Bitransformers for Classifying Images and Text},
  author={Kiela, Douwe and Bhooshan, Suvrat and Firooz, Hamed and Testuggine, Davide},
  journal={arXiv preprint arXiv:1909.02950},
  year={2019}
}

mmbt's People

Contributors

Stargazers

Watchers

mmbt's Issues

train.jsonl for foo101

Downloaded UPMC_Food101, and trying to run the code for the food101 experiment. It seems the code needs json files for the dataset (which is not provided by dataset itself). How can I get them?

$ python mmbt/train.py --batch_sz 4 --gradient_accumulation_steps 40  --savedir mmbt/ --name mmbt_model_run  --data_path mmbt/data/  --task food101 --task_type classification  --model mmbt --num_image_embeds 3 --freeze_txt 5 --freeze_img 3   --patience 5 --dropout 0.1 --lr 5e-05 --warmup 0.1 --max_epochs 100 --seed 1
Traceback (most recent call last):
  File "mmbt/train.py", line 280, in <module>
    cli_main()
  File "mmbt/train.py", line 272, in cli_main
    train(args)
  File "mmbt/train.py", line 183, in train
    train_loader, val_loader, test_loaders = get_data_loaders(args)
  File "/home/fb/mmbt/data/helpers.py", line 113, in get_data_loaders
    os.path.join(args.data_path, args.task, "train.jsonl")
  File "/home/fb/mmbt/data/helpers.py", line 40, in get_labels_and_frequencies
    data_labels = [json.loads(line)["label"] for line in open(path)]
FileNotFoundError: [Errno 2] No such file or directory: 'mmbt/data/food101/train.jsonl'

Run on GQA dataset

Hello
Do you have an instructions to run MMBT on the GQA dataset?
I've tried to make changes in the repo but it doesn't seem to work.

I've tried to run with this CMD:
python mmbt/train.py --batch_sz 4 --gradient_accumulation_steps 40 --savedir /users/yonatan/mmbt/savedir --name mmbt_model_run --data_path /users/yonatan/gqa_data_and_images_dir --task food101 --task_type classification --model mmbt --num_image_embeds 3 --freeze_txt 5 --freeze_img 3 --patience 5 --dropout 0.1 --lr 5e-05 --warmup 0.1 --max_epochs 100 --seed 1
Thank you

Error is follows, please could you help me prepare "test_hard_gt.jsonl".

python mmbt/train.py --batch_sz 4 --gradient_accumulation_steps 40 --savedir savedir --name mmbt_model_run --data_path datasets --task food101 --task_type classification --model mmbt --num_image_embeds 3 --freeze_txt 5 --freeze_img 3 --patience 5 --dropout 0.1 --lr 5e-05 --warmup 0.1 --max_epochs 100 --seed 1
Traceback (most recent call last):
  File "mmbt/train.py", line 280, in <module>
    cli_main()
  File "mmbt/train.py", line 272, in cli_main
    train(args)
  File "mmbt/train.py", line 183, in train
    train_loader, val_loader, test_loaders = get_data_loaders(args)
  File "E:\mmbt\mmbt\data\helpers.py", line 197, in get_data_loaders
    args,
  File "E:\mmbt\mmbt\data\dataset.py", line 23, in __init__
    self.data = [json.loads(l) for l in open(data_path)]
FileNotFoundError: [Errno 2] No such file or directory: 'datasets\\food101\\test_hard_gt.jsonl'