OpenVQA is a general platform for visual question answering (VQA) research. It implements state-of-the-art approaches (e.g., BUTD, MFH, BAN and MCAN) on several benchmark datasets, including VQA-v2, GQA and CLEVR.
This project is released under the Apache 2.0 license.
Supported methods and benchmark datasets are shown in the table below. Results and models are available in MODEL_ZOO.md.
| | VQA-v2 | GQA | CLEVR |
|---|---|---|---|
| BUTD | ✓ | | |
| MFB | ✓ | | |
| MFH | ✓ | | |
| BAN | ✓ | | |
| MCAN | ✓ | | |
Please refer to INSTALL.md for installation and dataset preparation.
Please see GETTING_STARTED.md for the basic usage of OpenVQA.
We appreciate all contributions that improve OpenVQA. Please refer to CONTRIBUTING.md for the contribution guidelines.
This repo is currently maintained by Zhou Yu (@yuzcccc) and Yuhao Cui (@cuiyuhao1996).