①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.
Good job!
I'm wondering whether I can test my own model with Q-Bench offline.
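For reference, here is a minimal sketch of what an offline A1 (perception) evaluation loop could look like. The field names (`img_path`, `question`, `candidates`, `correct_ans`) are assumptions about the released LLVisionQA-style annotation format, and `my_model.answer()` is a hypothetical stand-in for your own MLLM inference call; adjust both to your actual setup.

```python
import json

# Hypothetical offline A1 (perception) evaluation loop.
# The JSON field names below are assumptions about the llvisionqa_dev
# annotation format; adjust them to match the file you downloaded.
def evaluate_a1(annotation_path, image_root, my_model):
    with open(annotation_path) as f:
        items = json.load(f)

    correct = 0
    for item in items:
        # Build a multiple-choice prompt from the question and its candidates.
        prompt = item["question"] + "\n" + "\n".join(
            f"{chr(ord('A') + i)}. {opt}"
            for i, opt in enumerate(item["candidates"])
        )
        # `my_model.answer` is a placeholder for your own MLLM call.
        prediction = my_model.answer(
            image=f"{image_root}/{item['img_path']}",
            prompt=prompt,
        )
        if prediction.strip() == item["correct_ans"]:
            correct += 1

    return correct / len(items)
```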
I've checked the A1 annotation JSON file and found an unknown field (which may be related to the sub-types: distortion / others / in-context distortion / ...).
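While that field is undocumented, a quick way to check whether it matches the sub-types is to count its distinct values across the A1 annotations. This is only a sketch: the key name `"concern"` and the file name `llvisionqa_dev.json` are placeholders for whatever the unknown field and annotation file are actually called in your copy.

```python
import json
from collections import Counter

# Count the distinct values of an undocumented field in the A1 annotation
# file to see whether they line up with the distortion / others /
# in-context distortion sub-types. "concern" is a placeholder key name.
with open("llvisionqa_dev.json") as f:
    items = json.load(f)

print(Counter(item.get("concern", "<missing>") for item in items))
```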
Also, the golden descriptions of A2 have not yet been released.
I'm wondering what the difference is between the two subsets llvisionqa_dev and llvisionqa_test. I can't find any clues in the paper or on your project page.