jianzongwu / betrayed-by-captions Goto Github PK

View Code? Open in Web Editor NEW

43.0 6.0 2.0 16.95 MB

(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Home Page: https://arxiv.org/abs/2301.00805

Python 39.77% Jupyter Notebook 60.11% Shell 0.12%

iccv2023

betrayed-by-captions's People

Contributors

Stargazers

Watchers

Forkers

cv-seg whuhxb

betrayed-by-captions's Issues

Reproduce paper's scores

Hi everyone,

Is there anyone can reproduce accuracy scores in the paper?
I have tried to use 2 Nvidia Geforce 3090 but the scores were dropped by 5%.

How to achieve paper's accuracy?

Thanks.

Evaluate the performance of caption generation

Hi, can you tell me how to evaluate the performance of caption generation according to your code?

KeyError: 'metric bbox is not supported'

open_set里提供的coco_panoptic_open.py文件不支持bbox。

        allowed_metrics = ['PQ']
        for metric in metrics:
            if metric not in allowed_metrics:
                raise KeyError(f'metric {metric} is not supported')

Demo notebook fails at inferenceDetector

The demo notebook fails with a cuda error, possibly relating to incorrect matrix shapes during multiplications. I'm using the exact same torch/cuda/mmcv/mmdet versions as listed in the README, on a Quaddro RTX 8000 GPU.

The stack trace I get after running the cell

result = inference_detector(model, img, with_caption=True, logging=True)[0]

---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
Cell In[8], [line 2](vscode-notebook-cell:?execution_count=8&line=2)
      [1](vscode-notebook-cell:?execution_count=8&line=1) # Predict segmentation results, as well as image captions
----> [2](vscode-notebook-cell:?execution_count=8&line=2) result = inference_detector(model, img, with_caption=True, logging=True)[0]
...
--> [360](~/miniforge3/envs/cgg/lib/python3.8/site-packages/torch/functional.py:360) return _VF.einsum(equation, operands)

RuntimeError: CUDA error: CUBLAS_STATUS_INVALID_VALUE when calling `cublasSgemmStridedBatched( handle, opa, opb, m, n, k, &alpha, a, lda, stridea, b, ldb, strideb, &beta, c, ldc, stridec, num_batches)

Have you encountered this before? And any way to fix this?

About code release

Great Job!
When will your group release the code?

Could you release pre-trained weiths with evaluation code?

I want to use this model as a baseline to generate pseudo labels for my task.

Apply for open_set/datasets/build_dataloader.py

Hi, thanks for releasing your great work. However, when I run "python tools/train.py" to try to reproduce, an error appears as follows:
ImportError: cannot import name 'build_dataloader' from 'open_set.datasets'
It seems that the build_dataloader.py function is missing, could you kindly provide the missing file? Thanks a lot.

jianzongwu / betrayed-by-captions Goto Github PK

betrayed-by-captions's People

Contributors

Stargazers

Watchers

Forkers

betrayed-by-captions's Issues

Reproduce paper's scores

Evaluate the performance of caption generation

KeyError: 'metric bbox is not supported'

Demo notebook fails at inferenceDetector

About code release

Could you release pre-trained weiths with evaluation code?

Apply for open_set/datasets/build_dataloader.py

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent