Comments (2)
You can try the --token_spans argument to detect your specific objects. See the demo section of the README:
https://github.com/IDEA-Research/GroundingDINO#arrow_forward-demo
CUDA_VISIBLE_DEVICES={GPU ID} python demo/inference_on_a_image.py \
-c groundingdino/config/GroundingDINO_SwinT_OGC.py \
-p ./groundingdino_swint_ogc.pth \
-i .asset/cat_dog.jpeg \
-o logs/1111 \
-t "There is a cat and a dog in the image ." \
--token_spans "[[[9, 10], [11, 14]], [[19, 20], [21, 24]]]"
[--cpu-only] # open it for cpu mode
This helps reduce wrong predictions for some words.
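Writing the spans by hand is error-prone, so here is a minimal sketch of how they could be computed. It assumes, based on the example above, that each inner [start, end] pair is a character offset into the text prompt with an exclusive end, and that each group of pairs forms one phrase to detect ("a cat" and "a dog"). The helper name `phrase_token_spans` is hypothetical and not part of GroundingDINO.

```python
import json
import re


def phrase_token_spans(caption, phrases):
    """Return one list of [start, end) character spans per phrase.

    Hypothetical helper, not part of GroundingDINO. It assumes each
    phrase occurs in the caption and starts and ends with a word
    character, so that the word-boundary match below can succeed.
    """
    all_spans = []
    for phrase in phrases:
        # Match the phrase as whole words so that "cat" does not hit "category".
        match = re.search(r"\b" + re.escape(phrase) + r"\b", caption)
        if match is None:
            raise ValueError(f"phrase not found in caption: {phrase!r}")
        offset = match.start()
        # One span per whitespace-separated word of the phrase.
        word_spans = [
            [offset + word.start(), offset + word.end()]
            for word in re.finditer(r"\S+", phrase)
        ]
        all_spans.append(word_spans)
    return all_spans


caption = "There is a cat and a dog in the image ."
spans = phrase_token_spans(caption, ["a cat", "a dog"])
print(json.dumps(spans))
# prints: [[[9, 10], [11, 14]], [[19, 20], [21, 24]]]
```

The printed string matches the value passed to --token_spans in the command above. Only the first occurrence of each phrase is used, so a prompt that repeats a phrase would need extra handling.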
from groundingdino.
Thanks @rentainhe! I'm wondering whether there is a method that works with all possible queries, including unseen ones?