Giter Club home page Giter Club logo

delvm's People

Contributors

ggjy avatar lose4578 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

delvm's Issues

Problem when running demo.ipynb (blank result)

Hi, I am interested in this great job! Here I have a problem when running demo. I use LLaMA-1b-hf and vqgan-f16-8192-laion(As shown in data_preparition). The generated_img results:
image

(I find generated_img.max() = 0.07. Is there any mistake?
Thanks

Problem when running the demo.ipynb

Hi,

Thank you for your great work. I am facing the same problem as #2 (comment). Using the seg_1.png gives black and my inputid / output.sequence is the same as [Sutongtong233]'s. When I screenshot the seg_1.png, it works reasonably. Other images under the data folder are good as well.

I am using pytorch 1.13.1 and transformer 4.38.1. Could you give any advice to debug this? Or any insight that could cause this strange issue? Thank you and looking forward to your reply.

Evaluation metric

Hi

When you evaluate the model on image segmentation task, for calculating the accuracy, what post-processing did you use to align the prediction with categories? And can you please provide the evaluation code to calculate the ACC?

All the best,

请求审稿意见

恭喜作者做出这项不错的工作!
作者你好,请问这个工作是投的ICML2024吗?请问被录用了吗?审稿意见能否发一下呢?
谢谢,期待作者的回复。

GPU

how many A800 80G used for training?

The performance on more tasks

Your profile photo are just like you! Niubility! I have been waiting LVM release code longlong time.

This work has a great performance on segment&pose&deraining. And did you test on more tasks? (Especially the 3D tasks such as
depth estimation, which the original LVM performance is good) In other words, I'm very curious about the multi-task capability of LVM model with less training data. Could u show more experimental results?

Muse codes and pre-trained model

Hi, thanks for the great work.

The paper claims that you use an off-the-shelf VQGAN from Muse.

Could you kindly share which specific Muse code and pretrained model you used for this project?

Additionally, it would be really helpful to know where these resources can be found.

Thanks again!

Code about foreground segmentation

Hi, thank you for releasing your great work!
I want to know the details of post-processing in foreground segmentation.
Can you provide some code about it? e.g., code on how to inference on PASCAL-5i.
Looking forward to your reply.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.