Giter Club home page Giter Club logo

Comments (5)

aryaabdi avatar aryaabdi commented on July 29, 2024

Thank you for this interesting work. I have trained this model from scratch using medical images. When evaluating the model, all the output masks (# of masks used = 8) consistently show the same structure with different intensity. Have you seen this issue using natural images? Any idea what could cause this? Thank you.

from groupvit.

mbehjati avatar mbehjati commented on July 29, 2024

Hi @aryaabdi,
Did you manage to solve the problem you mentioned? I'm getting a similar behavior.

from groupvit.

MohammadHossein-Bahari avatar MohammadHossein-Bahari commented on July 29, 2024

I get the same issue. @xvjiarui Can you help please?

from groupvit.

xvjiarui avatar xvjiarui commented on July 29, 2024

Hi all,

Sorry for the late reply. If you are training with specific domain images, I would suggest you start with pre-training on large scale natural images first. And the contrastive loss needs large batch size and large dataset to work.

from groupvit.

aryaabdi avatar aryaabdi commented on July 29, 2024

I tried training from scratch and also from the pre-trained (on natural images) model. The latter performed better. However, I realized the contrastive loss is not going to be effective if the number of entities within a batch is limited. I believe @xvjiarui can use a very large batch size because the training dataset contains many different entities. For example, gcc3m contains ~16k different entities. This was not the case in my training dataset and I think that is why I was not getting the desired behavior. Hope this helps.

from groupvit.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.