Giter Club home page Giter Club logo

Comments (6)

WenbinLee avatar WenbinLee commented on September 4, 2024

Thanks for your suggestion. Yes, it's an important comparison experiment. The reason why we didn't add this experiment in the original paper was that we thought the local-based Prototypical Net (feature dimensionality is 64) should be worse than the Global-based Prototypical Net (feature dimensionality is 1600).

Fortunately, we did this experiment recently. When doing a 5-way 5-shot task, the accuracy of using mean local features (i.e., Prototypical Net) is 67.15+0.65%. Hope this can help you. Also, you can use our code to easily implement this experiment.

BTW, in fact, one novelty is indeed the usage of local features without pooling , the other more important novelty is the image-to-class measure which can take full advantage of the local features because of the exchangeability.

from dn4.

WenbinLee avatar WenbinLee commented on September 4, 2024

It is done on the miniImagenet dataset. Thank you.

from dn4.

icoz69 avatar icoz69 commented on September 4, 2024

Thanks for your suggestion. Yes, it's an important comparison experiment. The reason why we didn't add this experiment in the original paper was that we thought the local-based Prototypical Net (feature dimensionality is 64) should be worse than the Global-based Prototypical Net (feature dimensionality is 1600).

Fortunately, we did this experiment recently. When doing a 5-way 5-shot task, the accuracy of using mean local features (i.e., Prototypical Net) is 67.15+0.65%. Hope this can help you. Also, you can use our code to easily implement this experiment.

BTW, in fact, one novelty is indeed the usage of local features without pooling , the other more important novelty is the image-to-class measure which can take full advantage of the local features because of the exchangeability.

Thank you for your instant reply. This is an inspiring idea. However, I did not quite understand your reply. Why Prototypical Net has such a high dimensionality? Should't it has the same dim with local based, as it simply average all local features? Actually, what i want to ask is a baseline model that averages all local features and still use cosine distance, such that the only difference to yours is to average or not. If it is compared with prototypical directly, both the distance function and the local/global, changed and we do not know how much the locality works. Thank you.

from dn4.

WenbinLee avatar WenbinLee commented on September 4, 2024

In section 3.2 of the original paper of Prototypical Net, you can find that the feature dimensionality is 1600 because they use global features 64x5x5=1600. Never mind, I just mentioned this is to explain why we didn't do this experiment in our original CVPR paper. Please ignore this part.

Yes, I mean I have done what you suggested. We can use global average pooling to get a 64-dimensional local feature vector for each class, and we still use cosine distance, then we can get an accuracy of 67.15+0.65% for the 5-way 5-shot setting on the miniImagenet dataset. But if you use our method, you can get 71.02+0.64%. Is it more clear?

Thank you.

from dn4.

icoz69 avatar icoz69 commented on September 4, 2024

Thank you for your reply. That sounds good. Does it also have advantages in 1shot 5way task?

from dn4.

WenbinLee avatar WenbinLee commented on September 4, 2024

You are welcome. Yes, it does

from dn4.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.