Giter Club home page Giter Club logo

Comments (5)

icoz69 avatar icoz69 commented on September 10, 2024 1
  1. FEAT has been updated to arxiv very early. The results you see now is their latest version after CVPR2020 submission, you can check the history of arxiv to find their previous performance.
  2. For the results in Table3C, part of some is based on our implementation, while for the others , we added the citation behind the method name, which you can refer to.
    As to the resolution, if you would love to obtain a result with other augmentation strategies, such as raw image size or pre-resized images, you can modify our code very easily, and report this result as you need.

from deepemd.

icoz69 avatar icoz69 commented on September 10, 2024

hello
thanks for your interest in our work.
You are right about the dataloader part. Our project is based on the open-source code of feat.
I think this is a standard way for image classification tasks. Although early works in few shot learning/meta learning pre-process data very easily, such as no data augmentation, I think it is better not to ignore the standard operations, such as random crop/scale. Otherwise it is hard to know whether an algorithm really sloves the problem or just simply play the same role as data augmentation.
Anyway, if you would love to test our algorithm with other data pre-processing methods, it is easy to modify based on our project.

from deepemd.

Tsingularity avatar Tsingularity commented on September 10, 2024

Thanks a lot for your quick response!

Yes I agree this is a standard way for the normal image recognition task. but when we compare with baselines, we need to ensure that the methods are using the same data-preprocessing (no matter which one, just like what u said, otherwise "it is hard to know whether an algorithm really sloves the problem or just simply play the same role as data augmentation" )? Or at least I think it would be better to indicate in the paper that which method uses what data-preprocessing?

And thanks for pointing out you constructed your code based on FEAT. And I just checked their implementation, I noticed that for the CUB dataset, FEAT mentioned that they are using the bounding-box cropped CUB images instead of the original one. So the CUB number in your papers are also implemented in this way?

Thanks for your answer again!

from deepemd.

icoz69 avatar icoz69 commented on September 10, 2024

1.all the analysis experiments we compared are all using the same data augmentation strategy and training strategy, so you can see the protonet, matching net, closer look, yield higher results than their reported ones, we have indicated this below the table.

2.The CUB dataset we are usign is the same with FEAT's.

from deepemd.

Tsingularity avatar Tsingularity commented on September 10, 2024
  1. speaking of the numbers in the table. I am confused about several numbers. Since you said your code is based on FEAT's, why the paper doesn't use the proto/FEAT model numbers in the FEAT paper? In that paper, the proto's resnet-12 results are 62.39 and 80.53, and in ur paper, proto's numbers are 60/78. And FEAT's original resnet-12 numbers are 66.78 and 82.05, but in your paper, why the numbers become 62.96/78.02? The other number is Meta-Opt's, I think their implementation only uses pre-resized 84 images instead of original images with severe augmentation. (Please correct me if i am wrong).

  2. Except deepEMD, FEAT and hyperbolic embedding, I don't think I see any paper's implementation using the bounding-box cropped images as input. (Again, please correct me if I am wrong). Could you please give me a reference that I can refer to for the CUB baseline numbers in your table 3c?

Thanks!

from deepemd.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.