Giter Club home page Giter Club logo

Comments (7)

1adrianb avatar 1adrianb commented on June 18, 2024

Hi @hzh8311, The two numbers are used to aproximatevly map the center and the scale obtained from a tight bounding box to the one used during training.

Regarding the results: You should get around 74% if you run this code. For LS3D-W balanced you have to remove the 1.75 shift at this line:

predictions[i] = preds_img:clone() + 1.75
otherwise you will ge the wrong result. The shift is required only for 300W where there is a diference of indexes been 0 or 1 based.

from 2d-and-3d-face-alignment.

hzh8311 avatar hzh8311 commented on June 18, 2024

So why 1.75 here? Should not it be 1. ?

from 2d-and-3d-face-alignment.

hzh8311 avatar hzh8311 commented on June 18, 2024

It seems that the center and scale for training are fixed as (225, 275) and 1.8, respectively. And the center and scale for testing are computed according to bounding box. Does this difference matters and leads to such performance drop (17-18%)? @1adrianb

from 2d-and-3d-face-alignment.

1adrianb avatar 1adrianb commented on June 18, 2024

@hzh8311 the center is fixed for 300W-LP because the images are already normalised and therefore computing a bounding box is not required. At test time however the faces are coming in all sort of sizes and are found at various locations in the image, therefore some sort of normalisation is required. This is done for simplicity based on the bounding box size.

The error of 17% is not coming from normalisation diff, but from the shift between the ground truth and the predictions. The shift is applied after the predictions are obtained.

from 2d-and-3d-face-alignment.

hzh8311 avatar hzh8311 commented on June 18, 2024

I removed the shift you mentioned, but only 3-4% improvement compared to my first result. That is to say it is still 13-14% lower than the result reported in the paper.
For more details, the AUCs on traning set and validation set(300W_LP_test) are quiet high (~80%), but it drop badly on LS3D-W dataset.

from 2d-and-3d-face-alignment.

1adrianb avatar 1adrianb commented on June 18, 2024

Oh, so you are using a retrained model? Because using the pretained model, already provided, you should be able to match exactly that numbers.

from 2d-and-3d-face-alignment.

hzh8311 avatar hzh8311 commented on June 18, 2024

Yes, but I can't figure out why the retrained model can not reach your performance.

from 2d-and-3d-face-alignment.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.