Comments (11)

moberweger commented on July 21, 2024

Of course you can run the second file as well. There is no difference: it is just another test sequence, and a very similar one.

from deep-prior-pp.

WeihongM commented on July 21, 2024

Hello @moberweger,
Something confused me after reading the code.
In the file main_icvl_com_refine.py, line 147:
poseNetParams = ScaleNetParams(type=1, nChan=nChannels, wIn=imgSizeW, hIn=imgSizeH, batchSize=batchSize, resizeFactor=2, numJoints=1, nDims=3)
Why do you set numJoints=1 rather than 16?

WeihongM commented on July 21, 2024

Does this mean your pretrained model is trained only on the coordinates of the first joint?
That does not seem right to me. @moberweger

moberweger commented on July 21, 2024

@WeihongM The line you mentioned (L147) is only for the refinement of the hand localization (com). Therefore, we predict only a single joint, i.e. the hand center. Thus numJoints=1, and nDims=3 for the 3D offset.
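
To make the numbers concrete, here is a minimal sketch of how numJoints and nDims relate to the size of the regression output, and how a predicted 3D offset would shift an initial hand-center estimate. The helpers `output_dim` and `refine_com` and all coordinate values are hypothetical and exist only for illustration; they are not functions from this repo.

```python
def output_dim(num_joints, n_dims):
    """Number of values the regression output must contain (hypothetical helper)."""
    return num_joints * n_dims


def refine_com(com, offset):
    """Add a predicted 3D offset to an initial hand-center estimate (hypothetical helper)."""
    if len(com) != len(offset):
        raise ValueError("com and offset must have the same number of dimensions")
    return tuple(c + o for c, o in zip(com, offset))


# Hand-center refinement: one joint (the hand center), three coordinates.
com_refine_outputs = output_dim(num_joints=1, n_dims=3)

# Full pose with the 16 joints mentioned above: 16 joints, three coordinates each.
full_pose_outputs = output_dim(num_joints=16, n_dims=3)

# Made-up values in mm, only to show how the offset is applied.
initial_com = (10.0, -5.0, 400.0)
predicted_offset = (1.5, 2.0, -8.0)
refined_com = refine_com(initial_com, predicted_offset)

print(com_refine_outputs, full_pose_outputs, refined_com)
```

So the refinement network only needs 3 outputs, while a network that predicts all 16 joints needs 48.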

WeihongM commented on July 21, 2024

@moberweger Sorry, which file contains the code to predict all the joints? Or is it correct to simply set numJoints=16 and train the network? If not, can you give me some advice on how to write it?
Now I am even more confused: in this repo you use the multi-scale input to train the offset of a single joint, but in the DeepPrior++ paper the offset is not trained this way.
What is more, I found that this code (L147) is also in the DeepPrior repo, where you also use numJoints=1 and the multi-scale input for training.
I guess you just use the first joint to check the loss and error in this code?
I hope my question makes sense. I am looking forward to a solution, thanks.

moberweger commented on July 21, 2024

You should take a look at main_icvl_posereg_embedding.py if you want to predict all joints.
Regarding the multi-scale input, it does not actually matter in terms of accuracy. It is the same code as in DeepPrior. If you read the paper, you can see how the different networks are used.

WeihongM commented on July 21, 2024

OK, @moberweger.
I am now trying to change the code (numJoints from 1 to 16) to check the performance, because the DeepPrior paper mentions that this network predicts all the joint coordinates.
Do you mean that the multi-scale input does not improve the joint accuracy much?

moberweger commented on July 21, 2024

The script main_icvl_posereg_embedding.py already does what you want, I guess. And yes, multi-scale does not help much.

WeihongM commented on July 21, 2024

@moberweger Hello,
When I run main_icvl_posereg_embedding.py on the NYU dataset, I get a mean error of 13.8069019318 mm, but the DeepPrior++ paper reports 12.3 mm on the NYU dataset. Can you give me some advice on how to improve this?
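
For reference, a minimal sketch of how a mean joint error in mm is commonly computed: the Euclidean distance between each predicted and ground-truth joint, averaged over all joints and frames. This assumes that definition of the metric; the evaluation code in the repo may differ in details. The function name `mean_joint_error` and the toy coordinates are hypothetical.

```python
import math


def mean_joint_error(pred, gt):
    """Mean Euclidean distance over all joints and frames.

    pred, gt: lists of frames; each frame is a list of (x, y, z) joints in mm.
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must contain the same number of frames")
    total = 0.0
    count = 0
    for pred_frame, gt_frame in zip(pred, gt):
        if len(pred_frame) != len(gt_frame):
            raise ValueError("each frame must contain the same number of joints")
        for p, g in zip(pred_frame, gt_frame):
            # Euclidean distance between one predicted joint and its ground truth.
            total += math.sqrt(sum((a - b) ** 2 for a, b in zip(p, g)))
            count += 1
    if count == 0:
        raise ValueError("no joints to evaluate")
    return total / count


# Toy data: one frame with two joints, with errors of 5 mm and 12 mm.
gt = [[(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)]]
pred = [[(3.0, 4.0, 0.0), (10.0, 0.0, 12.0)]]
error_mm = mean_joint_error(pred, gt)
print(error_mm)
```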

moberweger commented on July 21, 2024

main_icvl_posereg_embedding.py is intended for the ICVL dataset, so please check that. Did you use the refinement network for the hand localization?

WeihongM commented on July 21, 2024

Thank you very much for your reply. I see what I missed.
