Comments (5)
Hi @saichanda ,
The current version of SemGCN cannot handle occlusions.
One potential solution might use some masks to impose 2D occlusions during network training.
Best,
Long
from semgcn.
@garyzhao ,
Thank you for the response. Thanks for the solution.
But I'm curious to know, from the paper, it is mentioned that the occlusions are handled.
we improve previous methods by a large margin for the action of directions, taking photo, posing, sitting down, walking dog and walking together. We hypothesize that this is due to the severe self-occlusions in these actions, while they can be effectively encoded by our SemGCN using relations within graphs.
Can you elaborate on what severe occlusions SemGCN is effectively encoding, if you say that the current version of SemGCN cannot handle occlusions.
Thank you.
from semgcn.
Sorry, Closed the issue by mistake.
from semgcn.
Hi @saichanda ,
Never mind.
That's a good question.
The "occlusions" you mentioned here ('-1' or '0' in the 2D keypoints) are extreme cases that one or more 2D joints are totally "vanished" in the 2D output, which cannot be handled by us.
In our paper, we expected that the 2D detector can still make some reasonable guesses when there are occlusions, which means the 2D output might not be accurate but close to the ground truth (reasonable). In this case, our method could refine the 3D prediction.
Therefore, to handle your occlusions, I suggest that you can add some masks (which randomly drop some 2D outputs just like your case) during training, which might improve the performance.
Best,
Long
from semgcn.
Sure @garyzhao ,
Thank you for the time and support.
best regards,
Sai
from semgcn.
Related Issues (20)
- question
- 问题
- 问题
- 问题
- Concatenation operation in Non_local
- RuntimeError: mat1 dim 1 must match mat2 dim 0 HOT 1
- problem when running viz.py
- The correspondence between 2D joints and 3D joints
- The correspondence between 2D joints and 3D joints
- SemGCN&SemCHGCN&PG
- 3D关节点
- converting H36M to openpose
- Error value of each action HOT 1
- Questions about feature dimensions
- Different input output size
- Results about perceptual feature
- 关于输入的2D坐标的问题
- Assertion Error: assert poses_3d is not None
- 文中提到的ResGCN的pytorch代码
- Dataset deleted
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from semgcn.