3D Human Body Estimation and Comparison

This repository contains two major components.

1. Run five 3D-Human-Pose-Estimation methods on one or more RGB videos, then render the estimated 3D meshes and 3D skeletons on a white background, in a front view and a side view (rotated 90 degrees about the up-axis).
2. Compare two 3D human meshes. That is, measure the discrepancies between the angles of their limbs and visualise those discrepancies on the mesh. The colored mesh is the input mesh, which we compare with a reference mesh.

Figure: The limbs of this mesh are colored based on their correctness. Green means that the limb angle is within the accepted error range with respect to the reference, and red means that it is outside the accepted error range. SMPL-Body was used for this visualisation, courtesy of the Max Planck Institute for Intelligent Systems.

Hardware

This code has been tested on an Ubuntu 18.04 machine.

Testing five 3D-Human-Pose-Estimation models

Install the methods

Clone these repositories and follow their installation instructions.

Method	Repository link
[1] ROMP	https://github.com/Saafke/ROMP
[2] DecoMR	https://github.com/zengwang430521/DecoMR
[3] ExPose	https://github.com/vchoutas/expose
[4] VIBE	https://github.com/mkocabas/VIBE
[5] VideoPose3D	https://github.com/facebookresearch/VideoPose3D

Note: The ROMP link above points to a fork of the original repository, modified to make it easier to run this method on videos.

Run the methods

Now we can run the above methods on your input video(s) with the "estimate.sh" script. Before running it, change the directories at the top of the script so that they point to where you installed the above methods and their conda environments. Then execute the following commands.

First, go into the correct subfolder of this repository:

$ cd test_five_methods

Then run:

$ bash estimate.sh

This will run the methods on your input video(s) and store the results - i.e. the estimated 3D meshes (or skeletons) and camera parameters - in the corresponding output folders (in this repository).

Visualise results

To visualise the results, we need to render them. The following script renders the 3D meshes (or skeletons) via a weak-perspective camera model. Execute the following command, specifying the desired width and height of the rendered images (we recommend using the same resolution as the input image or video):

$ python render.py --width 1920 --height 1080

The above code renders independent videos for each method. To combine the videos into a single view:

$ python mix_clips.py
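
To make the weak-perspective camera model and the side view more concrete, here is a minimal sketch. It is not the code in render.py: the function names, the camera convention (a single scale plus a 2D translation, applied before scaling), and the mapping from normalised coordinates to pixels are all assumptions chosen for illustration. The methods above each use their own camera convention, so check the one that applies to your results.

```python
import numpy as np


def rotate_about_up_axis(vertices, degrees):
    """Rotate (N, 3) vertices about the y (up) axis, around their centroid."""
    theta = np.deg2rad(degrees)
    c, s = np.cos(theta), np.sin(theta)
    rot_y = np.array([[c, 0.0, s],
                      [0.0, 1.0, 0.0],
                      [-s, 0.0, c]])
    centroid = vertices.mean(axis=0)
    return (vertices - centroid) @ rot_y.T + centroid


def weak_perspective_project(vertices, scale, trans, width, height):
    """Project (N, 3) vertices to (N, 2) pixel coordinates.

    Weak perspective ignores per-vertex depth: every vertex is scaled by
    the same factor. Assumed convention (hypothetical, for illustration):
    normalised = scale * (xy + trans), with [-1, 1] spanning the shorter
    image side and the origin at the image centre.
    """
    xy = vertices[:, :2]
    normalised = scale * (xy + np.asarray(trans))
    half_side = 0.5 * min(width, height)
    centre = np.array([0.5 * width, 0.5 * height])
    return normalised * half_side + centre


# Toy "mesh" of three vertices; the z values differ but do not affect the result.
front = np.array([[0.0, 0.0, 5.0], [1.0, 0.0, 5.0], [0.0, 1.0, 9.0]])
side = rotate_about_up_axis(front, 90.0)

front_px = weak_perspective_project(front, 0.5, (0.0, 0.0), 1920, 1080)
side_px = weak_perspective_project(side, 0.5, (0.0, 0.0), 1920, 1080)
```

Rotating about the centroid rather than the origin keeps the body in frame for the side view; whether render.py does the same is not stated here.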

Comparing Two Human Meshes

This component compares two estimated SMPL [6] meshes. We first extract the 3D skeleton from each mesh. We then compute the discrepancies between skeleton1 and skeleton2, i.e. the differences between the 3D angles of corresponding limbs. The error threshold is a hyperparameter you can change. To try the toy example:

Install requirements via conda

conda create -n two-humans python=3.8
conda activate two-humans
pip install -r compare_two_humans/requirements.txt

Download vertex labels

This file tells us which vertices belong to which (SMPL) body parts.

wget -P ./compare_two_humans/data https://raw.githubusercontent.com/Meshcapade/wiki/main/assets/SMPL_body_segmentation/smpl/smpl_vert_segmentation.json
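
As an illustration of how such vertex labels can drive the green/red coloring, here is a minimal sketch. It assumes the JSON file is a dictionary that maps each body-part name to a list of vertex indices; the function names, the toy part names, and the grey fallback color are hypothetical and not taken from compare_and_vis.py.

```python
import json

import numpy as np

GREEN = (0.0, 1.0, 0.0)
RED = (1.0, 0.0, 0.0)
GREY = (0.7, 0.7, 0.7)  # assumed fallback for parts that are not evaluated


def load_vertex_labels(path):
    """Load the part-name -> vertex-index-list mapping (assumed file layout)."""
    with open(path) as f:
        return json.load(f)


def colour_vertices(part_to_vertices, part_is_correct, num_vertices):
    """Return an (num_vertices, 3) RGB array: green or red per evaluated part."""
    colours = np.tile(np.array(GREY), (num_vertices, 1))
    for part, vertex_ids in part_to_vertices.items():
        if part not in part_is_correct:
            continue  # leave unevaluated parts grey
        colours[vertex_ids] = GREEN if part_is_correct[part] else RED
    return colours


# Toy stand-in for the real file, with five vertices instead of SMPL's 6890.
toy_labels = {"leftArm": [0, 1], "rightArm": [2, 3], "head": [4]}
toy_colours = colour_vertices(
    toy_labels, {"leftArm": True, "rightArm": False}, num_vertices=5
)
```

With the real file, `load_vertex_labels("compare_two_humans/data/smpl_vert_segmentation.json")` would replace the toy dictionary, and `num_vertices` would be 6890 for an SMPL mesh.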

Execute script

python compare_two_humans/compare_and_vis.py

Your result is the output.png file. To try your own meshes, replace the file paths in compare_and_vis.py with paths to your own estimates or ground truths. These should be in the form of SMPL models, such as the estimates from VIBE, ExPose or ROMP.
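
The limb-angle comparison described in this section can be sketched as follows. This is a simplified illustration under stated assumptions, not the code in compare_and_vis.py: it takes the two skeletons as arrays of 3D joint positions in the same global orientation, measures the angle between corresponding limb directions, and uses a hypothetical three-joint skeleton and an arbitrary 20-degree threshold.

```python
import numpy as np

# Hypothetical toy skeleton: each limb is a (parent joint, child joint) pair.
LIMBS = {
    "upper_arm": (0, 1),
    "forearm": (1, 2),
}


def limb_angle_errors(skeleton, reference, limbs):
    """Angle in degrees between each limb's direction in the two skeletons."""
    errors = {}
    for name, (parent, child) in limbs.items():
        a = skeleton[child] - skeleton[parent]
        b = reference[child] - reference[parent]
        cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
        # Clip to guard against rounding slightly outside [-1, 1].
        errors[name] = float(np.degrees(np.arccos(np.clip(cos, -1.0, 1.0))))
    return errors


def classify_limbs(errors, threshold_deg):
    """True where the limb is within the accepted error range."""
    return {name: err <= threshold_deg for name, err in errors.items()}


# Reference arm is straight; the estimated forearm is bent by 90 degrees.
reference = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
estimate = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 1.0, 0.0]])

errors = limb_angle_errors(estimate, reference, LIMBS)
is_correct = classify_limbs(errors, threshold_deg=20.0)
```

Here the upper arm matches the reference and the forearm does not, so they would be drawn green and red respectively. Raising the threshold makes the comparison more lenient.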

References

[1] Sun, Y., Bao, Q., Liu, W., Fu, Y., Black, M. J., & Mei, T. (2021). Monocular, one-stage, regression of multiple 3D people. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 11179-11188).

[2] Zeng, W., Ouyang, W., Luo, P., Liu, W., & Wang, X. (2020). 3D human mesh regression with dense correspondence. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7054-7063).

[3] Choutas, V., Pavlakos, G., Bolkart, T., Tzionas, D., & Black, M. J. (2020, August). Monocular expressive body regression through body-driven attention. In European Conference on Computer Vision (pp. 20-40). Springer, Cham.

[4] Kocabas, M., Athanasiou, N., & Black, M. J. (2020). VIBE: Video inference for human body pose and shape estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5253-5263).

[5] Pavllo, D., Feichtenhofer, C., Grangier, D., & Auli, M. (2019). 3D human pose estimation in video with temporal convolutions and semi-supervised training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7753-7762).

[6] Loper, M., Mahmood, N., Romero, J., Pons-Moll, G., & Black, M. J. (2015). SMPL: A skinned multi-person linear model. ACM Transactions on Graphics (TOG), 34(6), 1-16.

Contact

This code has been written by Xavier Weber and Mohamed Ilyes Lakhal. For queries regarding this repository, please contact Xavier ([email protected]).

Acknowledgements and licenses

This repository makes use of the SMPL-Body, which is licensed under the Creative Commons Attribution 4.0 International License. License link: https://smpl.is.tue.mpg.de/bodylicense.html
