Giter Club home page Giter Club logo

show's People

Contributors

lithiumice avatar slimevrx avatar vra avatar yhw-yhw avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

show's Issues

Error with Face Only Video and Openpifpaf

I am trying to run SHOW to get a rendering of a face portrait video. It is showing the following error:

2023-08-04 12:49:04 | INFO     | SHOW.load_assets:48 - mmpose det length before: 1
2023-08-04 12:49:04 | INFO     | SHOW.load_assets:53 - no whole person detected
2023-08-04 12:49:04 | ERROR    | SHOW.load_assets:122 - max_person_crop_im is None

The model is also trying to import Openpifpaf, which was not installed during the setup process. It was commented out in modules/PyMAF/requirements.txt.

Here is a sample image from the video attached:
000001

Is there a way to run it without using Openpifpaf and solve this error?

Regarding hand pose dimension

It is provided that the hand pose shape is (bs,12) in the dataset description, [https://github.com/yhw-yhw/SHOW#dataset-description] but each hand should consist of 15 joints resulting a shape of (bs,45) right. Can you please explain the discrepancy here.

mmcv install version

Could you please share the mmcv-full, mmpose, mmdet version ? These libs confilcts with each other. According to readme, the installed libs are mmcv-full==1.7.2 and mmpose==1.3.1.

Poses and Trans parameter from SMPLX model

From file all.pkl

I can get the parameters:

vertices
joints
full_pose
global_orient
transl
v_shaped
betas
body_pose
left_hand_pose
right_hand_pose
expression
jaw_pose

How to convert to Poses and Trans from SMPLX model?

Thank you!

Running demo on custom video

It seems that stage 1 expects ground truth op values to run on a video, but also calculates the open pose values if they dont exist.

However, in practice if there is no all.pkl in test/demo_video/ours , it does not detect any GT values and ends stage1 here

SHOW/stage1_main.py

Lines 218 to 224 in 06447d4

(op_kpts_org_data, mp_kpts, deca_kpts, op_valid_flag, mp_valid_flag,
deca_valid_flag, gt_seg) = ret['gt_data'].values()
if op_valid_flag.sum() == 0:
logger.warning(f'op_valid_flag is all False, skipping')
return False

What steps do we need to run Show on a custom video from scratch?

mmcv library in Colab

I noticed the author is having problems with the mmcv library in Colab

mmhuman3d requires mmcv>=1.3.17, < 1.6.1

but follow https://mmcv.readthedocs.io/en/latest/get_started/installation.html#install-with-pip

Current Colab: torch=1.13.1 and cuda=cu116 only support mmcv=1.7.0

So I decided to build mmcv=1.6.0 from source

I followed the instructions here

https://mmcv.readthedocs.io/en/latest/get_started/build.html

!curl -LO https://github.com/open-mmlab/mmcv/archive/refs/tags/v1.6.0.tar.gz
!tar xzf v1.6.0.tar.gz
%cd mmcv-1.6.0
!pip install -r requirements/optional.txt
!MMCV_WITH_OPS=1 pip install -e . -v

Takes about 30 minutes

image

Inference on audio

Good afternoon, maybe I was inattentive when I read the documentation, but how do I run your model on an audio file?

Please not using openpose

openpose is good, but it's not properly using as a lib, since it hard to calling inside python scripts. Only useful when prepareing data.

But if inference needed data, better using own pose model or mmpose or alphapose.

the questions about autoregressive models

大家好,请教大家问题:

我在训练pixel自回归模型时,遇到以下两个问题:
When I was training a pixel autoregressive model, I encountered the following two problems:

  1. gated pixelcnn 非常容易过拟合,在1、2个epoch之后,val loss就一直上升不降。
    Gated pixelcnn is very easy to overfit. After 1 or 2 epochs, the val loss keeps rising.

2.自回归模型会出现身份泄露的问题,比如speakerA生成时,会出现speakerB的动作和手势。

The autoregressive model will have the problem of identity leakage. For example, when speakerA is generated, the actions and gestures of speakerB will appear.

请教大家有什么解决问题的策略吗?感谢!

About run SHOW on demo video

Hi, i try to run SHOW on demo video but i found some mistakes as below:
WARNING | SHOW.utils:95 - not exist: /home/dell/projects/talkingface/SHOW/test/demo_video/ours/final_metric.json
WARNING | stage1_main:135 - final_losses_json_path not valid
WARNING | stage1_main:139 - ours_pkl_file_path not exists
WARNING | stage1_main:222 - op_valid_flag is all False, skipping
WARNING | stage2_main:108 - bs_at_a_time: 14
WARNING | stage2_main:138 - ours_pkl_file_path not exists: /home/dell/projects/talkingface/SHOW/test/demo_video/ours/all.pkl

Pytorch3D in Windows

Hi,

I see that the author can run SHOW on Windows
I am having problems installing pytorch3D on Windows
Do you have any instructions for me?

Thank you very much!

About the download of the video

I want to download the video provided in the paper . But most of links to video on youtube seems invalid. Could you release or directly send me the links to download the video used in paper? I tried to download with the download_youtube.py and only download successfully with no more than 11w seconds, smaller than the 27hours described in the paper.

Questions about differences between Paper and Code Implementation

Thank you for your thorough and comprehensive work firstly! It is a very solid work.
But I found some differences between paper and code default implementation.
For the default config, I found that:

  1. You opted not to utilize the silhouette loss.
  2. Instead of employing pymaf-x for hand pose initialization, you continued to use pixie hand pose initialization.
  3. There was no utilization of mediapipe to generate additional keypoints for face mesh supervision.

I am interested in understanding the reasons behind these design choices and the insights behind them.
Thank you for your time and kindness!

Can I set focal length?

Hello, @yhw-yhw
I observed that SHOW has set the initial focal length to 5000. Could this value vary? I ask because I intend to utilize the pre-calibrated camera intrinsics.

models.zip downloada failed

下载models.zip文件时,快完成的时候报 “失败-位置服务器错误。请重试或与服务器管理员联系。” 错误,连续重试好几次都不行。
img_v2_499b6fc5-2c95-4884-b528-c610de1dd22g

另外,项目有docker包没,直接在服务器上配置环境也好多坑。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.