The program takes a face photo and a speech signal as inputs, and outputs an artistic talking video in portrait line drawing or portrait cartoon style.
- Linux
- Python 3
- NVIDIA GPU + CUDA CuDNN
Install PyTorch and the other dependencies:

```shell
conda create -n animeportrait python=3.6
conda activate animeportrait
pip install -r requirements.txt
pip install torch==1.8.2+cu111 torchvision==0.9.2+cu111 torchaudio==0.8.2 -f https://download.pytorch.org/whl/lts/1.8/torch_lts.html
```
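To confirm that the GPU build of PyTorch is visible from the new environment, a quick check can help (a minimal sketch; it assumes the environment above is active, and degrades gracefully if torch is missing):

```python
# Sketch: report the installed torch build and whether CUDA is visible.
# The import is deferred so the check still runs if torch is absent.
def torch_report():
    try:
        import torch
    except ImportError:
        return "torch not installed"
    return f"torch {torch.__version__}, CUDA available: {torch.cuda.is_available()}"

print(torch_report())
```

If CUDA shows as unavailable, re-check the NVIDIA driver and that the `+cu111` wheels were installed.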
- Download pretrained models from here:
  - `Module1_checkpoints`: unzip with `tar xf` and put the folder at `Module1/checkpoints`
  - `Module2_checkpoints`: unzip with `tar xf` and put the folder at `Module2/checkpoints`
- Run the commands below to generate artistic talking videos in line drawing and cartoon styles:

```shell
# for line drawing
CUDA_VISIBLE_DEVICES=0 python main_end2end_module2.py --jpg examples/hermione2.jpeg --audio examples/female12.wav --exp formal/drawing
# for cartoon
CUDA_VISIBLE_DEVICES=0 python main_end2end_module2.py --jpg examples/hermione2.jpeg --audio examples/female12.wav --exp formal/cartoon
```

The results are saved in `output/hermione2-female12/`.
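To process several photo/audio pairs in one go, the inference call can be wrapped in a small driver. A minimal sketch, where the command form and the `output/<photo>-<audio>/` convention are copied from above and the helper names are our own:

```python
# Sketch: batch the inference command over several (photo, audio) pairs.
# Command form and output-path convention are copied from the README above.
import subprocess
from pathlib import Path

def inference_cmd(jpg, wav, exp="formal/drawing"):
    """Build the argv list; exp is 'formal/drawing' or 'formal/cartoon'."""
    return ["python", "main_end2end_module2.py",
            "--jpg", str(jpg), "--audio", str(wav), "--exp", exp]

def result_dir(jpg, wav):
    """Results are written to output/<photo-stem>-<audio-stem>/."""
    return Path("output") / f"{Path(jpg).stem}-{Path(wav).stem}"

def run_pairs(pairs, exp="formal/drawing"):
    """Run inference for each (jpg, wav) pair and report where results land."""
    for jpg, wav in pairs:
        subprocess.run(inference_cmd(jpg, wav, exp), check=True)
        print("results in", result_dir(jpg, wav))
```

For example, `run_pairs([("examples/hermione2.jpeg", "examples/female12.wav")])` reproduces the single-pair command above.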
- Download data from here:
  - Line drawing & cartoon training data: unzip with `tar xf` and put the folder at `Data`
  - Training lists: unzip with `tar xf` and put the folders at `Module2/datasets/list/trainA` and `Module2/datasets/list/trainB`
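Before launching training, it can help to verify that the data landed in the expected places. A minimal sketch, with the directory names copied from the layout above:

```python
# Sketch: sanity-check the dataset layout described above before training.
from pathlib import Path

# Directories the training setup above expects to exist.
EXPECTED_DIRS = [
    "Data",
    "Module2/datasets/list/trainA",
    "Module2/datasets/list/trainB",
]

def missing_data_dirs(root="."):
    """Return the expected directories that are absent under root."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
```

An empty return value means all three directories are in place.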
- Run the commands below to train:

```shell
cd Module2
# train line drawing style
CUDA_VISIBLE_DEVICES=0 python train.py --dataroot drawing --name training/drawing1 --model geomgm_ifw_fore --netG resnet_9blocks_rcatland32_full_ifw --netg_resb_div 3 --netg_resb_disp 3 --output_nc 1 --display_env training_drawing1 --lr 0.00005 --lambda_geom 50 --lambda_geom_lipline 50 --more_weight_for_lip 2 --lambda_face 3.0 --lambda_warp_inter 10 --blendbg 1 --select_target12_thre 0.0 --niter 70 --niter_decay 0
# train cartoon style
CUDA_VISIBLE_DEVICES=0 python train.py --dataroot cartoon --name training/cartoon1 --model geomgm_ifw_cartoon_fore --netG resnet_9blocks_rcatland32_full_ifw --dataset_mode umlvd_ifw_cartoon --netg_resb_div 3 --netg_resb_disp 3 --output_nc 3 --display_env training_cartoon1 --lr 0.00005 --lambda_geom 50 --lambda_geom_lipline 0 --more_weight_for_lip 2 --lambda_face 3.0 --lambda_warp_inter 10 --blendbg 1 --niter 70 --niter_decay 0
```

The trained models are saved in `Module2/checkpoints/training/drawing1` and `Module2/checkpoints/training/cartoon1`, respectively.
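The two training commands above share most of their flags. When experimenting with both styles, assembling them programmatically keeps the shared flags in one place. A minimal sketch; every flag value is copied from the commands above, and no additional options are invented:

```python
# Sketch: assemble the two train.py commands from the README.
# All flag values are copied verbatim from the commands above.

# Flags common to both styles.
COMMON = {
    "--netG": "resnet_9blocks_rcatland32_full_ifw",
    "--netg_resb_div": "3",
    "--netg_resb_disp": "3",
    "--lr": "0.00005",
    "--lambda_geom": "50",
    "--more_weight_for_lip": "2",
    "--lambda_face": "3.0",
    "--lambda_warp_inter": "10",
    "--blendbg": "1",
    "--niter": "70",
    "--niter_decay": "0",
}

# Style-specific flags (these override/extend COMMON).
STYLES = {
    "drawing": {
        "--dataroot": "drawing",
        "--name": "training/drawing1",
        "--model": "geomgm_ifw_fore",
        "--output_nc": "1",
        "--display_env": "training_drawing1",
        "--lambda_geom_lipline": "50",
        "--select_target12_thre": "0.0",
    },
    "cartoon": {
        "--dataroot": "cartoon",
        "--name": "training/cartoon1",
        "--model": "geomgm_ifw_cartoon_fore",
        "--dataset_mode": "umlvd_ifw_cartoon",
        "--output_nc": "3",
        "--display_env": "training_cartoon1",
        "--lambda_geom_lipline": "0",
    },
}

def train_cmd(style):
    """Return the argv list for one style ('drawing' or 'cartoon')."""
    flags = {**COMMON, **STYLES[style]}
    cmd = ["python", "train.py"]
    for key, value in sorted(flags.items()):
        cmd += [key, value]
    return cmd
```

Running `subprocess.run(train_cmd("drawing"), check=True)` from inside `Module2` is then equivalent to the first command above (flag order differs, which argparse does not care about).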
Code for Module1 is borrowed from https://github.com/adobe-research/MakeItTalk