
cp-vton-plus's Introduction

CP-VTON+ (CVPRW 2020)

Official implementation for "CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On" from CVPRW 2020.
Project page: https://minar09.github.io/cpvtonplus/.
Saved/Pre-trained models: Checkpoints
Dataset: VITON_PLUS
The code and pre-trained models are tested with pytorch 0.4.1, torchvision 0.2.1, opencv-python 4.1 and pillow 5.4 (Python 3 env).

Project page | Paper | Dataset | Model | Video

Usage

This pipeline consists of consecutive training and testing of two networks, GMM and TOM. GMM warps the in-shop clothes to match the target person. TOM then blends the warped clothes output from GMM with the target person to generate the final try-on image.

  1. Install the requirements
  2. Download/prepare the dataset
  3. Train the GMM network
  4. Generate warped clothes for the training set with the trained GMM network, and copy the warped clothes & masks into the data/train directory
  5. Train the TOM network
  6. Test GMM on the testing set
  7. Generate warped clothes for the testing set, and copy the warped clothes & masks into the data/test directory
  8. Test TOM on the testing set

Installation

This implementation is built and tested with PyTorch 0.4.1. PyTorch and torchvision are recommended to be installed with conda: conda install pytorch=0.4.1 torchvision=0.2.1 -c pytorch
For the remaining packages, run pip install -r requirements.txt

Data preparation

For training/testing on the VITON dataset, our full and processed dataset is available here: https://1drv.ms/u/s!Ai8t8GAHdzVUiQRFmTPrtrAy0ZP5?e=rS1aK8. After downloading, unzip it to your data directory.
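
A quick way to verify the unzipped layout is to check for the folders the loaders reference. The sketch below is a minimal sanity check assuming the folder names mentioned elsewhere in this README (image, image-parse, cloth, cloth-mask, pose) and the default dataroot/pair-list locations; the released data may also contain additional folders (e.g. image-parse-new), so treat the list as illustrative, not authoritative.

# check_data.py - hedged sanity check for the unzipped dataset layout.
# Folder names are the ones referenced in this README; adjust if your copy differs.
import os

def check_layout(data_root="data"):
    expected = ["image", "image-parse", "cloth", "cloth-mask", "pose"]
    for mode in ("train", "test"):
        mode_dir = os.path.join(data_root, mode)
        missing = [d for d in expected if not os.path.isdir(os.path.join(mode_dir, d))]
        print(mode_dir, "OK" if not missing else "missing: " + ", ".join(missing))
    for pairs in ("train_pairs.txt", "test_pairs.txt"):
        path = os.path.join(data_root, pairs)
        print(path, "found" if os.path.isfile(path) else "not found")

if __name__ == "__main__":
    check_layout()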

Training

Run python train.py with your specific usage options for the GMM and TOM stages.
For example, for GMM: python train.py --name GMM --stage GMM --workers 4 --save_count 5000 --shuffle
Then run test.py for the GMM network on the training dataset, which will generate the warped clothes and masks in the "warp-cloth" and "warp-mask" folders inside the "result/GMM/train/" directory. Copy the "warp-cloth" and "warp-mask" folders into your data directory, for example into the "data/train" folder.
Then run the TOM stage: python train.py --name TOM --stage TOM --workers 4 --save_count 5000 --shuffle

Testing

Run python test.py with your specific usage options.
For example, GMM: python test.py --name GMM --stage GMM --workers 4 --datamode test --data_list test_pairs.txt --checkpoint checkpoints/GMM/gmm_final.pth
Then run test.py for the GMM network on the testing dataset, which will generate the warped clothes and masks in the "warp-cloth" and "warp-mask" folders inside the "result/GMM/test/" directory. Copy the "warp-cloth" and "warp-mask" folders into your data directory, for example into the "data/test" folder.
Then run the TOM stage: python test.py --name TOM --stage TOM --workers 4 --datamode test --data_list test_pairs.txt --checkpoint checkpoints/TOM/tom_final.pth
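
For reference, steps 3-8 of the Usage section can be chained from a single script. The sketch below simply reuses the example commands above via subprocess and copies the GMM outputs with shutil; the flags used to warp the training set (--datamode train --data_list train_pairs.txt) and the checkpoint file names are assumptions based on the options shown in this README, so adjust them to your setup.

# run_pipeline.py - hedged sketch chaining steps 3-8; commands are copied from this
# README, the train-set warping flags and checkpoint names are assumptions.
import os
import shutil
import subprocess

def run(cmd):
    print(">>", cmd)
    subprocess.run(cmd, shell=True, check=True)

def copy_warp_results(datamode):
    # steps 4/7: copy GMM outputs next to the dataset so the TOM stage can read them
    for folder in ("warp-cloth", "warp-mask"):
        src = os.path.join("result", "GMM", datamode, folder)
        dst = os.path.join("data", datamode, folder)
        if os.path.isdir(dst):
            shutil.rmtree(dst)  # overwrite stale copies
        shutil.copytree(src, dst)
        print("copied", src, "->", dst)

if __name__ == "__main__":
    # 3. Train GMM
    run("python train.py --name GMM --stage GMM --workers 4 --save_count 5000 --shuffle")
    # 4. Warp the training set with the trained GMM (flags assumed), then copy outputs
    run("python test.py --name GMM --stage GMM --workers 4 --datamode train "
        "--data_list train_pairs.txt --checkpoint checkpoints/GMM/gmm_final.pth")
    copy_warp_results("train")
    # 5. Train TOM
    run("python train.py --name TOM --stage TOM --workers 4 --save_count 5000 --shuffle")
    # 6.-7. Test GMM on the testing set and copy its outputs
    run("python test.py --name GMM --stage GMM --workers 4 --datamode test "
        "--data_list test_pairs.txt --checkpoint checkpoints/GMM/gmm_final.pth")
    copy_warp_results("test")
    # 8. Test TOM on the testing set
    run("python test.py --name TOM --stage TOM --workers 4 --datamode test "
        "--data_list test_pairs.txt --checkpoint checkpoints/TOM/tom_final.pth")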

Inference/Demo

Download the pre-trained models from here: https://1drv.ms/u/s!Ai8t8GAHdzVUiQA-o3C7cnrfGN6O?e=EaRiFP. Then follow the same steps as in Testing to test/run inference with our model. The code and pre-trained models are tested with pytorch 0.4.1, torchvision 0.2.1, opencv-python 4.1 and pillow 5.4.

Testing with custom images

To run the model with custom internet images, make sure you have the following:

  1. image (image of a person, cropped/resized to 192 x 256 (width x height) pixels)
  2. image-parse (you can generate it with the CIHP_PGN or Graphonomy pretrained networks from the person image. See this comment)
  3. cloth (in-shop cloth image, cropped/resized to 192 x 256 (width x height) pixels)
  4. cloth-mask (binary mask of the cloth image; you can generate it with a simple Pillow/OpenCV function, see the sketch after this list)
  5. pose (pose keypoints of the person, generated with the OpenPose COCO-18 model; OpenPose from the official repository is preferred)
  6. Also, make a test_pairs.txt file for your custom images. Follow the VITON dataset format to keep the same arrangement; otherwise, modify the code accordingly.
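
As an illustration of items 4 and 6 above, the following hedged sketch builds a binary cloth mask with OpenCV and appends a person/cloth pair to test_pairs.txt. The threshold value (assuming a near-white background) and the one-pair-per-line "person_image cloth_image" format are assumptions based on the VITON data used here; adjust both for your own images.

# make_custom_inputs.py - hedged helpers for custom images; file names below are
# hypothetical examples, not shipped with the repository.
import cv2

def make_cloth_mask(cloth_path, mask_path, threshold=250):
    img = cv2.imread(cloth_path)                      # in-shop cloth image (192 x 256)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    # pixels darker than the near-white background are treated as cloth
    _, mask = cv2.threshold(gray, threshold, 255, cv2.THRESH_BINARY_INV)
    cv2.imwrite(mask_path, mask)

def add_test_pair(person_image, cloth_image, pairs_file="data/test_pairs.txt"):
    # one "person_image cloth_image" pair per line
    with open(pairs_file, "a") as f:
        f.write(person_image + " " + cloth_image + "\n")

# Example usage (hypothetical file names):
# make_cloth_mask("data/test/cloth/000001_1.jpg", "data/test/cloth-mask/000001_1.jpg")
# add_test_pair("000001_0.jpg", "000001_1.jpg")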

What to do in case of unexpected results

There are many factors that can cause distorted/unexpected results. Please try the following:

  1. First try the original VITON dataset and test-pair combinations; check the intermediate results and the final output, and verify they are as expected.
  2. If the original VITON results are not as expected, please check the issues raised in this GitHub repo; others have already reported several problems, so see how they solved them.
  3. If the original VITON test results are as expected, then run your custom test sets, check the intermediate results, and debug where things go wrong.
  4. If you are testing with custom images, check the GitHub repository README and the related issues on how to run with custom images.

It is difficult to understand your issue from a single image/output. As mentioned, there are various factors. Please debug step by step and see where it goes wrong. Check all the available intermediate/final inputs/outputs visually, and check multiple cases to see whether the issue happens in all of them. Good luck!

Citation

Please cite our paper in your publications if it helps your research:

@InProceedings{Minar_CPP_2020_CVPR_Workshops,
	title={CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On},
	author={Minar, Matiur Rahman and Thai Thanh Tuan and Ahn, Heejune and Rosin, Paul and Lai, Yu-Kun},
	booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
	month = {June},
	year = {2020}
}

Acknowledgements

This implementation is largely based on the PyTorch implementation of CP-VTON. We are extremely grateful for their public implementation.

cp-vton-plus's People

Contributors: minar09, thaithanhtuan, uyw4687

cp-vton-plus's Issues

size issue when running train.py GMM

I have been trying to solve it but didn't get anywhere. The output shape does not match the broadcast shape.

python train.py --name GMM --stage GMM --workers 4 --save_count 5000 --shuffle

Namespace(batch_size=4, checkpoint='', checkpoint_dir='checkpoints', data_list='train_pairs.txt', datamode='train', dataroot='data', decay_step=100000, display_count=20, fine_height=256, fine_width=192, gpu_ids='', grid_size=5, keep_step=100000, lr=0.0001, name='GMM', radius=5, save_count=5000, shuffle=True, stage='GMM', tensorboard_dir='tensorboard', workers=4)
Start to train stage: GMM, named: GMM!
initialization method [normal]
initialization method [normal]
Traceback (most recent call last):
File "train.py", line 229, in
main()
File "train.py", line 210, in main
train_gmm(opt, train_loader, model, board)
File "train.py", line 72, in train_gmm
inputs = train_loader.next_batch()
File "/content/gdrive/My Drive/virtual mirror/cp_dataset.py", line 228, in next_batch
batch = self.data_iter.next()
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 345, in next
data = self._next_data()
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 856, in _next_data
return self._process_data(data)
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 881, in _process_data
data.reraise()
File "/usr/local/lib/python3.6/dist-packages/torch/_utils.py", line 395, in reraise
raise self.exc_type(msg)
RuntimeError: Caught RuntimeError in DataLoader worker process 0.
Original Traceback (most recent call last):
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/_utils/worker.py", line 178, in _worker_loop
data = fetcher.fetch(index)
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/utils/fetch.py", line 44, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/utils/fetch.py", line 44, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/content/gdrive/My Drive/virtual mirror/cp_dataset.py", line 138, in getitem
shape_ori = self.transform(parse_shape_ori) # [-1,1]
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/transforms.py", line 61, in call
img = t(img)
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/transforms.py", line 166, in call
return F.normalize(tensor, self.mean, self.std, self.inplace)
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/functional.py", line 208, in normalize
tensor.sub
(mean).div
(std)
RuntimeError: output with shape [1, 256, 192] doesn't match the broadcast shape [3, 256, 192]

How to create "image-parse-new" when testing on custom images ?

I am testing CP-VTON+ on my custom images. I have created masks and parsed images for both the cloth and the models using human-body parsing repos like CIHP and JPP-Net, but I couldn't understand the use of the "image-parse-new" folder in the test and train directories of the VITON dataset. How do I create the images inside the "image-parse-new" folder? Nothing is mentioned about this in the README file.

generate pose

Hello, is it possible to have the code that you used to generate the pose keypoints? I tried to generate the keypoints of some images of the CP-VTON+ dataset with the code from this GitHub repository: https://github.com/minar09/openpose-pytorch. However, I do not get the same points as the ones in the CP-VTON+ dataset.

grid size

Hi,
Can you please tell me why the grid size is 5 and what the 5 indicates?
Also, in FeatureRegression the output_dim is 2*opt.grid_size**2, whereas generally it should be 6, right, because of the (2, 3) parameters in the TPS transform?
Could you please elaborate on grid_size and output_dim in FeatureRegression?
Thank you

How to do quantitative evaluation?

I wanted to know how you calculated the Warped Blended (IoU), SSIM, LPIPS, and IS metrics for quantitative evaluation, and where exactly in the code the quantitative evaluation is implemented.

image-parse error

I found that the pictures I processed at this stage can't be fed into the previous CP-VTON. Can you help me? The reported error is "RuntimeError: The size of tensor a (192) must match the size of tensor b (3) at non-singleton dimension 2".

Unknown Error

I got this error with the line: python train.py --name GMM --stage GMM --workers 4 --save_count 5000 --shuffle
(error screenshots attached)

and got this with: python train.py --name TOM --stage TOM --workers 4 --save_count 5000 --shuffle
(error screenshot attached)

Can you guide me on how to solve these errors? I downloaded the data and model and followed the README.

(additional screenshots attached)

Threshold value for body masking

In the make_body_mask function in body_binary_masking.py, I see this line of code:

Line 54:  _, seg_mask = cv2.threshold(gray, 1, 255, cv2.THRESH_BINARY)

Shouldn't the second argument (the threshold) be 0, not 1? Won't the hat segmentation be lost if 1 is set as the threshold?

Using multiple GPUs

Hi,

I specified gpu_ids = '0,1', but the model just runs on the first device. I checked the code and couldn't find "DataParallel" used to distribute the model to multiple devices. Please let me know if I am missing something.

sleeve issue

Hi,

When I tested this model with a long-sleeve shirt to fit a person who was wearing a short-sleeve shirt, the long-sleeve shirt turned into a short-sleeve shirt, and vice versa. Any advice on how to solve this short and long sleeve issue so that the long-sleeve shirt will remain long-sleeve after fitting? Many thanks,

Results are not good

I am using a custom dataset.
More details:
Batch_size = 1
Segmentation mask from CIHP_PGN results.
Pose from the OpenPose COCO model, with the code changed as described here.
Cloth and cloth mask are from the preprocessed data.

(example result images attached)

Image parsing

I am trying to give inputs from the internet, like an image of a person and an image of a cloth.
I tried running dataset_neck_skin_correction.py.
How should I get the image-parse for the input image? Please guide me; I am a beginner.

Bad results

Thanks for the work.
I have run the modules according to the README, but most of the results after the TOM stage (the generated 'try-on' folder) contain significant artifacts. I am attaching the first 5 generated photos here; some are OK, but I would say most are bad. I don't understand whether this is a bug, a problem with my steps, or a current limitation of the approach.
(first 5 generated photos attached)

binary mask for white cloth on white background

Hi there. I wanted to use your code to create a binary mask for a white cloth on a white-background image; however, I only got a completely black mask image and couldn't see the cloth at all. Could you advise how I can create a binary mask for a white cloth on a white background? Thank you so much!

(screenshot attached)

custom image error

I am trying cp-vton-plus with custom images.

In dataset_neck_skin_correction.py, only custom images produce the following error:

Traceback (most recent call last):
File "dataset_neck_skin_correction.py", line 258, in <module>
main()
File "dataset_neck_skin_correction.py", line 254, in main
image_dir, seg_dir, each[0], mask, updated_seg_dir)# def update_image_segmentation(data_dir, mask_dir, image_name, mask_name, save_dir=None, save_vis=True):
File "dataset_neck_skin_correction.py", line 184, in update_image_segmentation
result = segmentation + neck_mask
ValueError: operands could not be broadcast together with shapes (256,192,3) (256,192)

The dataset images succeed; only the custom images give this error. Do you know the cause?

output with shape [1, 256, 192] doesn't match the broadcast shape [3, 256, 192]

File "test.py", line 224, in
main()
File "test.py", line 210, in main
test_gmm(opt, test_loader, model, board)
File "test.py", line 83, in test_gmm
for step, inputs in enumerate(test_loader.data_loader):
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 345, in next
data = self._next_data()
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 856, in _next_data
return self._process_data(data)
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 881, in _process_data
data.reraise()
File "/usr/local/lib/python3.6/dist-packages/torch/_utils.py", line 395, in reraise
raise self.exc_type(msg)
RuntimeError: Caught RuntimeError in DataLoader worker process 0.
Original Traceback (most recent call last):
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/_utils/worker.py", line 178, in _worker_loop
data = fetcher.fetch(index)
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/utils/fetch.py", line 44, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/utils/fetch.py", line 44, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/content/drive/My Drive/Colab Notebooks/VirtualTryOn/cp-vton-plus/cp_dataset.py", line 140, in getitem
shape_ori = self.transform(parse_shape_ori) # [-1,1]
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/transforms.py", line 61, in call
img = t(img)
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/transforms.py", line 166, in call
return F.normalize(tensor, self.mean, self.std, self.inplace)
File "/usr/local/lib/python3.6/dist-packages/torchvision/transforms/functional.py", line 208, in normalize
tensor.sub
(mean).div
(std)
RuntimeError: output with shape [1, 256, 192] doesn't match the broadcast shape [3, 256, 192]

Size mismatch error while running test.py for GMM model

Hey, I am trying to execute test.py for GMM as mentioned in the repo, but it is giving a size mismatch error.

RuntimeError: Error(s) in loading state_dict for GMM:
size mismatch for extractionB.model.0.weight: copying a param of torch.Size([64, 3, 4, 4]) from checkpoint, where the shape is torch.Size([64, 1, 4, 4]) in current model.

Kindly help me with this.

Some image masks seem to overlap two different source images

When I run body_binary_masking.py on the test set, some masks seem to consist of two different images. However, many images also appear normal. Is this a known issue?

See the attached examples (000266_0 and 000258_0).

Thanks for a great implementation otherwise!

Weird try-on results after running CP-VTON+ on Colab - am I doing something wrong?

Hi Matiur, @minar09
I tried to run cp-vton-plus on Google Colab, but it gave weird result images in the folder "/cp-vton-plus/result/TOM/test/try-on" (see the attached images). Is this the correct folder for the try-on results? If not, which folder should I look in for the try-on output?
Can you please help me? I am still a beginner in this.

Colab implementation (screenshot attached)

Result images: (000020_0 and 000010_0 attached)

Sample image from /result/GMM/test/overlayed_TPS: (attached)

Sample image from /result/GMM/test/result_dir: (attached)

Sample image from /result/GMM/test/warp-cloth: (attached)

Thanks,

Error while testing with custom images

I really appreciate your work.

I am getting an error like this while testing custom images:

return F.conv2d(input, weight, self.bias, self.stride,
RuntimeError: Given groups=1, weight of size [64, 22, 4, 4], expected input[3, 28, 256, 192] to have 22 channels, but got 28 channels instead

Cannot untar the dataset with download.py

(error screenshot attached)

I encountered the problem shown in the attached screenshot on both macOS and Linux. I also tried downloading the dataset manually from Google Drive, but I still couldn't untar it.

About Lreg (Eq. 3)

I have a question about equation (3) in the paper.

What are Gx and Gy?

Also, I don't see Gy in the equation, is that correct?

Hello, why 40 * loss_gic

Lgic = gicloss(grid)
# 200x200 = 40.000 * 0.001
Lgic = Lgic / (grid.shape[0] * grid.shape[1] * grid.shape[2])
loss = Lwarp + 40 * Lgic    # total GMM loss

Why is Lgic multiplied by 40, and what does the comment "200x200 = 40.000 * 0.001" mean?

CPU version

Does this model work on CPU for testing/inference?
If yes, please tell me what changes I would have to make.

Thanks

Restart training from checkpoint

Hi, is there a way to restart the training from a specific checkpoint?
I'm trying this command but it always starts from step 0:
python train.py --stage GMM --workers 8 --save_count 500 --checkpoint "checkpoints/gmm_train_new/step_000500.pth"

TOM model testing error

When executing the test.py script for the TOM model, a FileNotFoundError appears.
Line 56 inside the __getitem__ method of cp_dataset.py causes the error because there is no directory named data/test/warp-cloth.

Images with different size

Hi. If I would like to train the model with images of a different size (e.g. 288 x 512, which is 1.5 times the current image size (192 x 256)), can I do so? If so, do I need to change anything in the model?

Thanks for your help,

testing gmm network gave cublas runtime error

Testing the GMM network gave this error:

python3 test.py --name GMM --stage GMM --workers 4 --datamode test --data_list test_pairs.txt --checkpoint checkpoints/GMM/gmm_final.pth

Namespace(batch_size=4, checkpoint='checkpoints/GMM/gmm_final.pth', data_list='test_pairs.txt', datamode='test', dataroot='data', display_count=1, fine_height=256, fine_width=192, gpu_ids='', grid_size=5, name='GMM', radius=5, result_dir='result', shuffle=False, stage='GMM', tensorboard_dir='tensorboard', workers=4)
Start to test stage: GMM, named: GMM!
initialization method [normal]
initialization method [normal]
THCudaCheck FAIL file=/pytorch/aten/src/THC/THCGeneral.cpp line=663 error=11 : invalid argument
Traceback (most recent call last):
File "test.py", line 225, in
main()
File "test.py", line 211, in main
test_gmm(opt, test_loader, model, board)
File "test.py", line 99, in test_gmm
grid, theta = model(agnostic, cm)
File "/home/ubuntu/.local/lib/python3.6/site-packages/torch/nn/modules/module.py", line 477, in call
result = self.forward(*input, **kwargs)
File "/home/ubuntu/Desktop/cpvtonplus/cp-vton-plus-master/networks.py", line 522, in forward
correlation = self.correlation(featureA, featureB)
File "/home/ubuntu/.local/lib/python3.6/site-packages/torch/nn/modules/module.py", line 477, in call
result = self.forward(*input, **kwargs)
File "/home/ubuntu/Desktop/cpvtonplus/cp-vton-plus-master/networks.py", line 103, in forward
feature_mul = torch.bmm(feature_B, feature_A)
RuntimeError: cublas runtime error : the GPU program failed to execute at /pytorch/aten/src/THC/THCBlas.cu:411

Inference with cpu

I tried to disable CUDA, but test.py didn't run. Is there a way to run inference on a CPU?

Runtime error when running for 1 pair of input images

RuntimeError: Given groups=1, weight of size [64, 22, 4, 4], expected input[1, 29, 256, 192] to have 22 channels, but got 29 channels instead

I am facing this error when running it for 1 input person and 1 cloth (1 pair), but it worked correctly for the dataset.
What is the reason? Please help me out as soon as possible.

no inputs['shape_ori'], KeyError

Hi, I am getting this KeyError. I printed the dictionary keys and there is no shape_ori key in them.
dict_keys(['c_name', 'im_name', 'cloth', 'cloth_mask', 'image', 'agnostic', 'parse_cloth', 'shape', 'head', 'pose_image', 'grid_image'])

loss for gmm

Why use l1_loss(warped_cloth, c), which differs from what the paper says (warped_mask, cm)?
