ningshuliang / picture


44.0 44.0 4.0 27.67 MB

Official code for paper "PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns"

License: MIT License

Python 15.03% Jupyter Notebook 84.96% Shell 0.01%

picture's People

Contributors

Stargazers

Watchers

Forkers

zcfrank1st fasvid thanhpham1987 happyxy

picture's Issues

solved

When will the code be released?

Looks very good
When will the code be released?

blurry results from stage1 when using img clip

I successfully ran stage 1 inference with the CLIP text encoder, but the results are blurry when I use the CLIP image encoder instead.

I tried the code below:

import os

from PIL import Image
from ldm.modules.encoders.modules import ClipImageProjector
from torchvision import transforms

version="./../pretrain_models/clip-vit-large-patch14"
clip_model2 = ClipImageProjector(version=version).to(device)
tform = transforms.ToTensor()

# text = tokenizer(text_description,truncation=True, max_length=77, return_length=True,
#             return_overflowing_tokens=False, padding="max_length", return_tensors="pt")

# text_features = clip_model(text["input_ids"].cuda(non_blocking=True))
# text_features = text_features.last_hidden_state # torch.Size([1, 77, 768])

garment_condition_path = os.path.join("./Sample_data/Cloth_White_Background", file_name[0])
garment_condition = tform(Image.open(garment_condition_path).convert("RGB"))

garment_condition = garment_condition * 2. - 1.
garment_condition = clip_model2.preprocess(garment_condition.unsqueeze(0)) # got similar results with / without preprocessing
text_features = clip_model2(garment_condition.cuda(non_blocking=True))

c = [concat_feature,text_features]
sampler.sample(S=opt.ddim_steps,
               conditioning=c,
               ...)

Could you please help me use the CLIP image encoder?
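One thing worth checking in the snippet above is the value range of the image at each step. The sketch below uses plain Python on a single pixel to show the arithmetic: `ToTensor` yields values in [0, 1], the line `garment_condition * 2. - 1.` maps them to [-1, 1], and CLIP-style normalisation is defined on [0, 1] values. The mean and standard deviation are the constants published with OpenAI CLIP. Whether `ClipImageProjector.preprocess` expects input in [-1, 1] or [0, 1] is an assumption to verify in the repository, and the helper names here are hypothetical.

```python
# Per-channel normalisation constants published with OpenAI CLIP.
CLIP_MEAN = (0.48145466, 0.4578275, 0.40821073)
CLIP_STD = (0.26862954, 0.26130258, 0.27577711)


def to_signed(value):
    """Map a value from [0, 1] to [-1, 1], as in `x * 2. - 1.`."""
    return value * 2.0 - 1.0


def to_unit(value):
    """Map a value from [-1, 1] back to [0, 1]."""
    return (value + 1.0) / 2.0


def clip_normalise(rgb_unit):
    """Normalise one RGB pixel given in [0, 1] with the CLIP mean/std."""
    return tuple(
        (v - m) / s for v, m, s in zip(rgb_unit, CLIP_MEAN, CLIP_STD)
    )


# A mid-grey pixel as ToTensor would produce it (values in [0, 1]).
pixel = (0.5, 0.5, 0.5)
signed = tuple(to_signed(v) for v in pixel)    # now in [-1, 1]
restored = tuple(to_unit(v) for v in signed)   # back in [0, 1]
normalised = clip_normalise(restored)
```

If the preprocessing step assumes one range and receives the other, the normalised values are shifted and scaled incorrectly, so comparing the tensor's min and max before and after `preprocess` is a cheap sanity check.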

densepose image

Could you please let me know how to make this densepose image?

It seems somewhat different from the densepose images I've seen before.

How can I test it on CPU?

Model link not accessible

The Google link shows a permission error.

label to colours mapping

I'm trying to generate parsing images similar to the ones you provided, but after running the segmentation generation step from VITON-HD, I get images like the figure below.

Could you please share a short code snippet showing the mapping between labels and colours?
For example:


labels = { # from https://github.com/shadow2496/VITON-HD
    0:  ['background',  [0]], 
    1:  ['paste',       [2, 4, 7, 8, 9, 10, 11]], 
    2:  ['upper',       [3]],
    3:  ['hair',        [1]],
    4:  ['left_arm',    [5]],
    5:  ['right_arm',   [6]],
    6:  ['noise',       [12]]
}
labels_to_colours = [(0,0,0), ...]
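To make the request concrete, here is a minimal sketch of how such a mapping could be applied, assuming the parsing map is a 2D grid of fine-grained label ids. The `labels` grouping is copied from the question. The palette `EXAMPLE_PALETTE` and the helpers `build_lookup` and `colourise` are hypothetical placeholders, not the colours or code used by the repository.

```python
# Grouping copied from the question (originally from VITON-HD):
# group index -> [group name, fine-grained parsing label ids]
labels = {
    0: ['background', [0]],
    1: ['paste', [2, 4, 7, 8, 9, 10, 11]],
    2: ['upper', [3]],
    3: ['hair', [1]],
    4: ['left_arm', [5]],
    5: ['right_arm', [6]],
    6: ['noise', [12]],
}

# HYPOTHETICAL palette: one RGB colour per group index.
# These are placeholder values, not the repository's colours.
EXAMPLE_PALETTE = [
    (0, 0, 0),
    (128, 0, 0),
    (0, 128, 0),
    (128, 128, 0),
    (0, 0, 128),
    (128, 0, 128),
    (0, 128, 128),
]


def build_lookup(groups):
    """Map each fine-grained label id to its group index."""
    lookup = {}
    for group_index, (_name, fine_ids) in groups.items():
        for fine_id in fine_ids:
            lookup[fine_id] = group_index
    return lookup


def colourise(parse_map, groups, palette):
    """Turn a 2D grid of fine-grained label ids into a grid of RGB tuples."""
    lookup = build_lookup(groups)
    return [[palette[lookup[label]] for label in row] for row in parse_map]


# Tiny 2x2 example: background, upper clothes, hair, noise.
coloured = colourise([[0, 3], [1, 12]], labels, EXAMPLE_PALETTE)
```

Once the real colours are known, replacing `EXAMPLE_PALETTE` is the only change needed; the grouping logic stays the same.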

About pre trained weights release

Thank you for contributing this repo to the community! I can't access the pre-trained weights when I click the pre-trained weights link. Will you share them?
