Official Implementation of SGDiff (ACM MM '23)

This is the official implementation of SGDiff: A Style Guided Diffusion Model for Fashion Synthesis (ACM MM '23). SGDiff is built on the MMagic framework (version V1.1.0). The training scripts and the dataset used in the paper will be released soon.

Todo List

  • Release the training scripts.
  • Make the dataset publicly available.

Installation Guide

To use SGDiff, install a version of PyTorch with CUDA support. We recommend PyTorch 1.10 with CUDA 11.1. The codebase does not depend on these exact versions; other versions may work but have not been extensively tested. See the MMagic installation guide for more details on setting up your environment.

  1. (Optional if already installed) Install a compatible version of PyTorch with CUDA
    pip install torch==1.10.0+cu111 torchvision==0.11.0+cu111 torchaudio==0.10.0 -f https://download.pytorch.org/whl/torch_stable.html
  2. Install the MMagic dependencies (the quotes keep the shell from treating `>` as an output redirect)
    pip3 install openmim
    mim install "mmcv>=2.0.0"
    mim install mmengine
  3. Install this repository in editable mode
    git clone https://github.com/Taited/sgdiff
    cd sgdiff
    pip3 install -e .
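
After the steps above, a quick check of what actually got installed can save a failed run later. The following is a minimal sketch, not part of the repository: it looks up the distributions named in the install commands (`torch`, `torchvision`, `mmcv`, `mmengine`) and reports any that are missing. The helper names `installed_versions` and `report` are hypothetical.

```python
from importlib import metadata

# Distribution names taken from the install commands above.
REQUIRED = ("torch", "torchvision", "mmcv", "mmengine")


def installed_versions(names=REQUIRED):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def report(versions):
    """Format the version map as one 'name: version' line per package."""
    lines = []
    for name, version in versions.items():
        shown = version if version is not None else "NOT INSTALLED"
        lines.append(f"{name}: {shown}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(report(installed_versions()))
    # The CUDA check only runs when torch can be imported.
    try:
        import torch
        print("CUDA available:", torch.cuda.is_available())
    except ImportError:
        print("torch is not importable; skipping the CUDA check")
```

If a package is reported as missing, rerun the matching installation step before trying inference.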

Inference Code Now Available 🔥

The inference code for SGDiff is now available in this repository.

Before running inference, download the model checkpoint from Google Drive.

After downloading, you can generate images with the SGDiff model using the following command:

python inference.py --ckpt sgdiff.pth --img_path examples/starry_night.jpg
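
To run the same command over several style images, one option is a small wrapper that calls `inference.py` once per file. This is a sketch under stated assumptions, not part of the repository: it uses only the `--ckpt` and `--img_path` flags shown above, assumes it is run from the repository root, and does not know where `inference.py` writes its results. The names `build_commands` and `run_all` are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

# File extensions treated as images; this set is an assumption.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def build_commands(ckpt, image_dir, script="inference.py"):
    """Return one argument list per image found directly inside image_dir."""
    commands = []
    for path in sorted(Path(image_dir).iterdir()):
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            commands.append(
                [sys.executable, script, "--ckpt", str(ckpt), "--img_path", str(path)]
            )
    return commands


def run_all(commands):
    """Run each command in order, stopping at the first failure."""
    for command in commands:
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Only launch when the expected files are present in the working directory.
    if Path("inference.py").is_file() and Path("examples").is_dir():
        run_all(build_commands("sgdiff.pth", "examples"))
    else:
        print("Run this from the repository root, next to inference.py")
```

Building the argument lists separately from running them makes it easy to print and inspect the commands before starting a long batch.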

Citation

If this repository is helpful to your research, please cite it as follows.

@inproceedings{10.1145/3581783.3613806,
author = {Sun, Zhengwentai and Zhou, Yanghong and He, Honghong and Mok, P.Y.},
title = {SGDiff: A Style Guided Diffusion Model for Fashion Synthesis},
year = {2023},
isbn = {9798400701085},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3581783.3613806},
doi = {10.1145/3581783.3613806},
booktitle = {Proceedings of the 31st ACM International Conference on Multimedia},
pages = {8433–8442},
numpages = {10},
keywords = {style guidance, denoising diffusion probabilistic models, text-to-image, fashion synthesis},
location = {Ottawa ON, Canada},
series = {MM '23}
}

Acknowledgement

This work builds upon the MMagic library. We thank the MMagic team for their substantial contributions to the community. For the exact version of MMagic we used (V1.1.0), please refer to their repository.

Stay tuned for updates on the release of additional resources!
