CoCosNet

PyTorch implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation" (CVPR 2020 oral).

Update:

2020-05-25: Training code for DeepFashion is complete. Due to memory limitations, I made the following changes to the network:

  • Disable the non-local layer, as its memory cost is infeasible on common hardware. If the original paper is accurate that the non-local layer works on (128-128-256) tensors, then each attention matrix would contain 128^4 elements (which takes 1 GB at 4 bytes per element).
  • Shrink the correspondence map size from 64 to 32. This cuts the number of spatial positions by 4x, and the dense correspondence matrices, whose size is quadratic in the number of positions, by 16x.
  • Shrink the base number of filters from 64 to 16.
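
The memory figures above can be checked with a few lines of arithmetic. The sketch below is only a back-of-the-envelope estimate, not part of the training code: the helper name `dense_matrix_bytes` is made up for illustration, and it assumes float32 storage (4 bytes per element), a single sample, and one dense matrix with one row and one column per spatial position. Gradients and intermediate copies kept for backpropagation would add to these numbers.

```python
def dense_matrix_bytes(height, width, bytes_per_element=4):
    """Bytes needed for one dense (H*W) x (H*W) pairwise matrix.

    Hypothetical helper for illustration; assumes float32 by default.
    """
    positions = height * width
    return positions * positions * bytes_per_element


GIB = 1024 ** 3
MIB = 1024 ** 2

# Non-local attention over a 128 x 128 feature map: 128^4 elements.
attention_bytes = dense_matrix_bytes(128, 128)
print(f"non-local attention at 128x128: {attention_bytes / GIB:.0f} GiB")

# Dense correspondence matrix at the original and the reduced map size.
for size in (64, 32):
    corr_bytes = dense_matrix_bytes(size, size)
    print(f"correspondence matrix at {size}x{size}: {corr_bytes / MIB:.0f} MiB")
```

Under these assumptions the attention matrix comes to 1 GiB per sample, and the correspondence matrix drops from 64 MiB to 4 MiB per sample when the map size goes from 64 to 32.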

The reduced model barely fits on a 12 GB GTX Titan X card, and its results should not be expected to match those of the full model described in the paper.

Environment

  • Ubuntu/CentOS
  • PyTorch 1.0+
  • opencv-python
  • tqdm

TODO list

  • Prepare dataset
  • Implement the network
  • Implement the loss functions
  • Implement the trainer
  • Training on DeepFashion
  • Adjust the network architecture to fit on a single 16 GB GPU.
  • Training for other tasks

Dataset Preparation

DeepFashion

Follow the dataset preparation routine in the PATN repo.

Pretrained Model

The pretrained model for the human pose transfer task: TO BE RELEASED

Training

Run python train.py.

Citations

If you find this repo useful for your research, don't forget to cite the original paper:

@article{Zhang2020CrossdomainCL,
  title={Cross-domain Correspondence Learning for Exemplar-based Image Translation},
  author={Pan Zhang and Bo Zhang and Dong Chen and Lu Yuan and Fang Wen},
  journal={ArXiv},
  year={2020},
  volume={abs/2004.05571}
}

Acknowledgement

TODO.
