bootcamp's Introduction

이미지 생성모델 파헤치기

일정:

8/21(월) ~ 8/23(수) Theory: Autoencoder, CycleGAN, Stable Diffusion

8/24(목) Practice

강의자료: 공지 메일 참고
Environments

CUDA 11.8
torch
torchvision
diffusers
transformers

8/24(목) Practice

Autoencoder (Training, Inference)

학습 및 추론

CUDA_VISIBLE_DEVICES=0 python3 Autoencoder.py --batch_size 128 --epochs 5 --latent_dims 10 --lr 1e-3 --use_gpu

사전학습 모델 추론

CUDA_VISIBLE_DEVICES=0 python3 Autoencoder.py --batch_size 128 --epochs 5 --latent_dims 10 --lr 1e-3 --use_gpu --pretrained

CycleGAN (Training, Inference)

학습

CUDA_VISIBLE_DEVICES=0 python main.py --batch_size 4 --epochs 2 --lr 2e-4 --sample_interval 100 --data_path ./horse2zebra --output_dir ./experiments_myname

추론 (이미지 생성)

CUDA_VISIBLE_DEVICES=0 python main.py --batch_size 1 --eval --output_dir ./experiment_ori

Download pre-trained weight: Dropbox link / Google Drive link

Stable Diffusion (Inference)

추론: StableDiffusion/std_inference.ipynb 내 실행 혹은

CUDA_VISIBLE_DEVICES=0 python std_inference.py

Frechet Inception Distance (FID)

계산

CUDA_VISIBLE_DEVICES=0 python fid_score.py /path/to/real/dataset /path/to/fake/dataset

Real Image 분포와 Fake Image 분포 간의 거리를 계산하여 Fake Image가 Real Image와 얼마나 가까운지 (얼마나 Real한지)에 대해 계산하는 metric

분포 간의 거리 계산이기 때문에 1:1 매칭되는 Target Image가 필요하지 않음

값이 작을수록 (0에 가까울수록) Fake Image가 Real Image와 비슷함

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

amoeba04 / bootcamp Goto Github PK

bootcamp's Introduction

이미지 생성모델 파헤치기

8/24(목) Practice

Autoencoder (Training, Inference)

CycleGAN (Training, Inference)

Stable Diffusion (Inference)

Frechet Inception Distance (FID)

bootcamp's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent