Giter Club home page Giter Club logo

Comments (8)

oneiroid avatar oneiroid commented on July 2, 2024 7

Karras implemented projecting to latent space right there in the original repo yo. Don't think it can be improved any more. He's using tiled (1, 512) dlatent though, I suppose as a preemptive measure against censorship activists. Made a couple of changes to project to (18, 512) space: https://github.com/oneiroid/stylegan2

image0001-step2000
image0001-target

from stylegan-encoder.

rolux avatar rolux commented on July 2, 2024 7

In the meantime, I've done two things: ported this repo to StyleGAN2 (link), at least up to the point where the tutorial notebook runs; and, in my own port of Puzer's encoder (link), added a project_images.py equivalent to encode_images.py.

My impression is that there is a lot of cool stuff in here that would be great to have in StyleGAN2. On the other hand, from a "user" - as opposed to "researcher" - perspective, a two-line change to the projector, plus a bit of glue around it, seems kind of hard to beat.

from stylegan-encoder.

pbaylies avatar pbaylies commented on July 2, 2024 1

@ofirkris I've thought about this - this would be better done as a fork of StyleGAN2. Anyone would be welcome to do this in a new repo, currently I'm taking a break from active development over the holidays.

from stylegan-encoder.

rolux avatar rolux commented on July 2, 2024 1

Karras implemented projecting to latent space right there in the original repo yo. Don't think it can be improved any more. He's using tiled (1, 512) dlatent though, I suppose as a preemptive measure against censorship activists. Made a couple of changes to project to (18, 512) space: https://github.com/oneiroid/stylegan2

@oneiroid: Uh, you're totally right!

Ginevra de Benci_01

Ginevra de Benci_01

Optimization video: https://youtu.be/DqOXWb1fvHk

This looks pretty good to me!

from stylegan-encoder.

oneiroid avatar oneiroid commented on July 2, 2024 1

Hi @woctezuma, I positively wouldn't mind explaining :))
Using (1, 512) dlatents you will never get good projection for any face that was not present in the training dataset.
Using (18, 512) dlats you can project any face. And then you can apply facial attributes directions - emotions, age, gender.
Even celebrity faces! And them celebrities got their faces copyrighted and sold for good money.
So there could've been deepfakes-style hysteria and calls to forbid everything starting from GANs down to internet and propagation of light reflected off their precioussssssss privately owned faces.
Karras and Nvidia do not need such sick activity around them. I hope I made myself clear. I'm a bit distressed recently by all the sick things happening, tried my best :)

from stylegan-encoder.

wduo avatar wduo commented on July 2, 2024

@oneiroid Did you run python align_images.py before python encode_images.py?

from stylegan-encoder.

woctezuma avatar woctezuma commented on July 2, 2024

Karras implemented projecting to latent space right there in the original repo yo. Don't think it can be improved any more. He's using tiled (1, 512) dlatent though, I suppose as a preemptive measure against censorship activists. Made a couple of changes to project to (18, 512) space: https://github.com/oneiroid/stylegan2

Hello @oneiroid, would you mind explaining what you meant as "preemptive measure against censorship activists"? I am trying to understand this (18, 512) change (rolux/stylegan2encoder#21), and so far, my understanding is that the visual fidelity is better but the semantic fidelity (for interpolation, etc.) is not. I assume that is the reason why Nvidia's implementation uses (1, 512). I would be curious about your other explanation.

Edit: I have found these related issues (#1, #2), including one created by you.

from stylegan-encoder.

woctezuma avatar woctezuma commented on July 2, 2024

Thank you for clarifying. It helps me in my understanding of this change!

from stylegan-encoder.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.