The gigagan from jiauzhang

gigagan's Issues

Can the code currently train to convergence?

I found that adding the cross-attention module makes training unstable: the generator (G) loss rises until it becomes NaN.
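When a loss diverges like this, a common first step is to record the exact step at which it first becomes non-finite, so the preceding checkpoint and batch can be inspected. The following is a minimal, framework-agnostic sketch and is not taken from the repository; the class name `LossGuard` and the sample loss values are hypothetical.

```python
import math


class LossGuard:
    """Tracks a scalar loss and remembers the first non-finite value seen."""

    def __init__(self, name):
        self.name = name            # label only, e.g. "G" or "D" (hypothetical)
        self.step = 0               # number of values checked so far
        self.first_bad_step = None  # index of the first NaN/inf, if any
        self.last_finite = None     # most recent finite loss value

    def check(self, value):
        """Return True if the value is finite and the update may proceed."""
        value = float(value)        # also accepts e.g. the result of loss.item()
        ok = math.isfinite(value)
        if ok:
            self.last_finite = value
        elif self.first_bad_step is None:
            self.first_bad_step = self.step
        self.step += 1
        return ok


# Made-up loss curve that blows up, for illustration only.
guard = LossGuard("G")
skipped = 0
for loss in [0.9, 1.4, 7.5, float("nan"), float("nan")]:
    if not guard.check(loss):
        # In a real training loop one would skip the optimizer step here
        # and dump the offending batch for inspection.
        skipped += 1

print(guard.name, guard.first_bad_step, guard.last_finite, skipped)
```

In a PyTorch loop the value passed to `check` would typically be `loss.item()`. Whether skipping the step, lowering the learning rate, or clipping gradients actually stabilizes the cross-attention module is not something this sketch can answer.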

How can I contribute to your work

Hello, great effort on this re-implementation.
Would you need GPU resources / other assistance to complete your work? How far are you from achieving the paper results for the upscaler?

CLIP loss?

Hello, I only see the definition of the CLIP loss in clip.py; I could not find where it is actually used. The paper, however, states that this loss is used.

training new SG model with our own CLIP "captions"

Hello @JiauZhang - I was on StyleClips repo and we were trying to confirm something.

I need to train a new SG model, with our own CLIP "captions" and not use LAION or anything else.

So, my alpha test case was to build a new SG model with images of Men and Women.

My question is: did you train a new model with your own "captions", or how are you adding new keywords/captions?

In our case we have our own internal nomenclature which we must train into some medical images for a use-case, and we have to figure out how to do this.

Inference fails when running the generate.py script

The error is "IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)", raised at the statement images = g(z, text_embeds)[0].

rahulbhalley@192 GigaGAN % python3 generate.py
Traceback (most recent call last):
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/generate.py", line 17, in <module>
    images = g(z, text_embeds)[0]
             ^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/model.py", line 140, in forward
    styles = [self.style(s) for s in styles]
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/model.py", line 140, in <listcomp>
    styles = [self.style(s) for s in styles]
              ^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/container.py", line 217, in forward
    input = module(input)
            ^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/layers.py", line 11, in forward
    return input * torch.rsqrt(torch.mean(input ** 2, dim=1, keepdim=True) + 1e-8)
                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
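The message says the tensor reaching the normalization in layers.py has only one dimension, so dim=1 does not exist. One possible cause, which the traceback alone cannot confirm, is that `styles` is a single 2-D tensor rather than a list of tensors, so iterating over it yields 1-D rows. The sketch below reproduces the error with the same expression and shows that adding a batch dimension avoids it; the function name `pixel_norm` and the latent size 512 are assumptions made for illustration.

```python
import torch


def pixel_norm(x, eps=1e-8):
    # Same expression as the failing line in the traceback: normalize over dim=1.
    return x * torch.rsqrt(torch.mean(x ** 2, dim=1, keepdim=True) + eps)


# Iterating over a 2-D tensor yields 1-D rows (assumed shape: 4 x 512).
rows = [s for s in torch.randn(4, 512)]
row_dims = [s.dim() for s in rows]

# A 1-D input has only dims -1 and 0, so reducing over dim=1 fails.
try:
    pixel_norm(rows[0])
    raised = False
except IndexError:
    raised = True

# Restoring the batch dimension gives shape (1, 512), and dim=1 is valid again.
out = pixel_norm(rows[0].unsqueeze(0))
print(row_dims, raised, tuple(out.shape))
```

If this is indeed the cause, either passing the latent as a list (for example `[z]`) or adding the batch dimension before the call should resolve it, depending on what the forward method in model.py expects.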
