Training plans?

Question

I've got a bunch of compute the next couple weeks and thinking to train this on LAION.

nbardy · Answer

<blockquote><a class="user-mention notranslate" data-hovercard-type="user" data-hover

francqz31 · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

Definitely most interested in training the upscaler.

francqz31 · Answer

<blockquote><a class="user-mention notranslate" data-hovercard-type="user" data-hover

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

Exiting progress.

Trying to start some jobs this week and there is n

francqz31 · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

ok, got the unet upsampler to a decent place, will move onwards to unconditional train

lucidrains · Answer

Haha yeah, they are busy training Gemini I heard

No worries, take yo

nbardy · Answer

Alright we've got some other preview chips now(I think their existence is under NDA ri

nbardy · Answer

I’ll be on a long weekend break. I can take a look at an upsampler training nex

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

Exciting, I have cleared my schedule tomorrow and next week to work only on training t

nbardy · Answer

Awesome!Sorry, I have not been working this weekend. Laying down to re

lucidrains · Answer

ok, finished the text-conditioning logic for both base and upsampler

nbardy · Answer

512 TPUv4 from a google startup grant.

Didn't get any response in LA

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

francqz31 · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

francqz31 · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

Happy to jump in and help.

How up to date is the TODO list? You ment

nbardy · Answer

🥳

nbardy · Answer

Won’t be back until Wednesday actually

nbardy · Answer

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubu

francqz31 · Answer

That might be the cutest dog ever , look at him laying on the bed knowing he is a good

nbardy · Answer

Okay great, I'm seeing the Generator has cross attent

nbardy · Answer

They don't indicate which upscaler was used in the paper for which samples.

nbardy · Answer

Thanks for the update. Code looks great.

nbardy · Answer

re: other project - there's been a small breakthrough leading to a few SO

nbardy · Answer

Thanks for the updates.

I split off the distributed train and

nbardy · Answer

Yea can you email me your signal. Just waking up will start work in a few hours

nbardy · Answer

:) Exciting. Managed to cancel all my meetings this week.

have you t

nbardy · Answer

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubusercontent.com

francqz31 · Answer

Based on the paper, it seems that GigaGAN uses separate text encoders for the generato

nbardy · Answer

Reading through training details. Some notes on datasets and models size from the pape

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

ohh sweet, though you probably should do it in jax? or has the state of pytorch xla im

lucidrains · Answer

are you doing a startup? or working for a new one?

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

francqz31 · Answer

it is more than enough that you are willing to train the Upsampler. it is not an easy

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

didn't get to it this weekend 😢 caught up with some TTS work and Pride celebrations<

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

ok, let us reconvene on this Monday then

lucidrains · Answer

haha or Wednesday, whenever you are free

i'll take my time here then

nbardy · Answer

It's unclear to me in the paper how the ImageNet superRes model and text conditioned

nbardy · Answer

In addition,
for more controlled comparison, we train our model on

nbardy · Answer

Getting up to speed with the code today. Feel like I understood most of it.

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

oh yup, text conditioning would still make sense for low res upscaling, let me aim to

nbardy · Answer

Going to try and get training code running tomorrow and try to get the unconditioned o

nbardy · Answer

Tried to add text conditioning to the Upscaler this evening. Seems like it should just

lucidrains · Answer

Going to try and get training code running tomorrow and try to get the un

lucidrains · Answer

re: other project - there's been a small breakthrough leading to a few SOTAs in the ge

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

Is it obvious to you what is a big contribution in this paper?

Seems

nbardy · Answer

Do you think LION will fail on a smaller model?

Looking at try a few

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

Is it obvious to you what is a big contribution in this paper?

lucidrains · Answer

the truth is, any of these concepts would benefit DDPMs as well.. but let's just keep

lucidrains · Answer

made a tiny bit of progress; unfortunately unconditional image synthesis didn't work o

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

revisiting all this complicated GAN training code, all I can say is, thank god for den

lucidrains · Answer

hmm, no there's still something wrong, training blows up, even when i add gradient pen

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

ok cool! yeah I'll resume trying to debug the system tomorrow morning

nbardy · Answer

I can try testing the discriminators as classifiers today.

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubu

lucidrains · Answer

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubu

lucidrains · Answer

bug is probably in discriminator somewhere, let me throw a few hours this morning at t

lucidrains · Answer

hey no worries, rest up!

will need to move on to some other work mid

lucidrains · Answer

good news, have gigagan training using the lightweight gan peripheral training code. l

lucidrains · Answer

ok further good news, validated multi-scale inputs and scale invariant code + skip lay

lucidrains · Answer

training is now stable in the main repo, even without reconstruction loss 👌 turns ou

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

lucidrains · Answer

yea, it is working with the multiscale logits being involved, but the loss is very roc

lucidrains · Answer

ok, once i took out the gradient penalty contributions for multi-scale logits, trainin

lucidrains · Answer

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubu

nbardy · Answer

I'm able to get the upscaler and base gan running locally.

Getting a

lucidrains · Answer

I'm able to get the upscaler and base gan running locally.

lucidrains · Answer

<blockquote><a target="_blank" rel="noopener noreferrer nofollow" href="https://user-images.githubu

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

nbardy · Answer

I was not able to find the t_local and t_global sizes in the paper.

lucidrains · Answer

will also aim to get the eval for both base and upsampler done, using what <a class="u

lucidrains · Answer

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Training plans? about gigagan-pytorch HOT 122 CLOSED

Comments (122)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent