
pytorch-attention-guided-cyclegan's People

Contributors

alokwhitewolf


pytorch-attention-guided-cyclegan's Issues

some convergence questions

I wonder whether there is a better way to make this kind of attention-based generator converge.
Could you please share some training tricks you used while training this architecture? :)

Implementation and paper differences

Very clean code; however, I have found what I believe are differences between the paper and the code implementation in the model structure. Could you please explain why these differences exist?

  1. According to Appendix A, the last layer of the generator should be c3s1-3-T, but c7s1-3-T is used in the code.
  2. The second up-scaling layer in the attention network is commented out (and having it in would mean the following conv should have stride 2?).
  3. The resblocks do not seem to apply ReLU to the output. The paper does not say anything beyond "use resblock", but from what I know about resblocks, (out + x) should be passed through ReLU? (See the sketch after this list.)
  4. The s′_new part of equation (6) seems to be missing?
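
On point 3, a minimal sketch (not the repository's code) of a residual block that applies ReLU after the skip connection, the way the issue describes:

    import torch.nn as nn

    class ResBlock(nn.Module):
        """Sketch of a residual block where ReLU is applied to (out + x)."""
        def __init__(self, channels=256):
            super().__init__()
            self.block = nn.Sequential(
                nn.ReflectionPad2d(1),
                nn.Conv2d(channels, channels, kernel_size=3),
                nn.InstanceNorm2d(channels),
                nn.ReLU(inplace=True),
                nn.ReflectionPad2d(1),
                nn.Conv2d(channels, channels, kernel_size=3),
                nn.InstanceNorm2d(channels),
            )
            self.relu = nn.ReLU(inplace=True)

        def forward(self, x):
            # ReLU after the skip connection, i.e. relu(out + x)
            return self.relu(self.block(x) + x)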

Expected input channels mismatch

I get this error whenever I try to train the network with train.py:

Traceback (most recent call last):
  File ".\train.py", line 237, in <module>
    all()
  File ".\train.py", line 141, in all
    attnMapA = toZeroThreshold(AttnA(realA))
  File "C:\Users\User1\AppData\Local\Programs\Python\Python36\lib\site-packages\torch\nn\modules\module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "C:\Users\User1\Documents\Pytorch-Attention-Guided-CycleGAN\models.py", line 141, in forward
    return self.model(x)
  File "C:\Users\User1\AppData\Local\Programs\Python\Python36\lib\site-packages\torch\nn\modules\module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "C:\Users\User1\AppData\Local\Programs\Python\Python36\lib\site-packages\torch\nn\modules\container.py", line 92, in forward
    input = module(input)
  File "C:\Users\User1\AppData\Local\Programs\Python\Python36\lib\site-packages\torch\nn\modules\module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "C:\Users\User1\AppData\Local\Programs\Python\Python36\lib\site-packages\torch\nn\modules\conv.py", line 320, in forward
    self.padding, self.dilation, self.groups)
RuntimeError: Given groups=1, weight of size [32, 3, 7, 7], expected input[1, 4, 256, 256] to have 3 channels, but got 4 channels instead

It is also important to mention that I had to slightly modify the code for Windows multiprocessing compatibility. As described in the PyTorch Windows FAQ, I wrapped the whole training code in an all() function that is run like this:

if __name__ == '__main__':
    all()
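
The error says the first convolution expects 3 input channels but received 4, which usually means an image was loaded with an alpha channel (an RGBA PNG). A minimal sketch of one possible fix in the dataset's image loading, assuming PIL is used (the helper name is illustrative, not part of the repository):

    from PIL import Image

    def load_rgb(path):
        # Force 3 channels even for RGBA or grayscale images,
        # so the first conv layer (which expects 3 channels) is satisfied.
        return Image.open(path).convert('RGB')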

If I want to apply attention to the background

Hi,
Thank you for this work. Suppose I want the attention to focus on the background rather than the object. For example, both domain A and domain B contain horse images; when translating from A to B, I want to keep the same horse but have the background change instead. How can I do that? Thank you in advance.
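
One possible approach (a hedged sketch, not a supported option of the repository): invert the role of the learned mask when compositing, so the translated output fills the background and the attended object is copied from the input. The networks are passed in as arguments; the thresholding mimics the toZeroThreshold step used in train.py:

    import torch

    def translate_background(realA, attn_net, gen_a2b, threshold=0.1):
        """Sketch: keep the attended object from the input, translate only the rest."""
        attn = attn_net(realA)                      # (N, 1, H, W) mask in [0, 1]
        attn = (attn > threshold).float() * attn    # zero out small values, akin to toZeroThreshold
        genB = gen_a2b(realA)                       # translate the full image
        # Inverse composition: generator output goes to the background region,
        # the original (attended) object is kept from the input.
        return (1 - attn) * genB + attn * realA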

bug in the train script

Hi,

I believe there is a bug in this line

DisLossB = fakeTargetLoss(disB(genB)) + fakeTargetLoss(disB(genB_)) + 2*realTargetLoss(disA(realB))

in the last term, there should be disB instead of disA, right?
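
For reference, a sketch of the quoted line with the suggested change applied (same variable names as above; whether this matches the author's intent is exactly the question):

    DisLossB = fakeTargetLoss(disB(genB)) + fakeTargetLoss(disB(genB_)) + 2*realTargetLoss(disB(realB))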

I have a problem

Thank you for your implementation of Unsupervised Attention-guided Image-to-Image Translation. After studying your code, I found some differences between it and the description in the original paper.
Your code:
attnMapA = toZeroThreshold(AttnA(realA))
fgA = attnMapA * realA
bgA = (1 - attnMapA) * realA
genB = genA2B(fgA)
fakeB = (attnMapA * genB) + bgA
But in the original paper, after feeding the input image to the generator, the learned mask is applied to the generated image using an element-wise product '*', and then the background is added using the inverse of the mask applied to the input image:
attnMapA = toZeroThreshold(AttnA(realA))
fgA = attnMapA * realA
bgA = (1 - attnMapA) * realA
genB = genA2B(realA)
fakeB = (attnMapA * genB) + bgA
According to the original paper, it should be genB = genA2B(realA) rather than genB = genA2B(fgA), and then fakeB = (attnMapA * genB) + bgA.
Could you tell me why you implemented the code this way?
Finally, please forgive my bad English if it annoys you. =。=!

Test Image

Hi,
How can we get test images?
Thanks in advance.
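
A minimal testing sketch, assuming already-trained generator and attention networks are passed in; the function and argument names are illustrative, not the repository's actual test script:

    import torch
    from PIL import Image
    from torchvision import transforms
    from torchvision.utils import save_image

    def test_single_image(img_path, genA2B, attnA, out_path, threshold=0.1):
        """Translate one image from domain A to B with a trained generator and attention net."""
        preprocess = transforms.Compose([
            transforms.Resize(256),
            transforms.CenterCrop(256),
            transforms.ToTensor(),
        ])
        realA = preprocess(Image.open(img_path).convert('RGB')).unsqueeze(0)

        genA2B.eval()
        attnA.eval()
        with torch.no_grad():
            attn = attnA(realA)
            attn = (attn > threshold).float() * attn     # zero-threshold the mask
            genB = genA2B(attn * realA)                  # translate the attended foreground
            fakeB = attn * genB + (1 - attn) * realA     # composite with the input background
        save_image(fakeB, out_path)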
