lyttonhao / neural-style-mmd Goto Github PK

MXNet Code For Demystifying Neural Style Transfer (IJCAI 2017)

Python 95.58% Shell 4.42%

neural-style maximum-mean-discrepancy batch-normalization

neural-style-mmd's Issues

How do you set the bandwidth in Gaussian Kernel

Hi, I wonder how do you set the bandwidth parameter of Gaussian Kernel here.
https://github.com/lyttonhao/Neural-Style-MMD/blob/master/mmd_loss.py#L65
Can you give me an explanation? Thank you!

Have you ever tried the setting of --bn_loss?

I have tried the bn-loss but it always returns nan gradient and loss. So I want to know what's wrong with it.

python version problem

neural-style.py line 270
print np.prod(content_array[0].shape)

The paper might has a mistake

In the paper "Demystifying Neural Style Transfer", there might be a mistake, which will make Equation (8) incorrect.

For a layer L (in the paper, the authors used the lowcase L) in the loss network, NL is the number of feature maps in layer L. All the feature maps in layer L have the same size for a given input image.

Given different input images of different sizes, the size of those feature maps at the same layer will be different. For example, if the style image is 512x512 and the content image is 256x256, the size of a feature map of the style image at layer 4_2 (use VGG-19 as an example) will be 4 times of the feature map of the content image at layer 4_2.

On the right column of page 2 of the paper, ML is the size of a feature map at layer L for the content image and the generated image. For the style image, the size of a feature map at layer L typically is different. Therefore, the size of matrix to save the activations of the style image at layer L cannot be NL x ML.

If my understanding is correct, then the deduction in Equation (8) is incorrect.

confusion in Linear Kernel Loss functions

I'm trying to understand the code but unable to understand your loss calculation function can you please explain what are you doing because it doesn't seem you are doing anything in linear kernel you have commented it out in mmd_loss.py

mmd_loss line 43

Hi, in line 43 of mmd_loss.py, you wrote dot(x, x.T), I think it should be dot(x.T, x), correct?

Batch Processing

Is there anyway to implement processing folders full of multiple files(For video)
Maybe even do them in order like other neuralstyle transfer implemations do

gnorm = mx.nd.norm(model_executor.data_grad).asscalar()

error happens when running at "gnorm = mx.nd.norm(model_executor.data_grad).asscalar()" in /mnt/d/mahao/codes/Neural-Style-MMD/neural-style.py:

MXNetError: Check failed: reinterpret_cast( params.info->callbacks[kCustomOpForward])( ptrs.size(), const_cast<void**>(ptrs.data()), const_cast<int*>(tags.data()), reinterpret_cast<const int*>(req.data()), static_cast(ctx.is_train), params.info->contexts[kCustomOpForward]):

Why cannot get good results with Gaussian kernel?

Thxs for sharing the codes. However the experiments with Gaussian kernel seems terrible.

this is with style weight 2.0.

lyttonhao / neural-style-mmd Goto Github PK

neural-style-mmd's Issues

How do you set the bandwidth in Gaussian Kernel

Have you ever tried the setting of --bn_loss?

python version problem

The paper might has a mistake

confusion in Linear Kernel Loss functions

mmd_loss line 43

Batch Processing

gnorm = mx.nd.norm(model_executor.data_grad).asscalar()

Why cannot get good results with Gaussian kernel?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent