
morph-net's Introduction

MorphNet: Fast & Simple Resource-Constrained Learning of Deep Network Structure

[TOC]

New: FiGS: Fine-Grained Stochastic Architecture Search

FiGS is a probabilistic approach to channel regularization that we introduced in Fine-Grained Stochastic Architecture Search. It outperforms our previous regularizers and can be used either as a pruning algorithm or as a full-fledged Differentiable Architecture Search method. This is the recommended way to apply MorphNet. In the documentation below it is referred to as the LogisticSigmoid regularizer.

What is MorphNet?

MorphNet is a method for learning deep network structure during training. The key principle is continuous relaxation of the network-structure learning problem. In short, the MorphNet regularizer pushes the influence of filters down, and once they are small enough, the corresponding output channels are marked for removal from the network.

Specifically, activation sparsity is induced by adding regularizers that target the consumption of specific resources such as FLOPs or model size. When the regularizer loss is added to the training loss and their sum is minimized via stochastic gradient descent or a similar optimizer, the learning problem becomes a constrained optimization of the structure of the network, under the constraint represented by the regularizer. The method was first introduced in our CVPR 2018 paper "MorphNet: Fast & Simple Resource-Constrained Learning of Deep Network Structure". An overview of the approach, as well as device-specific latency regularizers, was presented at GTC 2019 [slides, recording: YouTube, GTC on-demand]. Our new, probabilistic approach to pruning is called FiGS, and is detailed in Fine-Grained Stochastic Architecture Search.

Usage

Suppose you have a working convolutional neural network for image classification but want to shrink the model to satisfy some constraints (e.g., memory, latency). Given an existing model (the “seed network”) and a target criterion, MorphNet will propose a new model by adjusting the number of output channels in each convolution layer.

Note that MorphNet does not change the topology of the network -- the proposed model will have the same number of layers and connectivity pattern as the seed network.

To use MorphNet, you must:

  1. Choose a regularizer from morphnet.network_regularizers. The choice is based on

    • your target cost (e.g., FLOPs, latency), and
    • your ability to add new layers to your model:
      • If you can add layers, add our probabilistic gating operation after any layer you wish to prune and use the LogisticSigmoid regularizers [recommended].
      • If you are unable to add new layers, select the regularizer type based on your network architecture: use the Gamma regularizer if the seed network has BatchNorm; use GroupLasso otherwise [deprecated].

    Note: If you use BatchNorm, you must enable the scale parameters (“gamma variables”), e.g., by setting scale=True if you are using tf.keras.layers.BatchNormalization.

    Note: If you are using LogisticSigmoid, don't forget to add the probabilistic gating op! See the example below.

  2. Initialize the regularizer with a threshold and the output boundary ops and (optionally) the input boundary ops of your model.

    The MorphNet regularizer crawls your graph starting from the output boundary and applies regularization to some of the ops it encounters. When it encounters any of the input boundary ops, it does not crawl past them (the ops in the input boundary are not regularized). The threshold determines which output channels can be eliminated.

  3. Add the regularization term to your loss.

    As always, the regularization loss must be scaled. We recommend searching for the scaling hyperparameter (regularization strength) along a logarithmic scale spanning a few orders of magnitude around 1/(initial cost); a sketch of this sweep appears in the Examples section below. For example, if the seed network starts at 1e9 FLOPs, explore regularization strengths around 1e-9.

    Note: MorphNet does not currently add the regularization loss to the tf.GraphKeys.REGULARIZATION_LOSSES collection; this choice is subject to revision.

    Note: Do not confuse get_regularization_term() (the loss you should add to your training) with get_cost() (the estimated cost of the network if the proposed structure is applied).

  4. Train the model.

    Note: We recommend using a fixed learning rate (no decay) for this step, though this is not strictly necessary.

  5. Save the proposed model structure with the StructureExporter.

    The exported files are in JSON format. Note that as the training progresses, the proposed model structure will change. There are no specific guidelines on the stopping time, although you would likely want to wait for the regularization loss (reported via summaries) to stabilize.

  6. (Optional) Create summary ops to monitor the training progress through TensorBoard.

  7. Modify your model using the StructureExporter output.

  8. Retrain the model from scratch without the MorphNet regularizer.

    Note: Use the standard values for all hyperparameters (such as the learning rate schedule).

  9. (Optional) Uniformly expand the network to adjust the accuracy vs. cost trade-off as desired (see the sketch below). Alternatively, this step can be performed before the structure-learning step.
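
As a rough illustration, uniform expansion can be done by scaling every channel count from the exported structure by a width multiplier. A minimal sketch, assuming the exported file is a JSON mapping of layer names to alive counts as written by the StructureExporter (the file path and multiplier here are illustrative):

import json

width_multiplier = 1.5  # hypothetical expansion factor

# StructureExporter writes the alive counts as JSON (see step 5).
with open('learned_structure/alive') as f:
  alive_counts = json.load(f)

# Scale every layer width uniformly; use these counts when rebuilding the model.
expanded = {layer: max(1, int(round(count * width_multiplier)))
            for layer, count in alive_counts.items()}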

We refer to the first round of training as structure learning and the second round as retraining.

To summarize, the key hyperparameters for MorphNet are:

  • Regularization strength
  • Alive threshold

Note that the regularizer type is not a hyperparameter because it's uniquely determined by the metric of interest (FLOPs, latency) and the presence of BatchNorm.

Regularizer Types

Regularizer classes can be found under the network_regularizers/ directory. They are named after the algorithm they use and the target cost they attempt to minimize. For example, LogisticSigmoidFlopsRegularizer uses the Logistic-Sigmoid probabilistic method to regularize the FLOP cost, and GammaModelSizeRegularizer uses the batch norm gammas to regularize the model-size cost.
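
For instance, the two classes named above would be instantiated roughly as follows (a sketch; logits stands for your model's output op, and the constructor arguments follow the usage shown in the examples and issues below):

from morph_net.network_regularizers import flop_regularizer
from morph_net.network_regularizers import model_size_regularizer

# Algorithm = LogisticSigmoid, target cost = FLOPs.
flops_regularizer = flop_regularizer.LogisticSigmoidFlopsRegularizer(
    output_boundary=[logits.op], alive_threshold=0.1)

# Algorithm = Gamma (batch norm gammas), target cost = model size.
size_regularizer = model_size_regularizer.GammaModelSizeRegularizer(
    [logits.op], gamma_threshold=1e-3)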

Regularizer Algorithms

  • [NEW] LogisticSigmoid is designed to control any model type, but requires adding simple gating layers to your model.
  • GroupLasso is designed for models without batch norm.
  • Gamma is designed for models with batch norm; it requires that batch norm scale is enabled.

Regularizer Target Costs

  • Flops targets the FLOP count of the inference network.
  • Model Size targets the number of weights of the network.
  • Latency optimizes for the estimated inference latency of the network, based on specific hardware characteristics (see the sketch below).
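
For example, a latency regularizer takes a hardware identifier that selects the device model behind the latency estimate. A sketch based on the usage reported in the issues below ('K80' and 'FLOP_LATENCY' are the identifiers that appear there; logits stands for your model's output op):

from morph_net.network_regularizers import latency_regularizer

# Gamma algorithm, latency target; `hardware` picks the device whose
# characteristics drive the latency estimate.
network_regularizer = latency_regularizer.GammaLatencyRegularizer(
    [logits.op], gamma_threshold=1e-3, hardware='K80')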

Examples

Adding a FLOPs Regularizer

The example below demonstrates how to use MorphNet to reduce the number of FLOPs in your model. In this example, the regularizer will traverse the graph starting from logits, and will not go past any op that is earlier in the graph than the inputs or labels; this lets you specify the subgraph for MorphNet to optimize.

import tensorflow as tf

from morph_net.network_regularizers import flop_regularizer
from morph_net.probabilistic_gating import activation_gating  # gating op used below; module path may vary by version
from morph_net.tools import structure_exporter

def build_model(inputs, labels, is_training, ...):
  gated_relu = activation_gating.gated_relu_activation()

  net = tf.layers.conv2d(inputs, filters=256, kernel_size=[5, 5])
  net = gated_relu(net, is_training=is_training)

  ...
  ...

  net = tf.layers.conv2d(net, filters=1024, kernel_size=[3, 3])
  net = gated_relu(net, is_training=is_training)

  logits = tf.reduce_mean(net, [1, 2])
  logits = tf.layers.dense(logits, units=1024)
  return logits

inputs, labels = preprocessor()
logits = build_model(inputs, labels, is_training=True, ...)

network_regularizer = flop_regularizer.LogisticSigmoidFlopsRegularizer(
    output_boundary=[logits.op],
    input_boundary=[inputs.op, labels.op],
    alive_threshold=0.1  # Value in [0, 1]. This default works well for most cases.
)
regularization_strength = 1e-10
regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)

model_loss = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=labels, logits=logits))

optimizer = tf.train.MomentumOptimizer(learning_rate=0.01, momentum=0.9)

train_op = optimizer.minimize(model_loss + regularizer_loss)

You should monitor the progress of structure-learning training via TensorBoard. In particular, consider adding summaries that report the current MorphNet regularization loss and the cost of the network if the currently proposed structure were adopted.

tf.summary.scalar('RegularizationLoss', regularizer_loss)
tf.summary.scalar(network_regularizer.cost_name, network_regularizer.get_cost())

[Figure: TensorBoard display of the FLOP cost during training.]

Larger values of regularization_strength will converge to a smaller effective FLOP count. If regularization_strength is large enough, the FLOP count will collapse to zero; conversely, if it is small enough, the FLOP count will remain at its initial value and the network structure will not change. The regularization_strength parameter is your knob for choosing where you want to be on the price-performance curve. The alive_threshold parameter determines when an activation is considered alive.
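
A minimal sketch of the recommended hyperparameter search, assuming the seed network costs about 1e9 FLOPs (train_with_strength is a hypothetical helper that runs one structure-learning training):

import numpy as np

initial_cost = 1e9  # e.g., the FLOP count of the seed network
center = 1.0 / initial_cost

# Sweep a few orders of magnitude around 1/(initial cost): 1e-11 ... 1e-7 here.
for regularization_strength in center * np.logspace(-2, 2, num=5):
  train_with_strength(regularization_strength)  # hypothetical helper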

Extracting the Architecture Learned by MorphNet

During training, you should save a JSON file that contains the learned structure of the network, that is, the number of activations in each layer kept alive (as opposed to removed) by MorphNet.

exporter = structure_exporter.StructureExporter(
    network_regularizer.op_regularizer_manager)

with tf.Session() as sess:
  tf.global_variables_initializer().run()
  for step in range(max_steps):
    _, structure_exporter_tensors = sess.run([train_op, exporter.tensors])
    if step % 1000 == 0:
      exporter.populate_tensor_values(structure_exporter_tensors)
      exporter.create_file_and_save_alive_counts(train_dir, step)
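
To apply the result (step 7 above), one option is to read the exported JSON back and use the alive counts as the new layer widths. A minimal sketch, assuming the exported file maps layer names to alive counts; the layer name 'conv1' is hypothetical:

import json
import os

with open(os.path.join(train_dir, 'learned_structure/alive')) as f:
  alive_counts = json.load(f)

def build_pruned_model(inputs):
  # Set each layer's width from the learned counts, then retrain from scratch.
  net = tf.layers.conv2d(
      inputs, filters=alive_counts['conv1'], kernel_size=[5, 5])
  ...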

Misc

Contact: [email protected]


morph-net's Issues

as_list() is not defined on an unknown TensorShape.

Hi, I got a problem when I used the VGG. The code is as follows:

def vgg_a(inputs, num_classes=10, is_training=True, dropout_keep_prob=0.5, spatial_squeeze=True,
          scope='vgg_a', fc_conv_padding='SAME', global_pool=False):
    .........
    net = slim.conv2d(net, 512, [3, 3], scope='conv5_2')
    net = slim.max_pool2d(net, [2, 2], scope='pool5')
    net = slim.conv2d(net, 4096, [1, 1], padding=fc_conv_padding, scope='fc6')
    net = slim.dropout(net, dropout_keep_prob, is_training=is_training, scope='dropout6')
    net = slim.conv2d(net, 4096, [1, 1], scope='fc7')
    if num_classes:
        net = slim.dropout(net, dropout_keep_prob, is_training=is_training, scope='dropout7')
        net = slim.conv2d(net, num_classes, [1, 1], activation_fn=None, normalizer_fn=None, scope='fc8')
    return net

if __name__ == '__main__':
    train_x,train_y,test_x,test_y = prepare_data()

    x = tf.placeholder(tf.float32,[None,image_size,image_size,3])
    y_ = tf.placeholder(tf.float32,[None,class_num])
    dropout_keep_prob = tf.placeholder(tf.float32)
    learning_rate = tf.placeholder(tf.float32)
    is_training = tf.placeholder(tf.bool)

    logits,_ = vgg_a(inputs=x, dropout_keep_prob=dropout_keep_prob)

    network_regularizer = flop_regularizer.GroupLassoFlopsRegularizer([logits.op], threshold=1e-3)
    regularization_strength = 1e-7
    regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)
    
    .........
   _,batch_loss = sess.run([train_step,cross_entropy],feed_dict={
                    x:batch_x, y_:batch_y, dropout_keep_prob:dropout_rate, learning_rate:lr,
                    is_training: True})
   batch_acc = accuracy.eval(feed_dict={x:batch_x, y_:batch_y, dropout_keep_prob: 1.0,
                                                     is_training: True})

[screenshot of the error]

I would like to ask what causes this error. Is dropout not supported? Calling "logits,_ = vgg_a(inputs=x)" works fine, but dropout requires different parameters during training and accuracy evaluation. Is there any solution?
Thanks!

MorphNet not working on Google Colab

In my experiments, when I tried to use MorphNet on Google Colab there were no changes in the learned structures, while running the same code on an RTX 8000 was able to optimize the network. Does MorphNet not work on Google Colab, or is there a restriction on batch size for MorphNet to work?

How to display FLOPs and regularizer_loss in TensorBoard

Hi, thanks for open sourcing.
I added these two lines of code:

tf.summary.scalar('RegularizationLoss', regularizer_loss)
tf.summary.scalar(network_regularizer.cost_name, network_regularizer.get_cost())

but I cannot observe them in TensorBoard.
Does it still require some other operations?
Thanks for the answer.

Possibility of adding a working demo code

Hello,
Thank you for such wonderful work and for open-sourcing it. However, if working demo code were added, naive users like me would find the library more comfortable to use. Could you please add some working demo code for the experiments mentioned in the paper, using some seed network?

How can I use flop_regularizer on a MobileNetV2 model (in tf.slim)?

Hi, thank you very much for this awesome project!

I used MorphNet on MobileNetV2 and got an error like:
"RuntimeError: OpRegularizerManager could not handle ops: [u'MobilenetV2/Logits/Dropout/dropout_1/Shape (Shape)', u'MobilenetV2/Logits/AvgPool (AvgPool)']"

How can I solve this error?

Problem with FLOPs

Hi, thanks for sharing this as open source.
Actually, I got a problem with FLOPs when training resnet_cifar_10 with PocketFlow.
I tried several combinations of regularization_strength and gamma_threshold.
I want to ask if this is correct.

Code for my implementation above is provided below:


  # data input pipeline
  with tf.variable_scope(self.data_scope):
    iterator = self.build_dataset_train() if is_train else self.build_dataset_eval()
    images, labels = iterator.get_next()
    if not isinstance(images, dict):
      tf.add_to_collection('images_final', images)
    else:
      tf.add_to_collection('images_final', images['image'])
  # model definition - distilled model
  if self.enbl_dst:
    logits_dst = self.helper_dst.calc_logits(sess, images)
  

  # model definition - primary model
  with tf.variable_scope(self.model_scope):
    # forward pass
    if is_train and self.forward_w_labels:
      logits = self.forward_train(images, labels)
    else:
      logits = self.forward_train(images) if is_train else self.forward_eval(images)
    if not isinstance(logits, dict):
      tf.add_to_collection('logits_final', logits)
    else:
      for value in logits.values():
        tf.add_to_collection('logits_final', value)
        
    
    # loss & extra evalution metrics
    loss, metrics = self.calc_loss(labels, logits, self.trainable_vars)
    if self.enbl_dst:
      loss += self.helper_dst.calc_loss(logits, logits_dst)
    tf.summary.scalar('loss', loss)
    for key, value in metrics.items():
      tf.summary.scalar(key, value)

    # optimizer & gradients
    if is_train:
      self.global_step = tf.train.get_or_create_global_step()
      lrn_rate, self.nb_iters_train = self.setup_lrn_rate(self.global_step)
      
      network_regularizer = flop_regularizer.GammaFlopsRegularizer(
        output_boundary = [logits.op],
        input_boundary=[images.op, labels.op],
        gamma_threshold=8.8e-1 # 
      )#
      exporter = structure_exporter.StructureExporter(network_regularizer.op_regularizer_manager)
      regularization_strength = 1e-8 #
      regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)#
      model_loss = tf.nn.sparse_softmax_cross_entropy_with_logits(logits = logits, labels= tf.argmax(tf.cast(labels, dtype=tf.int32),1))#
      optimizer = tf.train.MomentumOptimizer(lrn_rate, FLAGS.momentum)
      train_op = optimizer.minimize(model_loss + regularizer_loss)#
      tf.summary.scalar('RegularizationLoss', regularizer_loss)#
      tf.summary.scalar(network_regularizer.cost_name, network_regularizer.get_cost()) 
      tf.summary.FileWriter(FLAGS.log_dir, graph=tf.get_default_graph())
      exporter = structure_exporter.StructureExporter(network_regularizer.op_regularizer_manager) 
      
      sess.run(tf.global_variables_initializer())
      for step in range(FLAGS.steps):
        _, structure_exporter_tensors = sess.run([train_op, exporter.tensors])
        if (step % 1000 == 0):
          exporter.populate_tensor_values(structure_exporter_tensors)
          exporter.create_file_and_save_alive_counts(FLAGS.log_dir, step)

How to deal with regularizer_loss becoming NaN?

Hi, thanks for open sourcing.
I met a problem: the regularizer_loss is NaN.
The code is ResNet-50 in TensorFlow,
the goal is classification on CIFAR-100,
and the regularizer used is GammaFlopsRegularizer.
I tried multiple regularization_strength values, but it is still NaN.
What should I do to solve this problem? Thanks for your answer.

Questions regarding TF.Keras fine-tuned model and support for TF2.0

Hi there, first of all, excellent work. I am curious if there are any examples available on using it with a tf.keras model that uses transfer learning (fine-tuning) with a pre-trained model such as ResNet or Inception V3? And what is the current support for TensorFlow 2.0? I saw that the example code section still uses sessions.

Error when training network using GammaLatencyRegularizer

Hi, I've been trying to train a simple model on MNIST to test out the GammaLatencyRegularizer. Since I'm testing this on CPU, I'm passing FLOP_LATENCY as the hardware argument. However, it gives me the following error:

Traceback (most recent call last):
  File "gamma_prune.py", line 43, in <module>
    network_regularizer = latency_regularizer.GammaLatencyRegularizer([out.op], gamma_threshold=1e-3, hardware='FLOP_LATENCY')
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/network_regularizers/latency_regularizer.py", line 92, in __init__
    force_group=force_group, regularizer_blacklist=regularizer_blacklist)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_regularizer_manager.py", line 132, in __init__
    ['%s (%s)' % (o.name, o.type) for o in self._op_deque])
RuntimeError: OpRegularizerManager could not handle ops: ['base/Flatten/flatten/Reshape (Reshape)']

However, when I train the same model using GammaFlopsRegularizer it seems to work fine. Any comment on this issue?

Also, here is my network definition:

def base_model(x_ph, is_training_ph, scope, channels=[32, 64, 64], reuse=False):
    norm_params = {'is_training': False, 'scale': True, 'center': False}
    # Network Definition
    with tf.variable_scope(scope, reuse=reuse):
        with slim.arg_scope([slim.conv2d, slim.fully_connected],
                      normalizer_fn=slim.batch_norm,
                      normalizer_params=norm_params,
                      weights_initializer=tf.truncated_normal_initializer(0.0, 0.01),
                      weights_regularizer=slim.l2_regularizer(0.0005)):
            conv1 = slim.conv2d(x_ph, num_outputs=channels[0], kernel_size=3, scope='conv1')
            pool1 = slim.max_pool2d(conv1, kernel_size=2, scope='pool1')
            conv2 = slim.conv2d(pool1, num_outputs=channels[1], kernel_size=3, scope='conv2')
            pool2 = slim.max_pool2d(conv2, kernel_size=2, scope='pool2')
            conv3 = slim.conv2d(pool2, num_outputs=channels[2], kernel_size=3, scope='conv3')
            conv3_flat = slim.flatten(conv3)
            out = slim.fully_connected(conv3_flat, num_outputs=10, normalizer_fn=None, normalizer_params=None,
                activation_fn=None, scope='output')
    pred = tf.argmax(out, axis=1)
    return out, pred

Can morph-net output a pruned model file?

I am new to TensorFlow, and I want to know if there is a way to output a pruned model file for future training. Currently MorphNet outputs variable names and channel numbers, but the user must manually construct the pruned model every time after pruning.

Please provide an example with an end-to-end implementation?

I find it a bit tricky to plug a network into MorphNet. Providing an end-to-end example of some small network (say, with 3 hidden layers) on a classification problem with BatchNormalization, plugged into MorphNet, would be helpful.

Problems with grouped convolutions

Hi,

I am currently trying to shrink a segmentation model which uses grouped convolutions with the GammaFlopsRegularizer. The grouped convolutions are implemented as follows:

            +-----------+
            | op: split |
            +-----------+
                  |
       +----------------------+
       |                      |
+------------+         +------------+
| op: conv2d |   ...   | op: conv2d | 
+------------+         +------------+
       |                      |
       +----------------------+
                  |
            +------------+
            | op: concat |
            +------------+
                  |
          +---------------+
          | op: batchnorm |
          +---------------+

Using morph-net on a model which contains such layers/structures causes an endless loop during regularizer generation. The loop seems to be caused by the split operation, which expands the group rather than closing it (which usually makes total sense).

My current workaround is adding the Split operation as an OutputNonPassthroughOpHandler in the _get_base_op_handler_dicts() function of the framework/op_handlers.py module. This seems to stop the endless loop, but I am not sure if this produces the correct regularizer groups.

As this "workaround" seems rather hacky and potentially breaks the grouping for other structures, I am curious whether you see a better way to solve this problem?

Thank you for your help and your awesome framework!

Questions about batch_normalization

I use resnet_v1 to train the model; should I set batch_normalization's center=False? I don't know if MorphNet has an impact on beta, because I limit the gamma, but if my beta is very large it seems not to work; I think this is a problem.

Here are my BN parameters:
tf.layers.batch_normalization(inputs=inputs, axis=1 if data_format == 'channels_first' else 3, momentum=_BATCH_NORM_DECAY, epsilon=_BATCH_NORM_EPSILON, center=True, scale=True, training=training, fused=FLAGS.enbl_fused_batchnorm)

How should I deal with depthwise batch normalization?

Thank you for your excellent work.
I decided to implement this algorithm myself, following the paper, on MobileNetV2, and met one problem.
When using the model_size regularizer, the C_cost of every layer should be computed to define the cost function, and we train the model to decide how to shrink the network.

But I notice that in MobileNetV2 the input and output channels must be kept the same in depthwise conv layers. Does this mean I should ignore these layers when shrinking the model?
And in this case, when computing the lambda in the cost function, should I also ignore this part's model size?

Thank you very much.

Keras model with multiple losses and outputs + morph_net

Hi

Thanks a lot for open sourcing the morph-net code. It is something I really wish to use, since I spent a lot of time last month trying to get my network to train for the RL problem I am working on. I have a model in Keras with multiple losses passed in a dict. Now if I wish to add, e.g., a flops regularizer to my model, I wrote something like the following:

    state_input = Input(shape=(self.board_size, self.board_size + self.grid_size,
                               self.stack_frame_size,))
    actual_value = Input(shape=(1,))
    predicted_value = Input(shape=(1,))
    #adv = Input(shape=(1,))
    old_prediction_a = Input(shape=(self.output_size,))
    old_prediction_b = Input(shape=(self.output_size,))
    old_prediction_c = Input(shape=(self.output_size,))
    old_prediction_d = Input(shape=(self.output_size,))


    c1 = Conv2D(32, 10, strides=9, activation='relu', name='Conv1')(state_input)#6
    flat = Flatten()(c1)
    d1 = Dense(2048, activation='relu', name='Dense1')(flat)
    logits_a = Dense(self.output_size, activation = 'softmax',name='logits_a')(d1)
    logits_b = Dense(self.output_size, activation = 'softmax', name='logits_b')(d1)
    logits_c = Dense(self.output_size, activation = 'softmax', name='logits_c')(d1)
    logits_d = Dense(self.output_size, activation = 'softmax', name='logits_d')(d1)

    model = Model(inputs=[state_input, actual_value, predicted_value,
                          old_prediction_a, old_prediction_b,
                          old_prediction_d,old_prediction_c],
                          outputs=[logits_a, logits_b, logits_c, logits_d])
    model.load_weights(weights_path)

    
    network_regularizer = flop_regularizer.GammaFlopsRegularizer(
        [logits_a.op, logits_b.op, logits_c.op, logits_d.op], gamma_threshold=1e-3)
    self.actor_exporter = structure_exporter.StructureExporter(network_regularizer.op_regularizer_manager)
    regularization_strength = 1e-10
    regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)
    losses = {
        'logits_a': ppoloss(actual_value=actual_value, predicted_value=predicted_value,
                                                       old_prediction=old_prediction_a),
        'logits_b': ppoloss(actual_value=actual_value, predicted_value=predicted_value,
                                                       old_prediction=old_prediction_b),
        'logits_c': ppoloss(actual_value=actual_value, predicted_value=predicted_value,
                                                       old_prediction=old_prediction_c),
        'logits_d': ppoloss(actual_value=actual_value, predicted_value=predicted_value,
                                                       old_prediction=old_prediction_d),
    }
    model.compile(optimizer=Adam(lr=1e-4),
                  loss=[losses])

If I add the regularizer loss to my dict, I get an error that since my outputs number 4, I must have only 4 losses. I cannot add it to the losses either, since losses is a dict and regularization_loss is a float.
How may I add the loss in my particular problem?

How to choose a NetworkRegularizer for different CNN models?

There are two types of OpRegularizer: GammaL1Regularizer and GroupLassoRegularizer. The classes GammaFlopsRegularizer and GroupLassoFlopsRegularizer in flop_regularizer.py implement two NetworkRegularizers targeting FLOPs. Now I have a ResNet model; how should I set the regularizer? Does GammaFlopsRegularizer alone work well, or should I use GammaFlopsRegularizer and GroupLassoFlopsRegularizer together to train the model?

Doesn't work with Deconvolutions

Hi, I'm currently trying to train a simple ConvAE on MNIST, using a GammaFlopsRegularizer on out.op of the following network:

def conv_ae(x_ph, is_training, scope, channels=[32, 64, 64], reuse=False):
    norm_params = {'is_training': is_training, 'scale': True, 'center': False}
    # Network Definition
    with tf.variable_scope(scope, reuse=reuse):
        with slim.arg_scope([slim.conv2d, slim.fully_connected],
                      normalizer_fn=slim.batch_norm,
                      normalizer_params=norm_params,
                      weights_initializer=tf.truncated_normal_initializer(0.0, 0.01),
                      weights_regularizer=slim.l2_regularizer(0.0005)):
            conv1 = slim.conv2d(x_ph, num_outputs=channels[0], kernel_size=3, scope='conv1')
            pool1 = slim.max_pool2d(conv1, kernel_size=2, scope='pool1')
            conv2 = slim.conv2d(pool1, num_outputs=channels[1], kernel_size=3, scope='conv2')
            pool2 = slim.max_pool2d(conv2, kernel_size=2, scope='pool2')
            conv3 = slim.conv2d(pool2, num_outputs=channels[2], kernel_size=3, scope='conv3')

            # deconv part
            deconv1 = slim.conv2d_transpose(conv3, num_outputs=channels[2], kernel_size=3, stride=2, scope='conv_transpose1')
            deconv2 = slim.conv2d_transpose(deconv1, num_outputs=channels[1], kernel_size=3, stride=2, scope='conv_transpose2')
            out = slim.conv2d_transpose(deconv2, num_outputs=1, kernel_size=3, stride=1, activation_fn=tf.nn.sigmoid, scope='conv_output')
    return out

However, I'm seeing the OpRegularizerManager hit the iteration limit, and I found the following error:

Traceback (most recent call last):
  File "deconv_test.py", line 33, in <module>
    network_regularizer = flop_regularizer.GammaFlopsRegularizer([pred.op], gamma_threshold=gamma_scale)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/network_regularizers/flop_regularizer.py", line 66, in __init__
    regularizer_blacklist=regularizer_blacklist)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_regularizer_manager.py", line 132, in __init__
    ['%s (%s)' % (o.name, o.type) for o in self._op_deque])
RuntimeError: OpRegularizerManager could not handle ops: ['base/conv_transpose1/conv2d_transpose (Conv2DBackpropInput)', 'base/conv_output/Sigmoid (Sigmoid)', 'base/conv_output/BiasAdd (BiasAdd)', 'base/conv_output/conv2d_transpose (Conv2DBackpropInput)', 'base/conv_transpose2/Relu (Relu)', 'base/conv_transpose2/BiasAdd (BiasAdd)', 'base/conv_transpose2/conv2d_transpose (Conv2DBackpropInput)', 'base/conv_transpose1/Relu (Relu)', 'base/conv_transpose1/BiasAdd (BiasAdd)', 'base/conv_transpose1/stack (Pack)']

Any thoughts on handling deconvs?

WARNING:tensorflow:No supported ops found.

INFO:tensorflow:OpRegularizerManager starting analysis from: [<tf.Operation 'clone_0/semantic' type=Identity>].
INFO:tensorflow:OpRegularizerManager found 8092 ops and 0 sources.
INFO:tensorflow:OpRegularizerManager regularizing 0 groups.
WARNING:tensorflow:No supported ops found.
WARNING:tensorflow:No supported ops found.

Why are no supported ops found? Is this correct?

In this case, MorphNet is getting stuck because it doesn't know how to handle the Shape op. You can try a couple of options:

  1. Try using self.base_model.output rather than the model output. This will skip the dropout ops where the problematic Shape op is. The dense layer is fixed size anyway, so it should be fine not to regularize that layer.
  2. You can try implementing an OpHandler for the Shape op or the dropout op. The OpHandler basically tells MorphNet how the input channels and output channels are related. For example, for the add op (e.g. c = a + b) the input channels and output channels directly map. Removing c[0] requires removing a[0] and b[0] to keep sizes consistent. I suspect the issue here is that the dropout op takes 2 inputs: the feature map and a noise shape tensor. In that case, the OpHandler for dropout should only analyze the channels for the feature map and should ignore the noise shape as that has no effect on which channels are regularized together.

Originally posted by @ayp-google in #36 (comment)

I wonder if we can treat the number of ops for the problematic Shape op as 0?

License discrepancy: MIT vs. Apache

The setup.py in the repo says the code is under the MIT license:

"License :: OSI Approved :: MIT License",

while the LICENSE file contains the Apache 2.0 license, which is confusing.

Could you please clarify which license this repo is under and fix one or the other file to ensure consistency?

Thanks!

Question about setting is_training in normalizer params

Generally, we use a placeholder for the is_training flag of normalizer_params for batch norm to toggle between training and test mode. When using MorphNet regularization, though, we can't set is_training to a placeholder, since we see the following error:

Traceback (most recent call last):
  File "gamma_prune.py", line 47, in <module>
    network_regularizer = latency_regularizer.GammaLatencyRegularizer([out.op], gamma_threshold=gamma_scale, hardware='K80')
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/network_regularizers/latency_regularizer.py", line 90, in __init__
    force_group=force_group, regularizer_blacklist=regularizer_blacklist)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_regularizer_manager.py", line 126, in __init__
    self._op_handler_dict[op.type].assign_grouping(op, self)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/grouping_op_handler.py", line 35, in assign_grouping
    input_ops, op_reg_manager)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_handler_util.py", line 73, in get_ops_without_groups
    op_slices = op_reg_manager.get_op_slices(op)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_regularizer_manager.py", line 437, in get_op_slices
    size = op_handler_util.get_op_size(op)
  File "/Users/vashishtmadhavan/Documents/Code/morph-net/morph_net/framework/op_handler_util.py", line 505, in get_op_size
    shape = op.outputs[0].shape.as_list()
  File "/Users/vashishtmadhavan/anaconda3/lib/python3.6/site-packages/tensorflow/python/framework/tensor_shape.py", line 904, in as_list
    raise ValueError("as_list() is not defined on an unknown TensorShape.")
ValueError: as_list() is not defined on an unknown TensorShape.

What's the appropriate strategy here? Should we just set is_training to a constant (True or False) for both training and testing?
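
One possible workaround (an assumption on our part, not an official recommendation): pass a Python constant instead of a placeholder while MorphNet analyzes the graph, so all tensor shapes are static, and build a separate evaluation graph. Using the base_model definition from the earlier issue:

# Structure-learning graph: is_training fixed to True, so shapes are static.
out, pred = base_model(x_ph, is_training_ph=True, scope='base')

# Evaluation graph: reuse the variables with is_training fixed to False,
# instead of toggling a placeholder at run time.
eval_out, eval_pred = base_model(x_ph, is_training_ph=False, scope='base', reuse=True)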

How to save the weights learned by MorphNet

Since the proposed structure has been learned and exported as JSON, the weights have also been optimized in the meantime. Is there any way to save and restore the trained weights along with the proposed structure, instead of retraining from scratch? I think it would be very helpful and time-saving.

Getting this Error


AttributeError                            Traceback (most recent call last)
<ipython-input> in <module>()
      4 logits = build_model()
      5
----> 6 network_regularizer = flop_regularizer.GammaFlopsRegularizer(
      7     output_boundary=[logits.op],
      8     gamma_threshold=1e-3)

AttributeError: 'Sequential' object has no attribute 'op'

Query on Morph Net Structure Exporter

I have a Keras Sequential model and I train it using
model.fit(x_train.values, y_train.values, batch_size=1024, epochs=10, verbose=2, validation_data=(x_valid, y_valid))
I have configured MorphNet for this model. In this case, is there a way to save the model structure using the structure exporter?

Can MorphNet act on some layers of the model, such as the backbone?

I use just the first three BN layers (conv0-bn0-leakyrelu0, conv1-bn1-leakyrelu1, conv2-bn2-leakyrelu2) of a network with 74 layers in total, with leakyrelu2 as output.op and input_data as input.op.
When running, I get an error:
tensorflow.python.framework.errors_impl.InvalidArgumentError: Retval[0] does not have value

(1)()->None

Can MorphNet act on some layers of the model?

OpRegularizerManager could not handle ops

Hello,

I have tried a few examples from tensorflow/models with MorphNet (LeNet and ResNet); a simple MNIST model (https://github.com/mengdong/morph-net/blob/master/morph_net/examples/mnist/mnist-tutorial.py) works. However, I ran into problems with some other, more complex models under the TensorFlow Estimator interface.

I wonder if there is a recommended way to use MorphNet with the tf.estimator interface? I know there is quite some overhead in the Estimator's graph. Detailed information below:

Regarding lenet (https://github.com/mengdong/morph-net/blob/master/morph_net/examples/mnist/mnist.py) from https://github.com/tensorflow/models/tree/master/official/mnist, I observe that:

    I0904 13:14:27.240477 140031449261888 op_regularizer_manager.py:125] 
    OpRegularizerManager found 63 ops and 4 sources.
    ......
    File "/home/dongm/workspace/laptop_mapping/morph-net/morph_net/framework/op_regularizer_manager.py", line 137, in __init__
    ['%s (%s)' % (o.name, o.type) for o in self._op_deque])
    RuntimeError: OpRegularizerManager could not handle ops: ['sequential/conv2d/BiasAdd (BiasAdd)', 'sequential/max_pooling2d_1/MaxPool (MaxPool)', 'sequential/conv2d_1/BiasAdd (BiasAdd)', 'sequential/max_pooling2d/MaxPool (MaxPool)', 'sequential/conv2d/BiasAdd/ReadVariableOp (ReadVariableOp)'] 

Regarding ResNet (https://github.com/mengdong/morph-net/blob/master/morph_net/examples/resnet/imagenet_main.py), I observe:

    I0904 11:27:34.397989 139699288442688 op_regularizer_manager.py:125] 
    OpRegularizerManager found 629 ops and 53 sources.
    .....
    RuntimeError: OpRegularizerManager could not handle ops: 
    ['resnet_model/batch_normalization_45/FusedBatchNormV3 (FusedBatchNormV3)', 
    'resnet_model/Pad_6 (Pad)', 'resnet_model/batch_normalization_44/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_49/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_48/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_47/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_52/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_51/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/Squeeze (Squeeze)', 'resnet_model/final_reduce_mean (Identity)', 'resnet_model/Mean (Mean)', 'resnet_model/batch_normalization_50/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_43/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_24/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_11/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_1/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/Pad (Pad)', 
    'resnet_model/batch_normalization/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_4/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_3/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/max_pooling2d/MaxPool (MaxPool)', 'resnet_model/initial_max_pool (Identity)', 'resnet_model/batch_normalization_2/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_7/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_6/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_5/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_10/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_9/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_8/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_14/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_13/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/Pad_2 (Pad)', 'resnet_model/batch_normalization_12/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_17/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_16/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_15/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_20/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_19/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_18/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_23/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_22/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_21/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_27/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_26/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/Pad_4 (Pad)', 
    'resnet_model/batch_normalization_25/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_30/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_29/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_28/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_33/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_32/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_31/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_36/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_35/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_34/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_39/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_38/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_37/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_42/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_41/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_40/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_46/FusedBatchNormV3 (FusedBatchNormV3)', 'resnet_model/batch_normalization_45/ReadVariableOp (ReadVariableOp)', 'resnet_model/batch_normalization_45/ReadVariableOp_1 (ReadVariableOp)']

Does MorphNet support conv3d?

Hi,

I'm applying MorphNet on tf.contrib.layers.conv3d(), but it seems that no ops are found.
INFO:tensorflow:OpRegularizerManager starting analysis from: [<tf.Operation 'fc/Relu' type=Relu>].
INFO:tensorflow:OpRegularizerManager found 46 ops and 0 sources.
INFO:tensorflow:OpRegularizerManager regularizing 0 groups.
WARNING:tensorflow:No supported ops found.

Here is my code:
imgs = tf.placeholder(tf.float32, (4, 8, 24, 24, 3))
net = tf.contrib.layers.conv3d(imgs, num_outputs=64, kernel_size=7, name='conv1')
net = tf.nn.max_pool3d(net, [1, 2, 2, 2, 1], [1, 2, 2, 2, 1], padding='VALID', name='maxpool1')
net = tf.contrib.layers.batch_norm(net, center=True, scale=True, is_training=True, scope='bn1')
logits = tf.contrib.layers.fully_connected(net, 10, scope='fc')
network_regularizer = model_size_regularizer.GammaModelSizeRegularizer([logits.op], gamma_threshold=1e-3)
regularization_strength = ....

Does MorphNet support 3d convolution? Thanks!

iteration_limit hit in OpRegularizerManager

I have a situation in which the ITERATION_LIMIT is hit even when len(self._all_ops) is only about 3000 in OpRegularizerManager. It seems that certain ops keep cycling back onto self._op_deque. Is there a specific reason this happens? More generally, why are the same ops processed multiple times, even when they have the same type and handler?

In a simple LeNet training example, I go through 34 iterations of assign_grouping when there are far fewer ops to process:

Itr: 0 Op: base/conv1/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 1 Op: base/conv1/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 2 Op: base/conv1/BatchNorm/gamma Type: VariableV2 Handler: grouping_op_handler
Itr: 3 Op: base/conv1/Conv2D Type: Conv2D Handler: output_non_passthrough_op_handler
Itr: 4 Op: Placeholder Type: Placeholder Handler: grouping_op_handler
Itr: 5 Op: base/conv1/Relu Type: Relu Handler: grouping_op_handler
Itr: 6 Op: base/pool1/MaxPool Type: MaxPool Handler: grouping_op_handler
Itr: 7 Op: base/conv2/Conv2D Type: Conv2D Handler: output_non_passthrough_op_handler
Itr: 8 Op: base/conv2/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 9 Op: base/conv2/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 10 Op: base/conv2/BatchNorm/gamma Type: VariableV2 Handler: grouping_op_handler
Itr: 11 Op: base/conv2/Relu Type: Relu Handler: grouping_op_handler
Itr: 12 Op: base/pool2/MaxPool Type: MaxPool Handler: grouping_op_handler
Itr: 13 Op: base/conv3/Conv2D Type: Conv2D Handler: output_non_passthrough_op_handler
Itr: 14 Op: base/conv3/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 15 Op: base/conv3/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 16 Op: base/conv3/BatchNorm/gamma Type: VariableV2 Handler: grouping_op_handler
Itr: 17 Op: base/conv3/Relu Type: Relu Handler: grouping_op_handler
Itr: 18 Op: base/conv_output/Conv2D Type: Conv2D Handler: output_non_passthrough_op_handler
Itr: 19 Op: base/conv_output/BiasAdd Type: BiasAdd Handler: grouping_op_handler
Itr: 20 Op: base/conv_output/biases/read Type: Identity Handler: grouping_op_handler
Itr: 21 Op: base/conv_output/biases Type: VariableV2 Handler: grouping_op_handler
Itr: 22 Op: Mean Type: Mean Handler: grouping_op_handler
Itr: 23 Op: base/conv1/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 24 Op: base/conv1/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 25 Op: base/conv2/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 26 Op: base/conv2/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 27 Op: base/conv3/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 28 Op: base/conv3/BatchNorm/gamma/read Type: Identity Handler: grouping_op_handler
Itr: 29 Op: base/conv_output/BiasAdd Type: BiasAdd Handler: grouping_op_handler
Itr: 30 Op: base/conv_output/biases/read Type: Identity Handler: grouping_op_handler
Itr: 31 Op: base/conv1/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 32 Op: base/conv2/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 33 Op: base/conv3/BatchNorm/FusedBatchNorm Type: FusedBatchNorm Handler: batch_norm_source_op_handler
Itr: 34 Op: base/conv_output/BiasAdd Type: BiasAdd Handler: grouping_op_handler

ValueError: Total size for op slices do not match

When I use multiple outputs like this:

network_regularizer = flop_regularizer.GammaFlopsRegularizer([box_pred.op, cls_pred.op], gamma_threshold=1e-3)

I got this error:

INFO:tensorflow:OpRegularizerManager starting analysis from: [<tf.Operation 'concat_2' type=ConcatV2>, <tf.Operation 'concat_3' type=ConcatV2>].
INFO:tensorflow:OpRegularizerManager found 3414 ops and 61 sources.
ValueError: Total size for op slices do not match: [[3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3], [3]]

I am very confused. Thank you for your reply!
The shapes of the outputs are:
<class 'tensorflow.python.framework.ops.Tensor'> (48, 2046, 4)
<class 'tensorflow.python.framework.ops.Tensor'> (48, 2046, 4)
@batzner @eladeban

Getting error like this

Hello, thanks for the good work.

This model is built with Keras, and I use keras.layers.BatchNormalization.
But when I run this code, I get the following error:

with tf.name_scope('Morph_net'):
    network_regularizer = flop_regularizer.GammaFlopsRegularizer(
    output_boundary=[out.op],
        input_boundary=[left_input.op, right_input.op],
        gamma_threshold= 0.45,
        regularizer_blacklist= None,
        regularizer_decorator=dummy_decorator.DummyDecorator,
        decorator_parameters={'scale': 0.5}
    )
    regularization_strength = 1e-10
    regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)

The full error message is:

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-69-27398095666a> in <module>
    161         regularizer_blacklist= None,
    162         regularizer_decorator=dummy_decorator.DummyDecorator,
--> 163         decorator_parameters={'scale': 0.5}
    164     )
    165     regularization_strength = 1e-10

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/network_regularizers/flop_regularizer.py in __init__(self, output_boundary, gamma_threshold, regularizer_decorator, decorator_parameters, input_boundary, force_group, regularizer_blacklist)
     70         input_boundary=input_boundary,
     71         force_group=force_group,
---> 72         regularizer_blacklist=regularizer_blacklist)
     73     self._calculator = cost_calculator.CostCalculator(
     74         self._manager, resource_function.flop_function)

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/framework/op_regularizer_manager.py in __init__(self, output_boundary, op_handler_dict, create_grouping_regularizer, force_group, regularizer_blacklist, input_boundary, iteration_limit)
    168           handler = self._op_handler_dict[source_op_slice.op.type]
    169           source_op_slices.append(source_op_slice)
--> 170           regularizers.append(handler.create_regularizer(source_op_slice))
    171 
    172       # Create a group regularizer and assign to all OpSlice in the OpGroup.  If

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/framework/op_handler_decorator.py in create_regularizer(self, op_slice)
     51       A decorated OpRegularizer for the given OpSlice.
     52     """
---> 53     regularizer = self._op_handler.create_regularizer(op_slice)
     54     if regularizer and self._regularization_decorator:
     55       regularizer = self._regularization_decorator(

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/framework/batch_norm_source_op_handler.py in create_regularizer(self, op_slice)
     62     if start_index == 0 and size == gamma.shape.as_list()[-1]:
     63       return gamma_l1_regularizer.GammaL1Regularizer(
---> 64           gamma, self._gamma_threshold)
     65     else:
     66       # Note: this conversion is also attempted inside GammaL1Regularizer

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/op_regularizers/gamma_l1_regularizer.py in __init__(self, gamma, gamma_threshold)
     21         considered 'alive'.
     22     """
---> 23     self._gamma = tpu_util.maybe_convert_to_variable(gamma)
     24     self._gamma_threshold = gamma_threshold
     25     abs_gamma = tf.abs(self._gamma)

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/framework/tpu_util.py in maybe_convert_to_variable(tensor)
     74       reuse=True,
     75   ):
---> 76     variable_name = get_variable_name(op)
     77     tf.logging.info('Converting tensor %s --> variable %s',
     78                     tensor, variable_name)

~/anaconda3/lib/python3.7/site-packages/morph_net-0.2.1-py3.7.egg/morph_net/framework/tpu_util.py in get_variable_name(read_variable_op)
     37   op = read_variable_op
     38   while op.type != 'VarHandleOp':
---> 39     assert len(op.inputs) == 1
     40     op = op.inputs[0].op
     41   return op.name

AssertionError: 

Could you tell me how I can fix this?

No Changes In Learned Structures

I am getting all the learned structures the same as my initial network, i.e., MorphNet did not change my network at all. What could be the possible reasons for this, and how can I correct it?

How to save the JSON file that extracts the architecture learned by MorphNet?

Hi, thanks for open sourcing. When I followed the instructions for extracting the architecture learned by MorphNet, the code reported an error:

NotFoundError: xxxxxxx/epoch_1/learned_structure/alive_999; No such file or directory

However, when I see the code following:

  def create_file_and_save_alive_counts(self, train_dir: Text,
                                        global_step: tf.Tensor) -> None:
    """Creates a file and saves live counts to it.

    Creates the directory {train_dir}/learned_structure/ and saves the current
    alive counts to {path}/{_ALIVE_FILENAME}_{global_step} and overwrites
    {path}/{_ALIVE_FILENAME}.

    Args:
      train_dir: where to export the alive counts.
      global_step: current value of global step, used as a suffix in filename.
    """
    current_filename = '%s_%s' % (_ALIVE_FILENAME, global_step)
    directory = os.path.join(train_dir, 'learned_structure')
    try:
      tf.gfile.MkDir(directory)
    except tf.errors.OpError:
      # Probably already exists. If not, we'll see the error in the next line.
      pass
    with tf.gfile.Open(os.path.join(directory, current_filename), 'w') as f:
      self.save_alive_counts(f)
    with tf.gfile.Open(os.path.join(directory, _ALIVE_FILENAME), 'w') as f:
      self.save_alive_counts(f)

the code does not try to create the files at os.path.join(directory, current_filename) and os.path.join(directory, _ALIVE_FILENAME).

Am I using it incorrectly?

Thanks for your reply.

Something wrong happened when I use flop_regularizer on keras model.

Hi, thank you very much for opening up this project.
Here is my error. This is my model:

from keras.models import Model
from keras.layers import Dense, GlobalAveragePooling2D, Dropout

self.base_model = keras.applications.inception_v3.InceptionV3(include_top=False,weights='inception_v3_weights_tf_dim_ordering_tf_kernels_notop.h5',input_shape=input_shape)
x = self.base_model.output
x = GlobalAveragePooling2D()(x)
x = Dense(self.config.fc_size, activation='relu',)(x)
features = Dense(self.config.fc_size, activation='relu',name='features')(Dropout(self.config.dropout_ratio)(x))
predictions = Dense(1, activation='sigmoid',name='predictions')(features)
self.model = Model(input=self.base_model.input, output=[predictions,features])

network_regularizer = flop_regularizer.GammaFlopsRegularizer(
[self.model.output[0].op], gamma_threshold=1e-3)

When I run this, it returns an error:

  File "E:\20190425_MorphNet\20190425_morph-net-master\model_local.py", line 59, in add_new_last_layer
    flop_regularizer.GammaFlopsRegularizer([self.model.output[0].op], gamma_threshold=1e-3)
  File "E:\20190425_MorphNet\20190425_morph-net-master\morph_net\network_regularizers\flop_regularizer.py", line 95, in __init__
    regularizer_blacklist=regularizer_blacklist)
  File "E:\20190425_MorphNet\20190425_morph-net-master\morph_net\framework\op_regularizer_manager.py", line 132, in __init__
    ['%s (%s)' % (o.name, o.type) for o in self._op_deque])
RuntimeError: OpRegularizerManager could not handle ops: ['dropout_1/cond/dropout/Shape (Shape)', 'dropout_2/cond/dropout/Shape (Shape)']

How could I solve this error?
And I have another question: can I use MorphNet on a Keras model?

Errors with Distributed Training

Hi, I am trying to train a network with the GammaFlopsRegularizer by distributing it over 4 GPUs on my machine. I'm using the Horovod library to train a simple convolutional autoencoder, following the script below:

import numpy as np
import os
import tensorflow as tf
import tensorflow.contrib.slim as slim  # used below but missing from the original script
import horovod.tensorflow as hvd

from morph_net.network_regularizers import flop_regularizer
from morph_net.network_regularizers import latency_regularizer
from morph_net.tools import structure_exporter

# Define the Network
def conv_ae(x_ph, is_training, scope, channels=[32, 64, 64], reuse=False):
    norm_params = {'is_training': is_training, 'scale': True, 'center': False}
    # Network Definition
    with tf.variable_scope(scope, reuse=reuse):
        with slim.arg_scope([slim.conv2d, slim.conv2d_transpose],
                      normalizer_fn=slim.batch_norm,
                      normalizer_params=norm_params,
                      weights_initializer=tf.truncated_normal_initializer(0.0, 0.01),
                      weights_regularizer=slim.l2_regularizer(0.0005)):
            conv1 = slim.conv2d(x_ph, num_outputs=channels[0], kernel_size=3, scope='conv1')
            pool1 = slim.max_pool2d(conv1, kernel_size=2, scope='pool1')
            conv2 = slim.conv2d(pool1, num_outputs=channels[1], kernel_size=3, scope='conv2')
            pool2 = slim.max_pool2d(conv2, kernel_size=2, scope='pool2')
            conv3 = slim.conv2d(pool2, num_outputs=channels[2], kernel_size=3, scope='conv3')

            # deconv part
            deconv1 = slim.conv2d_transpose(conv3, num_outputs=channels[2], kernel_size=3, stride=2, scope='conv_transpose1')
            deconv2 = slim.conv2d_transpose(deconv1, num_outputs=channels[1], kernel_size=3, stride=2, scope='conv_transpose2')
            out = slim.conv2d_transpose(deconv2, num_outputs=1, kernel_size=3, stride=1, normalizer_fn=None, normalizer_params=None, activation_fn=tf.nn.sigmoid, scope='conv_output')
    return out


# Arguments
total_steps = 10000
batch_size = 128
val_batch_size = 1000
learning_rate = 1e-3
outdir = 'ae_morph_saves/'
gamma_scale = 1e-3
reg_strength = 1e-8
channels = [64, 64, 64]

# Initialize horovod
hvd.init()

# Load MNIST Data
train_data, test_data = tf.keras.datasets.mnist.load_data()
X_train, X_test = train_data[0], test_data[0]
global_step = tf.train.get_or_create_global_step()

# Optimization
N, H, W = X_train.shape
X_train_ph = tf.placeholder(tf.float32, [None, H, W, 1])
pred = conv_ae(X_train_ph, True, scope='base', channels=channels, reuse=False)
loss = tf.reduce_mean(tf.reduce_sum(tf.squared_difference(X_train_ph, pred), axis=[1,2,3]), axis=0)
model_vars = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope='base')

# applying L1 loss on batch norm gamma
network_regularizer = flop_regularizer.GammaFlopsRegularizer(
		output_boundary=[pred.op],
		input_boundary=[X_train_ph.op],
		gamma_threshold=gamma_scale)
reg_loss = network_regularizer.get_regularization_term() * reg_strength
cost_op = network_regularizer.get_cost()

# setting up distributed optimizer
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate * hvd.size())
optimizer = hvd.DistributedOptimizer(optimizer)

gradients = optimizer.compute_gradients(loss + reg_loss, var_list=model_vars)
train_op = optimizer.apply_gradients(gradients, global_step=global_step)
exporter = structure_exporter.StructureExporter(network_regularizer.op_regularizer_manager)

hooks = [
	hvd.BroadcastGlobalVariablesHook(0),
	tf.train.StopAtStepHook(last_step=total_steps // hvd.size()),
	tf.train.LoggingTensorHook(tensors={'step': global_step, 'loss': loss}, every_n_iter=10),
]

# pin GPU to be used to process local rank (one GPU per process)
config = tf.ConfigProto()
config.gpu_options.allow_growth = True
config.gpu_options.visible_device_list = str(hvd.local_rank())
outdir = outdir if hvd.rank() == 0 else None

# Training Loop
with tf.train.MonitoredTrainingSession(checkpoint_dir=outdir,
										hooks=hooks, 
										config=config) as mon_sess:
	while not mon_sess.should_stop():
		idx = np.random.choice(X_train.shape[0], batch_size, replace=False)
		x_t = np.expand_dims(X_train[idx], axis=-1)
		feed_dict = {X_train_ph: x_t}
		g_step_val = mon_sess.run(global_step, feed_dict=feed_dict)
		_, grad_vals, structure_exporter_tensors, t_loss, reg_cost = mon_sess.run(
			[train_op, gradients, exporter.tensors, loss, cost_op],
			feed_dict=feed_dict)

		print("Step: ", g_step_val)
		print("Training Loss: ", t_loss)
		print("Reg Cost: ", reg_cost)

		# exporting model to JSON
		if hvd.rank() == 0:
			exporter.populate_tensor_values(structure_exporter_tensors)
			exporter.create_file_and_save_alive_counts(outdir, g_step_val)

When trying to train the model using mpirun -np 4 -H localhost:4 -bind-to none -map-by slot -genvlist LD_LIBRARY_PATH,PATH python <script-above>, training starts to hang, suggesting that the gradients are out of sync. The error is shown below:

Fri May 31 19:02:24 2019[1,0]<stderr>:[2019-05-31 19:02:24.396204: W horovod/common/operations.cc:588] One or more tensors were submitted to be reduced, gathered or broadcasted by subset of ranks and are waiting for remainder of ranks for more than 60 seconds. This may indicate that different ranks are trying to submit different tensors or that only subset of ranks is submitting tensors, which will cause deadlock. 
Fri May 31 19:02:24 2019[1,0]<stderr>:Stalled ops:DistributedAdamOptimizer_Allreduce/HorovodAllreduce_gradients_AddN_3_0 [missing ranks: 0, 1, 3]
Fri May 31 19:02:24 2019[1,0]<stderr>:[2019-05-31 19:02:24.396257: W horovod/common/operations.cc:588] DistributedAdamOptimizer_Allreduce/HorovodAllreduce_gradients_AddN_8_0 [missing ranks: 0, 1, 3]

If I remove the GammaFlopsRegularizer portion, the error disappears and the model trains fine. My guess is that the regularizer is doing some non-deterministic pruning of connections: there seem to be nodes whose corresponding gradients have been removed from the graph on some ranks. I may be wrong, but is there a way to enforce that this pruning is deterministic across the distributed workers?

If not, is there another fix? I think this would be worth looking into, as most large models that benefit from pruning rely on some form of distributed training.
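A quick diagnostic sketch (assuming the script above, run under mpirun): before entering the training loop, check that every rank builds the same list of gradient tensors. If the lists differ, the graphs have diverged and the allreduce will stall exactly as in the log above.

# Compare the gradient tensors each rank will submit to allreduce;
# `gradients` and `hvd` come from the script above.
grad_names = sorted(g.name for g, v in gradients if g is not None)
print('rank %d: %d gradient tensors' % (hvd.rank(), len(grad_names)))
for name in grad_names:
    print('rank %d: %s' % (hvd.rank(), name))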

as_list() is not defined on an unknown TensorShape?

[screenshot of the error traceback]

Code:
import tensorflow as tf
from morph_net.network_regularizers import flop_regularizer
from morph_net.network_regularizers import latency_regularizer
from morph_net.tools import structure_exporter
import numpy as np
from datetime import datetime
import time

def conv2d(input, w, b):
    return tf.nn.relu(tf.nn.bias_add(
        tf.nn.conv2d(input, w, strides=[1, 1, 1, 1], padding='SAME'), b))

def max_pool(input):
    return tf.nn.max_pool(input, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1],
                          padding='SAME')

def fc(input, w, b):
    return tf.nn.relu(tf.add(tf.matmul(input, w), b))

def alex_net(_input_r, _weights, _biases, _keep_prob):
    with tf.name_scope('conv1'):
        _conv1 = conv2d(_input_r, _weights['wc1'], _biases['bc1'])

    with tf.name_scope('pool1'):
        _pool1 = max_pool(_conv1)

    with tf.name_scope('conv2'):
        _conv2 = conv2d(_pool1, _weights['wc2'], _biases['bc2'])

    with tf.name_scope('pool2'):
        _pool2 = max_pool(_conv2)

    with tf.name_scope('conv3'):
        _conv3 = conv2d(_pool2, _weights['wc3'], _biases['bc3'])

    with tf.name_scope('conv4'):
        _conv4 = conv2d(_conv3, _weights['wc4'], _biases['bc4'])

    with tf.name_scope('conv5'):
        _conv5 = conv2d(_conv4, _weights['wc5'], _biases['bc5'])

    with tf.name_scope('pool3'):
        _pool3 = max_pool(_conv5)

    _densel = tf.reshape(_pool3,
                         [-1, _weights['wd1'].get_shape().as_list()[0]])
    with tf.name_scope('fc1'):
        _fc1 = fc(_densel, _weights['wd1'], _biases['bd1'])
        _fc1_drop = tf.nn.dropout(_fc1, _keep_prob)

    with tf.name_scope('fc2'):
        _fc2 = fc(_fc1_drop, _weights['wd2'], _biases['bd2'])
        _fc2_drop = tf.nn.dropout(_fc2, _keep_prob)

    with tf.name_scope('out'):
        _out = tf.add(tf.matmul(_fc2_drop, _weights['wd3']), _biases['bd3'])

    return _out

n_output = 10
learning_rate = 0.001
dropout = 0.75
train_data, test_data = tf.keras.datasets.mnist.load_data()
X_train, X_test = train_data[0], test_data[0]
Y_train, Y_test = train_data[1], test_data[1]
train_dir = "./Morph_save/"
print("CNN READY")

x = tf.placeholder(tf.float32, [None, 28, 28, 1])
y = tf.placeholder(tf.int64, [None])  # integer class labels for the sparse softmax loss
keep_prob = tf.placeholder(tf.float32)

weights = {
    'wc1': tf.Variable(tf.truncated_normal([3, 3, 1, 64], dtype=tf.float32, stddev=0.1), name='weights1'),
    'wc2': tf.Variable(tf.truncated_normal([3, 3, 64, 128], dtype=tf.float32, stddev=0.1), name='weights2'),
    'wc3': tf.Variable(tf.truncated_normal([3, 3, 128, 256], dtype=tf.float32, stddev=0.1), name='weights3'),
    'wc4': tf.Variable(tf.truncated_normal([3, 3, 256, 256], dtype=tf.float32, stddev=0.1), name='weights4'),
    'wc5': tf.Variable(tf.truncated_normal([3, 3, 256, 128], dtype=tf.float32, stddev=0.1), name='weights5'),
    'wd1': tf.Variable(tf.truncated_normal([4*4*128, 1024], dtype=tf.float32, stddev=0.1), name='weights_fc1'),
    'wd2': tf.Variable(tf.random_normal([1024, 1024], dtype=tf.float32, stddev=0.1), name='weights_fc2'),
    'wd3': tf.Variable(tf.random_normal([1024, n_output], dtype=tf.float32, stddev=0.1), name='weights_output')
}
biases = {
    'bc1': tf.Variable(tf.constant(0.0, shape=[64], dtype=tf.float32), trainable=True, name='biases1'),
    'bc2': tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32), trainable=True, name='biases2'),
    'bc3': tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32), trainable=True, name='biases3'),
    'bc4': tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32), trainable=True, name='biases4'),
    'bc5': tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32), trainable=True, name='biases5'),
    'bd1': tf.Variable(tf.constant(0.0, shape=[1024], dtype=tf.float32), trainable=True, name='biases_fc1'),
    'bd2': tf.Variable(tf.constant(0.0, shape=[1024], dtype=tf.float32), trainable=True, name='biases_fc2'),
    'bd3': tf.Variable(tf.constant(0.0, shape=[n_output], dtype=tf.float32), trainable=True, name='biases_output')
}
print("CNN READY")
pred = alex_net(x, weights, biases, keep_prob)
network_regularizer = flop_regularizer.GroupLassoFlopsRegularizer(
    output_boundary=[pred.op], input_boundary=[x.op, y.op], threshold=1e-3)
regularization_strength = 1e-8
regularizer_loss = (network_regularizer.get_regularization_term() * regularization_strength)
model_loss = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=y, logits=pred))
cost_op = network_regularizer.get_cost()
optimizer = tf.train.MomentumOptimizer(learning_rate=0.01, momentum=0.9)
train_op = optimizer.minimize(model_loss + regularizer_loss)
exporter = structure_exporter.StructureExporter(network_regularizer.op_regularizer_manager)
init = tf.global_variables_initializer()
print("FUNCTIONS READY")

training_epochs = 1000
batch_size = 500
display_step = 10

sess = tf.Session()
sess.run(init)
for epoch in range(training_epochs):
    avg_cost = 0.
    total_batch = int(60000 / batch_size)
    start_time = time.time()
    for i in range(total_batch):
        batch_xs = np.expand_dims(X_train[i*batch_size:(i+1)*batch_size], axis=-1)
        batch_ys = Y_train[i*batch_size:(i+1)*batch_size]

        _, get_cost, loss_val, regloss_val, structure_exporter_tensors = sess.run(
            [train_op, cost_op, model_loss, regularizer_loss, exporter.tensors],
            feed_dict={x: batch_xs, y: batch_ys, keep_prob: dropout})
        avg_cost += loss_val / total_batch

    if epoch % display_step == 0:
        train_loss = sess.run(model_loss, feed_dict={x: batch_xs, y: batch_ys, keep_prob: 1.0})
        test_loss = sess.run(model_loss, feed_dict={x: np.expand_dims(X_test, axis=-1), y: Y_test, keep_prob: 1.0})
        exporter.populate_tensor_values(structure_exporter_tensors)
        exporter.create_file_and_save_alive_counts(train_dir, epoch)
        print("Epoch: %03d/%03d cost: %.9f TRAIN LOSS: %.3f TEST LOSS: %.3f"
              % (epoch, training_epochs, avg_cost, train_loss, test_loss))
    duration = time.time() - start_time
    print('%s: step %d, duration = %.3f' % (datetime.now(), epoch, duration))

print("DONE")

Bad structure

Hi, I have pruned ResNet-18 with the FLOPs regularizer, but the final structure of the pruned model is strange. The first conv (7×7 kernel, 64 filters) is pruned to zero channels, so the network cannot run. Do you have any idea how to deal with this situation?
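One possible mitigation, assuming the regularizer_blacklist argument that appears in the tracebacks above excludes ops from regularization: keep the first conv out of the regularizer so it can never be driven to zero channels. A rough sketch; 'resnet18/conv1' is a hypothetical scope name that must match your graph, and logits/inputs stand for your model's output and input tensors:

from morph_net.network_regularizers import flop_regularizer

# The first conv is excluded from regularization via the blacklist, so its
# 64 filters stay alive; everything downstream is still regularized.
network_regularizer = flop_regularizer.GammaFlopsRegularizer(
    output_boundary=[logits.op],
    input_boundary=[inputs.op],
    gamma_threshold=1e-3,
    regularizer_blacklist=['resnet18/conv1'])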

FLOPs regularizer with multiple output logits

Hi,
Thanks a lot for making MorphNet open source. I wish to try it out, but I'm having some trouble getting it to work.
I have multiple outputs from my network. I am using the GroupLassoFlopsRegularizer, and it gives me an error when I pass it the list of logits that come out of my network.
My network:

import tensorflow as tf
from tensorflow.keras.layers import Conv2D, Dense, Flatten, Softmax
from morph_net.network_regularizers import flop_regularizer
from morph_net.tools import structure_exporter

def build_model(self):
    self.g = tf.Graph()
    with self.g.as_default():
        tf.set_random_seed(42)
        self.inputs = tf.placeholder(tf.float32, [None, *(self.state_size)], name="inputs_")
        print(self.inputs)
        self.out_a = tf.placeholder(tf.int32, [None], name="out_a")
        self.out_b = tf.placeholder(tf.int32, [None], name="out_b")
        self.out_c = tf.placeholder(tf.int32, [None], name="out_c")
        self.out_d = tf.placeholder(tf.int32, [None], name="out_d")
        self.discounted_episode_rewards_norm = tf.placeholder(
            tf.float32, [None], name="discounted_episode_rewards_")
        self.avg_reward_ = tf.placeholder(tf.float32, name="avg_reward")
        print(self.discounted_episode_rewards_norm)
        # Add this placeholder for having this variable in tensorboard
        self.mean_reward_ = tf.placeholder(tf.float32, name="mean_reward")

        c1 = Conv2D(32, 8, strides=2, activation='relu', name='Conv1')(self.inputs)
        c2 = Conv2D(64, (6, 8), strides=2, activation='relu', name='Conv2')(c1)
        c3 = Conv2D(128, (6, 8), strides=2, activation='relu', name='Conv3')(c2)
        flat = Flatten()(c3)
        d1 = Dense(1024, activation='selu', name='Dense1')(flat)
        d2 = Dense(1024, activation='selu', name='Dense2')(d1)
        logits_a = Dense(self.output_size, name='logits_a')(d2)
        logits_b = Dense(self.output_size, name='logits_b')(d2)
        logits_c = Dense(self.output_size, name='logits_c')(d2)
        logits_d = Dense(self.output_size, name='logits_d')(d2)

        self.outputs_softmax_a = Softmax()(logits_a)
        self.outputs_softmax_b = Softmax()(logits_b)
        self.outputs_softmax_c = Softmax()(logits_c)
        self.outputs_softmax_d = Softmax()(logits_d)
        self.network_regularizer = flop_regularizer.GroupLassoFlopsRegularizer(
            [logits_a.op, logits_b.op, logits_c.op, logits_d.op], threshold=1e-3)
        self.exporter = structure_exporter.StructureExporter(
            self.network_regularizer.op_regularizer_manager)
        self.regularization_strength = 1e-10

        with tf.name_scope("loss"):
            neg_log_prob_a = tf.nn.sparse_softmax_cross_entropy_with_logits(logits=logits_a, labels=self.out_a)
            neg_log_prob_b = tf.nn.sparse_softmax_cross_entropy_with_logits(logits=logits_b, labels=self.out_b)
            neg_log_prob_c = tf.nn.sparse_softmax_cross_entropy_with_logits(logits=logits_c, labels=self.out_c)
            neg_log_prob_d = tf.nn.sparse_softmax_cross_entropy_with_logits(logits=logits_d, labels=self.out_d)

            loss_a = tf.reduce_mean(neg_log_prob_a * self.discounted_episode_rewards_norm)  # reward guided loss
            loss_b = tf.reduce_mean(neg_log_prob_b * self.discounted_episode_rewards_norm)  # reward guided loss
            loss_c = tf.reduce_mean(neg_log_prob_c * self.discounted_episode_rewards_norm)  # reward guided loss
            loss_d = tf.reduce_mean(neg_log_prob_d * self.discounted_episode_rewards_norm)  # reward guided loss
            self.regularizer_loss = (self.network_regularizer.get_regularization_term() *
                                     self.regularization_strength)

            self.loss = loss_a + loss_b + loss_c + loss_d + self.regularizer_loss

        with tf.name_scope("train"):
            train_ops = tf.train.AdamOptimizer(self.lr).minimize(self.loss)
            self.train_op = tf.group([train_ops])

        # setup_writer has to run before the global variables initializer,
        # inside the graph, so that write_op is populated.
        self.setup_writer()
        self.init = tf.global_variables_initializer()

The error I get when I run the script that instantiates the model:

File "/home/ubuntu/morph_basic/basic_bline.py", line 78, in build_model [logits_a.op, logits_b.op, logits_c.op, logits_d.op], threshold=1e-3) File "/home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/morph_net/network_regularizers/flop_regularizer.py", line 153, in __init__ regularizer_blacklist=regularizer_blacklist) File "/home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/morph_net/framework/op_regularizer_manager.py", line 126, in __init__ self._op_handler_dict[op.type].assign_grouping(op, self) File "/home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/morph_net/framework/grouping_op_handler.py", line 40, in assign_grouping output_ops, op_reg_manager) File "/home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/morph_net/framework/op_handler_util.py", line 73, in get_ops_without_groups op_slices = op_reg_manager.get_op_slices(op) File "/home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/morph_net/framework/op_regularizer_manager.py", line 438, in get_op_slices if size > 0: TypeError: '>' not supported between instances of 'NoneType' and 'int'

setup.py does not create a morph-net module

Hi,

I am trying to start working with Morph-Net and can't seem to install the required Python modules. Running the setup.py script generates the expected .egg file, and it is copied to the appropriate location. However, I can't get the various modules recognised. I'm not sure if I'm making a rookie mistake or missing something here.

Does morph-net have an operation to build a new model with its FLOP/latency-optimized outputs?

Just wanted to confirm: once we have trained our initial model, we have a JSON file containing the optimized output sizes of the different layers. When I want to retrain a new model with these optimized output sizes, I have to create it from scratch with those sizes, correct? I've been looking through the documentation for some kind of function that will do this for us, but haven't been able to find anything. A sketch of doing it by hand follows.
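A minimal sketch of the manual rebuild (the JSON path and op names are hypothetical, and the exported file is assumed to map op names to alive channel counts):

import json
import tensorflow as tf

# Hypothetical path; the exporter names the file via create_file_and_save_alive_counts.
with tf.gfile.Open('train_dir/learned_structure/alive', 'r') as f:
    alive_counts = json.load(f)

# Re-define each layer with the learned width, e.g. for a conv originally
# named 'conv1' whose Conv2D op appears in the JSON:
inputs = tf.placeholder(tf.float32, [None, 28, 28, 1])
conv1 = tf.layers.conv2d(inputs, filters=alive_counts['conv1/Conv2D'],
                         kernel_size=3, padding='same', name='conv1')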

Application to model including Keras upsampling layers (missing op handler)

I tried MorphNet on my Keras/TF implementation of a skip network. Each deep module includes a Keras UpSampling2D layer, which upsamples a tensor from (batch_size, a, b, channels) to (batch_size, 2a, 2b, channels). That's an issue because the TF ops ResizeNearestNeighbor and ResizeBilinear, which that layer uses, are not supported by MorphNet at the moment.

Before trying to implement op handlers for these, I quickly wanted to ask whether the optimization routine might have inherent issues with upsampling in general that might not be obvious at first.

Secondly, after reading #36, my understanding is that I could optimize a subnetwork between two upsampling layers by setting the input and output boundaries to these two layers, respectively. Is that correct? A sketch of what I mean follows.
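A rough sketch of that boundary idea (untested; up1_out and pre_up2 are hypothetical names for the first upsampling's output tensor and the last tensor before the second upsampling):

from morph_net.network_regularizers import flop_regularizer

# Only the ops between the two upsampling layers are crawled: the crawl
# starts at the output boundary and stops at the input boundary.
network_regularizer = flop_regularizer.GroupLassoFlopsRegularizer(
    output_boundary=[pre_up2.op],
    input_boundary=[up1_out.op],
    threshold=1e-3)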

how to do the expansion?

Hi, I have a question about the expansion.
I trained with the regularizer loss and got the following network structure.
How should I use a uniform width multiplier to do the expansion?

[screenshot of the learned network structure]

Do I need to manually adjust the number of channels? If I use a multiplier of 2, does that mean I need to set the channels of the layer 'g_net/enc1_2/conv1/Conv2D: 12' to 24?

Also, I would like to ask whether morph-net can be used for pixel-to-pixel tasks such as super-resolution and deblurring. Can it keep the same accuracy after shrinking the network?
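My reading of the uniform expansion step, as a sketch (assuming the exported JSON maps op names to alive counts, as in the structure above): scale every learned count by the same factor and round, then build the expanded network from the scaled counts.

import json
import tensorflow as tf

multiplier = 2.0
with tf.gfile.Open('train_dir/learned_structure/alive', 'r') as f:  # hypothetical path
    alive_counts = json.load(f)

# Uniform width multiplier: every layer's learned width is scaled the same
# way, so 'g_net/enc1_2/conv1/Conv2D': 12 becomes 24 channels in the new model.
expanded = {op: max(1, int(round(count * multiplier)))
            for op, count in alive_counts.items()}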
