Giter Club home page Giter Club logo

experiments's Introduction

Here is the experiment code! Here is what you need to do to get this running.

  1. Rewrite train_model_complex in the run_experiments.py file. If you need new configurable inputs, let me know. Make sure that the outputs are of the same format.
  • You should be able to copy all of the argument pre-processing code (as described in the comments).
  • Be sure to deepcopy the quantizer every time you pass it into a layer. These quantizers have to know the shape of the input (this is done in the build step of QConv2DClean)
  • Do not quantize the activations.
  1. In my model, I quantize all weights except for the final dense layer, i.e. the last two elements of the weights list. If this is NOT the case for you, please let me know and I can make this configurable.
  • You'll notice that this is the only place that I use the straight_initializer. This just ensures that the weights are initialized without any remapping. Be sure to do the same for any non-quantized layers.
  1. The parameters for the run are located in the config. We may need to meet to discuss what all the config options mean. For any new run, you should create a new <directory>, put a config in that directory, and run python run_experiments.py <directory>. Results for each model are cached, as well as all images. The final metrics are in results.json.
  2. At the moment there is no "delayed start" like the one we discussed before. Instead we're using a warmup schedule, which gives good results.
  3. I've run into some CPU memory issues with this code, primarily due to the change point calculations. I only got this to work with I used a t3.2xlarge instance on AWS (see here).

experiments's People

Contributors

mschoenb97 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.