Giter Club home page Giter Club logo

slash's People

Contributors

shady-cs15 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

isohamnemesis

slash's Issues

Choice of Optimizer

  • Adam (Better accuracy but memory state is always negative i.e. global context becomes 0)
  • SGD (Less accuracy but memory state is ~ 50 % +ve, i.e. 50 % active after ReLU)

Preprocessing script

  • Save raw wave and masks in uint8 in stead of float32.
  • No need to save labels
  • read directly from data/abc.wav and create tmp/abc_x.npy and tmp/abc_m.npy

batch_size > 1

one batch corresponds to one clip.
select batch_size number of anchor pointers in the clip.
use masking as required.

check sample rnn generation

  • train sample rnn original code (theano) and test generative capabilities of the same.
  • train custom arch on same dataset for same iterations and test generative capabilities.
  • Let model overfit if need be

Formattting

format print statements

  • floats up to 4 decimal places
  • print global step number in training
  • remove song #
  • print epoch # once for every song not for every iter

Add clip iterations

clip_iter=20
n_epochs = 10
If accuracy doesn't improve, use constant divisor for normalizing in stead of tf.reduce_sum.

Check predictions during training phase

dev-script.py

line 29

is_training=False

line 52

bptt_batch_loss, acc, np_state, op, out = 
sess.run([t_model.loss, t_model.mean_acc, t_model.final_state, optimizer, t_model.outputs],
feed_dict={input:bptt_batch_x, label:bptt_batch_y, t_model.initial_state[0]:np_state[0], t_model.initial_state[1]:np_state[1]})
				

need to dump state

state includes

  • best validation loss
  • iter before saving last params
  • epoch (ep) before saving last params
  • clip # (i) before saving last params

convert and process data in gpu

  • Use one hot encoding in the computation graph
  • Feed only integers quantized to range(0-15)
  • For masks use vector of shape [batch_size, n_steps] in stead of [batch_size, n_steps, 16]. Reason follows.
>>> x = tf.placeholder('float32',[2, 3])
>>> y = tf.placeholder('float32',[2, 1])
>>> z = x*y
>>> with tf.Session as s:
...     X = [[1., 1., 1.], [1., 1., 1.]]
...     Y = [[1.], [0.]]
...     Z = s.run([z], feed_dict={x:X, y:Y})
...     print Z

Weighted downsampling

  • In stead of looping, use linear downsampling (from lstm_dim to global_context_size); or repeat the tensor along one axis
  • for each local context, segment from downsample and add in stead of concatenating, if need be. Logically though, concatenation preserves the local context and global context separately and hence is more rich in information.
  • build computation graph faster

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.