Giter Club home page Giter Club logo

residual.mnist's People

Stargazers

Julia Gusak avatar Qiang Chen avatar lu jiawen avatar Jose Antonio avatar Ricard Racinskij avatar Pratheeksha avatar  avatar Kelvin avatar Yili Zhao avatar Li Xinqi avatar Wei avatar Yu Zhang avatar sdf0615 avatar Luca Antiga avatar  avatar Kaiyu Yang avatar Kimmy avatar Xinwei He avatar  avatar cheaster avatar  avatar Jonghoon Jin avatar  avatar  avatar Jun Deng avatar Idris Yusupov avatar

Watchers

Kimmy avatar James Cloos avatar  avatar Mingkun Yang avatar  avatar  avatar

residual.mnist's Issues

Curious about ReLU layers inside `resUnit`

Hello!

I saw your MNIST writeup here. Looks really cool!

I'm also trying to implement residual networks in torch with mixed success. It would be great to compare notes.

I'm curious: the call to convunit() always adds a ReLU layer to the end of each unit it constructs. This means that each resUnit is calculating ReLU( ReLU(f(x)) + x ). If I'm interpreting the architecture correctly, it looks like this:

Example architecture

Doing this means that the first term of the sum can never be negative. What that means is that x will never decrease as it propagates through the network. I think this differs from the original paper: if I read correctly, they don't add that second ReLU until after the building block.

Would this affect performance at all?

(I'm not sure how much of an issue this is. My experiments on CIFAR are pretty inconclusive...)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.