Comments (3)
I didn't expect such a behavior because caches are created for each AbstractNode
whose complied codes does not include any tensors. (As a design), users do not have to consider the existence of the function as it is not exported.
I suspect that this is caused by some caching in the compiler and different initializations ...
I thought about that one too; and I guess the current implementation of adam has something to do with since if i change these lines:
;; Case1. Adam -> SGD
;; From
(mapc (hooker x (Adam x :lr lr)) (model-parameters model))
;; To
(mapc (hooker x (SGD x :lr lr)) (model-parameters model))
;; Case2. Deleting Adam
;; deleting this line (and fails to optimize the model)
(mapc #'call-optimizer! (model-parameters model))
In the both two cases, the loss is in the same range for each epoch, and each run.
I'm still analyzing the issue. Thank you for the bug report.
from cl-waffe2.
The issue should be fixed at the latest PR #149; so closing.
I have been had a lot on my plates this month and i could not tackle this issue quickly; sorry for my delayed answer.
from cl-waffe2.
Yes, it works, thanks!
from cl-waffe2.
Related Issues (20)
- [Enhancement] macros to add HOT 2
- [Fix] a ton of undefined-type HOT 1
- [BugFix] Proceed -> Proceed HOT 1
- [BugFix] Batched Matmul won't work HOT 1
- [Enhancement] package naming could be confusing
- [FixME] features on cl-waffe2 is perfectly working on REPL, but the same operation in the test isn't working... HOT 1
- [API Enhancement] Project Simplification for those who is new to Common Lisp HOT 1
- [BugFix] Lambda wrapped backward definition can't return more than one gradient. (i.e.: define-static-node) HOT 1
- My TODO List HOT 1
- [Refactoring] I'm considering rewriting the entire code of VM HOT 5
- Various discussions arising from examples and tutorials HOT 21
- [Fix] Some specifications on Subscript DSL should be changed in the future release HOT 1
- [TODO] AVX512/SSE Extension, CUDA Backend, and more, other backends...
- A survey of improving performance
- [Refactor] APIs for Network Construction HOT 2
- Wrapping/Adapting another frameworks?
- [FixME] Dynamically Shaped Conv2D isn't working due to !reshape HOT 1
- [WIP] Petalisp as a high-level IR?
- [WIP] Remove obsolete SIMD and BLAS dependencies (for cpu) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cl-waffe2.