Comments (4)
For 1: for mini-imagenet, hyperparameters were tuned to maximize validation performance. For omniglot, they were tuned to maximize training performance, which tended to correlate well with test performance. The CMA hyperparameter optimization code is not included in this repo since it is rather specific to our infrastructure.
For 2: the run scripts in this repo are mainly intended to be used to reproduce our results. If you want to further optimize hyper-parameters, you will want to make sure to only look at the outputs for the validation/training set.
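As a rough illustration of "only look at the validation outputs", here is a minimal sketch of picking a configuration by validation accuracy alone. The dictionary keys, the run names, and the numbers are all hypothetical placeholders, not values from this repo or its scripts.

```python
def select_by_validation(results):
    """Return (name, val_acc) for the run with the highest validation accuracy.

    `results` is a list of dicts with hypothetical keys 'name' and 'val_acc'.
    Test accuracy is deliberately not part of the input, so it cannot
    influence the choice.
    """
    best = max(results, key=lambda r: r["val_acc"])
    return best["name"], best["val_acc"]


# Made-up numbers, purely for illustration.
runs = [
    {"name": "run_a", "val_acc": 0.58},
    {"name": "run_b", "val_acc": 0.62},
    {"name": "run_c", "val_acc": 0.60},
]

chosen_name, chosen_val_acc = select_by_validation(runs)
```

Only after the configuration is fixed would you evaluate on the test set, and only once.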
from supervised-reptile.
Did you run the transductive or non-transductive version? The accuracy for transduction is slightly better. Also, note the error margins on these experiments.
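To make the point about error margins concrete, here is a minimal sketch of a 95% confidence interval for a measured accuracy, using the normal approximation to the binomial. It assumes the test predictions are independent; the paper's intervals may have been computed differently, and the function names and sample counts below are made up for illustration.

```python
import math


def accuracy_interval(num_correct, num_total, z=1.96):
    """Return (accuracy, half_width) using the normal approximation.

    z=1.96 corresponds to a two-sided 95% interval.
    """
    acc = num_correct / num_total
    half_width = z * math.sqrt(acc * (1.0 - acc) / num_total)
    return acc, half_width


def intervals_overlap(a, b):
    """True if two (accuracy, half_width) intervals overlap."""
    (acc_a, hw_a), (acc_b, hw_b) = a, b
    return abs(acc_a - acc_b) <= hw_a + hw_b


# Hypothetical counts: two runs evaluated on 1000 test samples each.
run_1 = accuracy_interval(480, 1000)
run_2 = accuracy_interval(500, 1000)
same_within_error = intervals_overlap(run_1, run_2)
```

If the intervals of your run and the reported number overlap, the difference may just be evaluation noise rather than a reproduction problem.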
@unixpickle I re-read the paper. My mistake. I thought the accuracy should be above 90% at least. I am a newbie to the meta-learning area. I ran the non-transductive version; I will switch to omniglot and try again. Thanks for your quick response!
@unixpickle Two more questions.
- How did you determine the optimal value of meta_batch_size without any validation process within the outer loop?
- Why did you include the test set in your training process rather than validation set? It seems a bit odd.