lijunnan1992 / mlnt Goto Github PK
View Code? Open in Web Editor NEWMeta-Learning based Noise-Tolerant Training
Meta-Learning based Noise-Tolerant Training
Could you please help me out?
When I use --batch_num=32, I cannot run the code on a single GPU. My GPU is Tesla P100-SXM2 16G. I only can run the code with --batch_num=1. Since we have create_graph=True and retain_graph=True in the inner loop with torch.autograd.grad, and M=10, the GPU memory gets allocated too fast. It is also very tricky to run MAML on multiple GPUs.
I am wondering in your implementation, what was the batch size? and how did you address the problem of increasing GPU memory allocation in the inner for loop? Did you use multiple GPUs or a single GPU?
Thanks alot
when I run the main code with my dataset, it comes to this error:
line 116, in train
targets_fast[idx] = targets[neighbor[random.randint(1,num_neighbor)]]
IndexError: index 5 is out of bounds for dimension 0 with size 5
how to select num_neighbor?is that based on batch_size ,or my dataset's label?
It seems that the current version do not contain iterative training? Or I miss it?
It's my mistake. Closed
Dear authors, your ideas are interesting and novel:
In this case, the baseline should be Iterative training without Meta-learning. That is without meta-learning on synthetic noisy examples.
It is more interesting to see how much exactly meta-learning proposal improves the performance versus the true baseline.
Could you please share something about this? Thanks so much.
Dear author.
Why are you updating class_loss and consistent_loss at the same time?
In Algorithm1, it seems that its processing is decoupled.
I'm sorry if I have misunderstood.
RuntimeError: Can't detach views in-place. Use detach() instead
I have no idea why it can't detach views in-place. Any way to get around this problem? thanks!
pytorch 1.3.1
我试过了0.3.1, 0.4.1和最新版本,都出现不同程度的错误(都发生在main.py),谢谢
Dear Junnan Li,
I read your paper and it was so impressed. I have two questions for your paper and code.
First of all, what is args.alpha in your code (main.py line 31)? I read your paper, but it seems that it was not written. Could you tell me about this alpha?
Finally, how can I do iterative learning? I could train 1 epoch, but I couldn't do iterative learning 3 epochs like your paper. Could you help me reproduce your paper?
Sorry to ask this of you when you are busy but I appreciate your help.
Thanks so much.
since the pretrain size is 224 but cifar size is 32, so, directly use this code would cause the out size error, could you please tell me how can I use the cifar 10 data in this code?
Your baseline model seems to be no noise, so what is his role?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.