lijunnan1992 / mlnt

122.0 122.0 29.0 24 KB

Meta-Learning based Noise-Tolerant Training

Python 100.00%

mlnt's Issues

GPU runs out of memory with --batch_num=32

Could you please help me out?

When I use --batch_num=32, I cannot run the code on a single GPU. My GPU is a Tesla P100-SXM2 16G, and I can only run the code with --batch_num=1. Since the inner loop calls torch.autograd.grad with create_graph=True and retain_graph=True, and M=10, GPU memory fills up very quickly. It is also very tricky to run MAML on multiple GPUs.

What batch size did you use in your implementation, and how did you deal with the growing GPU memory allocation in the inner for loop? Did you use multiple GPUs or a single GPU?

Thanks a lot
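
A note on where the memory goes: with create_graph=True, each of the M inner steps keeps its graph alive so that the outer loss can be differentiated through the inner updates. The toy below is a minimal pure-Python sketch (no PyTorch), assuming a single scalar parameter and quadratic losses; `inner_steps`, `meta_gradients`, `a`, `b` and `alpha` are hypothetical names used only for illustration. It contrasts the exact meta-gradient, which needs the chain through every inner step, with the first-order approximation, which drops that chain. Whether the authors used such an approximation is not stated in this thread.

```python
def inner_steps(theta, a, alpha, steps):
    """Run `steps` gradient steps on the toy loss L_tr(t) = 0.5 * (t - a)**2.

    Returns the adapted parameter and d(adapted)/d(theta), i.e. the chain
    of per-step Jacobians that a second-order method has to keep around.
    """
    jac = 1.0
    for _ in range(steps):
        grad = theta - a               # dL_tr/dtheta
        hess = 1.0                     # d2L_tr/dtheta2 (constant for a quadratic)
        theta = theta - alpha * grad   # one inner update
        jac *= 1.0 - alpha * hess      # Jacobian of this update w.r.t. its input
    return theta, jac


def meta_gradients(theta, a, b, alpha, steps):
    """Return (exact, first_order) gradients of L_val(adapted) w.r.t. theta,
    where L_val(t) = 0.5 * (t - b)**2 is a toy outer loss."""
    adapted, jac = inner_steps(theta, a, alpha, steps)
    outer_grad = adapted - b           # dL_val/d(adapted)
    exact = outer_grad * jac           # backpropagates through all inner steps
    first_order = outer_grad           # treats d(adapted)/d(theta) as 1
    return exact, first_order


if __name__ == "__main__":
    print(meta_gradients(theta=1.0, a=0.0, b=0.0, alpha=0.5, steps=1))
```

In PyTorch terms, the first-order variant roughly corresponds to calling torch.autograd.grad without create_graph=True in the inner loop. That trades exactness for memory and may change the results relative to the paper.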

How can I run the main code with my own dataset?

When I run the main code with my own dataset, I get this error:

line 116, in train
targets_fast[idx] = targets[neighbor[random.randint(1,num_neighbor)]]
IndexError: index 5 is out of bounds for dimension 0 with size 5

How should num_neighbor be chosen? Does it depend on batch_size or on the labels in my dataset?
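
One thing worth checking for this error: Python's random.randint(a, b) includes both endpoints, so random.randint(1, num_neighbor) can return num_neighbor itself. The sketch below is a minimal illustration, assuming that `neighbor` holds the indices of the nearest samples within the current batch, with the sample itself at position 0. That assumption is not confirmed in this thread, and the helper `pick_neighbor_position` is hypothetical. Under that assumption, `neighbor` needs num_neighbor + 1 entries, and num_neighbor cannot exceed batch size minus one.

```python
import random


def pick_neighbor_position(batch_size, num_neighbor, rng=random):
    """Pick a random position into a hypothetical `neighbor` index list.

    Position 0 is assumed to be the sample itself, so a batch of
    `batch_size` samples offers at most `batch_size - 1` real neighbours.
    """
    usable = min(num_neighbor, batch_size - 1)
    if usable < 1:
        raise ValueError("need at least two samples in the batch")
    # randint is inclusive on both ends, so the result lies in [1, usable].
    return rng.randint(1, usable)


if __name__ == "__main__":
    # With a batch of 5 samples, valid positions are 1..4 even if
    # num_neighbor was requested as 5 or 10.
    print(pick_neighbor_position(batch_size=5, num_neighbor=10))
```

With this clamping, the choice of num_neighbor is bounded by the batch size rather than by the number of labels in the dataset.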

Missing implementation of iterative training?

It seems that the current version does not contain iterative training. Or did I miss it?

After one iteration, training speed becomes very slow

pytorch version = 1.2.0
Tesla v100
CUDA 10.0

I think your environment is pytorch<0.4.0, so I had to change the code in order to run it.
baseline.py runs without problems, but main.py does not.

In baseline.py training epoch issue

It's my mistake. Closed

The true baseline should be Iterative training without Meta-learning?

Dear authors, your ideas are interesting and novel:

Oracle/Mentor (Consistency loss): To make meta-test reliable, the teacher/mentor model should be reliable and robust to real noisy examples. Therefore, they apply iterative training and iterative data cleaning to make the meta-test consistency loss reliable and an optimisation oracle against real noise. (I suppose this should be the true baseline.)
Unaffected by synthetic noise: The meta-training sees synthetic noisy training examples. After training on them, the meta-testing evaluates its consistency with oracle and aims to maximise the consistency, i.e., making it unaffected after seeing synthetic noise. (I suppose this is the key meta-learning proposal.)

In this case, the baseline should be iterative training without meta-learning, that is, without meta-learning on synthetic noisy examples.
It would be more interesting to see exactly how much the meta-learning proposal improves performance over this true baseline.

Could you please share something about this? Thanks so much.

Updating class_loss and consistent_loss at the same time?

Dear author.
Why are you updating class_loss and consistent_loss at the same time?
In Algorithm1, it seems that its processing is decoupled.
I'm sorry if I have misunderstood.

Can't detach views in-place.

RuntimeError: Can't detach views in-place. Use detach() instead
I have no idea why it can't detach views in-place. Any way to get around this problem? thanks!

pytorch 1.3.1

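Could you tell me which PyTorch version you used?

I tried 0.3.1, 0.4.1, and the latest version, and each one fails with errors of varying severity (all of them in main.py). Thanks.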

I have two questions for your paper and code.

Dear Junnan Li,

I read your paper and was very impressed by it. I have two questions about the paper and the code.
First, what is args.alpha in your code (main.py line 31)? I read the paper, but it does not seem to be described there. Could you tell me what this alpha is?
Second, how can I do iterative learning? I could train for 1 epoch, but I could not run the 3 rounds of iterative learning described in your paper. Could you help me reproduce your results?

Sorry to ask this of you when you are busy but I appreciate your help.
Thanks so much.
