Giter Club home page Giter Club logo

Comments (5)

yuhuixu1993 avatar yuhuixu1993 commented on June 19, 2024

Hi, @KleinXin , thanks for your interest of our work. As you mentioned that it is a different dataset and task. the hyperparamters need to be consistent with your usual hyper parameters in your task which may have some difference with the training on imagenet. Besides, the accuracy of the search phase is not informative enough to judge the goodness of the searched architectures. You need to train from scratch with the final searched architectures. Besides, the stop epoch is also changeable.

from pc-darts.

KleinXin avatar KleinXin commented on June 19, 2024

Hi, @KleinXin , thanks for your interest of our work. As you mentioned that it is a different dataset and task. the hyperparamters need to be consistent with your usual hyper parameters in your task which may have some difference with the training on imagenet. Besides, the accuracy of the search phase is not informative enough to judge the goodness of the searched architectures. You need to train from scratch with the final searched architectures. Besides, the stop epoch is also changeable.

Do you think it is possible to complete searching and training in one-shot?
In fact, now we are trying to finish the recognition job in the searching step instead of searching architecture followed by training model.

from pc-darts.

KleinXin avatar KleinXin commented on June 19, 2024

Another question is that in 'train_search_imagenet.py' line 178, the parameter 'architect' is not transferred into train function as in 'train_searcch.py'.
In line 206 of 'train_search_imagenet.py', architect.step is commented.

Does this mean the architecture is not updated in 'train_search_imagenet.py' ?

from pc-darts.

yuhuixu1993 avatar yuhuixu1993 commented on June 19, 2024

@KleinXin, in fact the architecture is updated. I just moved the update codes from architecture.py towards this file. The update code is the corresponding lines: 200~205. Hyperpaprameter args.begin controls from which epoch the arch paras begin to update. Besides, you can double check by see the printed arch para.

from pc-darts.

rrryan2016 avatar rrryan2016 commented on June 19, 2024

Hi, @KleinXin , thanks for your interest of our work. As you mentioned that it is a different dataset and task. the hyperparamters need to be consistent with your usual hyper parameters in your task which may have some difference with the training on imagenet. Besides, the accuracy of the search phase is not informative enough to judge the goodness of the searched architectures. You need to train from scratch with the final searched architectures. Besides, the stop epoch is also changeable.

Thanks for your kind reply, really helpful.

I came across the same problem when applying to another classification task, and I just wonder do you meet similar problem that accuracy significantly decreases after epoch args.begin, when searching on CIFAR10 or ImageNet?

And I am interested in your log files when searching on CIFAR10 or ImageNet, just for reference. Would you like to share it ?

from pc-darts.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.