Giter Club home page Giter Club logo

Comments (11)

kloudkl avatar kloudkl commented on May 1, 2024

Decaf has already used MPI in a few places.

from caffe.

Yangqing avatar Yangqing commented on May 1, 2024

Just a precaution type note: I used mpi in my earlier projects that never
got open-sourced (parallel linear models over a reasonably sized cluster,
see e.g. my ICCV 2013 task adaptation paper). I don't recall completely
making mpi runnable under either decaf and caffe, though...

Yangqing

On Thu, Feb 6, 2014 at 9:30 PM, kloudkl [email protected] wrote:

Decaf has already used MPI in a few placeshttps://github.com/UCB-ICSI-Vision-Group/decaf-release/search?q=mpi&ref=cmdform
.

Reply to this email directly or view it on GitHubhttps://github.com//issues/65#issuecomment-34405438
.

from caffe.

kloudkl avatar kloudkl commented on May 1, 2024

The first open source large scale machine learning projects that I encountered were Vowpal Wabbit[1] and Edward Y. Chang's PSVM, PLDA, Parallel Spectral Clustering which all used MPI but none were based on CUDA. Neither did they train deep nonlinear models. But the achievements of the industry such as Baidu IDL should motivate the academy towards a comparable large scale distributed training framework. A progressive roadmap may be to implement a version on CPU at first and add GPU capability after the initial success.

[1] Alekh Agarwal, Olivier Chapelle, Miroslav Dudik, John Langford, A Reliable Effective Terascale Linear Learning System, 2011.

from caffe.

jamt9000 avatar jamt9000 commented on May 1, 2024

I am interested in working on this. Is there still work ongoing?

I am thinking something like [1], which uses MPI + cuda-convnet. There is also an interesting write-up by Netflix [2] where they use distributed computing for hyperparameter tuning.

[1] Paine, T., Jin, H., Yang, J., Lin, Z., & Huang, T. (2013). GPU Asynchronous Stochastic Gradient Descent to Speed Up Neural Network Training. arXiv Preprint arXiv:1312.6186

[2] http://techblog.netflix.com/2014/02/distributed-neural-networks-with-gpus.html

from caffe.

kloudkl avatar kloudkl commented on May 1, 2024

@Yangqing, would you please recover the related commits?

for commit in 64e28ba 591c36b a3eb62a a48147c; do git cherry-pick $commit; done

from caffe.

kloudkl avatar kloudkl commented on May 1, 2024

Microsoft Project Adam sounds very promising [1].

[1] Trishul Chilimbi, Yutaka Suzue, Johnson Apacible, Karthik Kalyanaraman, "Project Adam: Building an Efficient and Scalable Deep Learning Training System"
To appear in the 11th USENIX Symposium on Operating Systems Design and Implementation '14 (OSDI), Oct. 2014.

Unfortunately, the paper won't be public until the conference is held in October. Did anyone register the ODSI 2014 and have access to the paper?

from caffe.

bhack avatar bhack commented on May 1, 2024

Is it based on Hogwild ?

from caffe.

kloudkl avatar kloudkl commented on May 1, 2024

The answer is only in the paper.

from caffe.

bhack avatar bhack commented on May 1, 2024

@kloudkl Some other preliminar info on Project Adam.

from caffe.

futurely avatar futurely commented on May 1, 2024

The paper became public this Monday.

from caffe.

shelhamer avatar shelhamer commented on May 1, 2024

This is in-progress through #1148 so this place-holder issue is no longer needed.

from caffe.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.