Giter Club home page Giter Club logo

Comments (7)

ntrost57 avatar ntrost57 commented on June 9, 2024 2

There is going to be a change soon, which will introduce multiple streams in order to overlap packing and sending with the interior computation. If you are using MPI, I guess you compile from source. I'll update this issue, once that change has been implemented, so you can re do your performance tests.

from rocalution.

ntrost57 avatar ntrost57 commented on June 9, 2024

Currently, rocALUTION does not support direct GPU-GPU communication. This is something we will be adding in a future release, however.

from rocalution.

jdechard avatar jdechard commented on June 9, 2024

Thanks for you quick answer.
Do you have an idea of when this feature will be supported ?

from rocalution.

ntrost57 avatar ntrost57 commented on June 9, 2024

I am not able to give you a specific release. May I ask why you are interested in such feature? I do not expect any performance improvement on most solvers, as time critical algorithms such as SpMV can fully hide communication by computation.

from rocalution.

jdechard avatar jdechard commented on June 9, 2024

I understand your point but the bench case we are working on shows that about 10% of the time of each rocALUTION solver call (1 per time step) is dedicated to hipMemcpy for these ghost cells exchange, with almost no computational kernels overlap. We use the CG solver with the c-amg preconditionner. It would have been nice to try this feature on AMD nodes specifically designed to do so. We will test it when it will be available :).

from rocalution.

jdechard avatar jdechard commented on June 9, 2024

Sounds a good feature to try, thanks !

from rocalution.

ntrost57 avatar ntrost57 commented on June 9, 2024

I have merged some major modifications to GlobalMatrix::Apply() in order to overlap communication and packing with interior matrix vector multiplication. This should slightly improve SpMV performance and scale linear with additional nodes / GPUs. Let me know, if this fixes the issue you have been observing.
It will still use the host for communication, but you should not see any communication related slow downs.
Note, that you will have to checkout develop branch, in order to see those changes.

from rocalution.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.