Comments (5)

andrea-cimatoribus-pix4d commented on August 25, 2024

@yuslepukhin

I collected some more runtime measurements using the publicly available Python bindings (v1.18.1). The results are interesting: the DirectML bindings are the fastest, at 1.5 s per image; CUDA is at least as slow. If I run the same model with PyTorch+CUDA on Windows, I get ~0.5 s per image, which is the same runtime I get on Linux on identical hardware. On Linux, our C++ onnxruntime integration is faster than PyTorch by ~20%. It seems that the model has some issue on Windows in onnxruntime that is not clearly related to the execution provider. As noted in my original message, the issue seems to come from some extra memcpy operations, but I don't really understand what is causing them.
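For anyone who wants to reproduce this kind of comparison, here is a minimal sketch of a per-provider timing loop. It is not the script used for the numbers above. The helper names `time_per_call` and `benchmark_providers` are made up for illustration, and the model path and input feed are whatever your model needs.

```python
import statistics
import time


def time_per_call(fn, warmup=3, repeats=10):
    """Return the median wall-clock time in seconds of one call to fn()."""
    for _ in range(warmup):
        fn()  # warm-up runs absorb one-off costs such as kernel compilation
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def benchmark_providers(model_path, feed, providers):
    """Time session.run for each requested execution provider that is available."""
    import onnxruntime as ort  # imported lazily so time_per_call works without it

    available = set(ort.get_available_providers())
    results = {}
    for provider in providers:
        if provider not in available:
            continue  # skip providers missing from this onnxruntime build
        session = ort.InferenceSession(model_path, providers=[provider])
        results[provider] = time_per_call(lambda: session.run(None, feed))
    return results
```

A call could look like `benchmark_providers("model.onnx", {"input": image}, ["DmlExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"])`, where `"model.onnx"` and the input name `"input"` are placeholders. The median is used so that a single slow run does not skew the result.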

from onnxruntime.

sophies927 commented on August 25, 2024

@smk2007

andrea-cimatoribus-pix4d commented on August 25, 2024

I ran some extra experiments with dynamic axes. I can confirm that removing the dynamic axes provides a small speed-up (a few % with DirectML), but once again it does not close the gap between Windows+DirectML and Linux+CUDA.
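To check whether an exported model still has dynamic axes, something like the following sketch may help. It assumes the Python bindings report symbolic dimensions as strings or `None` in each input's `shape`. Both helper names are hypothetical.

```python
def dynamic_axes(shape):
    """Return the indices of dimensions that are not fixed positive integers."""
    return [
        i for i, dim in enumerate(shape)
        if not (isinstance(dim, int) and dim > 0)
    ]


def report_dynamic_inputs(model_path):
    """Map each model input name to the list of its dynamic axis indices."""
    import onnxruntime as ort  # imported lazily so dynamic_axes works without it

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    return {inp.name: dynamic_axes(inp.shape) for inp in session.get_inputs()}
```

An input whose list comes back empty has a fully static shape.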

yuslepukhin commented on August 25, 2024

Any reason not to compare Windows CUDA with Linux CUDA?

andrea-cimatoribus-pix4d commented on August 25, 2024

> Any reason not to compare Windows CUDA with Linux CUDA?

The reason is that I don't have a reliable way to build onnxruntime with CUDA on Windows; cudart/cudnn distribution for Windows has been sketchy at best in recent times (at least up to 11.7, which is what I currently use). So I cannot do the measurements on the same infrastructure I use for the other cases. A hacky test, however, suggests that Windows+CUDA suffers from a performance penalty similar to that of Windows+DirectML.
