Giter Club home page Giter Club logo

Comments (4)

daniel-jasinski avatar daniel-jasinski commented on July 24, 2024

Hello,

The commit you linked seems to be private and not accessible for me.

In theory nvcc should itself generate the __ldg instruction for pointers marked const and __restrict __. However in our case the pointers are not regular function arguments but are passed around inside functors, which makes the compiler hesitate about generating this optimization. In my experience with CUDA 7.0 and 6.5, the compiler treated __restrict __ pointers inside functors as regular pointers, therefore I decided to include this instruction manually.

from rapidcfd-dev.

TonkomoLLC avatar TonkomoLLC commented on July 24, 2024

Hello,

For reasons I cannot explain, clinking on the embedded link does lead to a page not found error. But if I copy and paste the URL into a browser I can see the commit on my fork of RapidCFD. You can also try copying and pasting this tinyURL into your browser to get to my commit: https://tinyurl.com/y6v7xgrs Again, simply clicking on the link fails, but copy and paste works.

Your feedback on the __restrict __ pointers is very helpful. If you can access the commit, I am interested in your feedback on where the intrinsic __ldg function should be applied. If you do not have time, I will continue to think about this and work it out in time.

from rapidcfd-dev.

TonkomoLLC avatar TonkomoLLC commented on July 24, 2024

Hello,

I may have figured out where to apply the intrinsic __ldg function. In addition to the URL above, please look at https://github.com/TonkomoLLC/RapidCFD-dev/commit/27dfbb054849e2e7b622aa69921f9fd8bd9357ca
You may need to copy and paste the above URL into your browser.

The code compiles and executes with the proposed changes. However, I am interested in feedback on this implementation.

from rapidcfd-dev.

daniel-jasinski avatar daniel-jasinski commented on July 24, 2024

Since commit 6b40d01 constant cache is for devices with CC >= 3.5.

from rapidcfd-dev.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.