Comments (4)
Hello,
The commit you linked seems to be private and not accessible for me.
In theory nvcc should itself generate the __ldg instruction for pointers marked const and __restrict __. However in our case the pointers are not regular function arguments but are passed around inside functors, which makes the compiler hesitate about generating this optimization. In my experience with CUDA 7.0 and 6.5, the compiler treated __restrict __ pointers inside functors as regular pointers, therefore I decided to include this instruction manually.
from rapidcfd-dev.
Hello,
For reasons I cannot explain, clinking on the embedded link does lead to a page not found error. But if I copy and paste the URL into a browser I can see the commit on my fork of RapidCFD. You can also try copying and pasting this tinyURL into your browser to get to my commit: https://tinyurl.com/y6v7xgrs Again, simply clicking on the link fails, but copy and paste works.
Your feedback on the __restrict __ pointers is very helpful. If you can access the commit, I am interested in your feedback on where the intrinsic __ldg function should be applied. If you do not have time, I will continue to think about this and work it out in time.
from rapidcfd-dev.
Hello,
I may have figured out where to apply the intrinsic __ldg function. In addition to the URL above, please look at https://github.com/TonkomoLLC/RapidCFD-dev/commit/27dfbb054849e2e7b622aa69921f9fd8bd9357ca
You may need to copy and paste the above URL into your browser.
The code compiles and executes with the proposed changes. However, I am interested in feedback on this implementation.
from rapidcfd-dev.
Since commit 6b40d01 constant cache is for devices with CC >= 3.5.
from rapidcfd-dev.
Related Issues (20)
- You can run RC on multiple GPUs on different nodes. You just need to provide mapping from MPI rank to CUDA device ID on a particular node. You can do it using **-devices** argument. HOT 1
- What is the actual dependency that requires ThirdParty-dev for a multi GPU setup? HOT 1
- supersonicFreestream HOT 26
- RapidCFD on windows (without root rigts) HOT 1
- Ld error during compilation HOT 2
- dynamic mesh calculations on GPU? HOT 10
- mpi.h is missing even if openmpi exists in ThirdParty-dev
- compressibleMultiphaseInterFoam HOT 1
- Attempt to update to newer OpenFOAM v2306 HOT 3
- Supported Linear Algebra Operations? HOT 1
- Processing with a list of fields on the GPU.
- Compiler setup and path to thrust for porting RapidCFD to AMD GPUs HOT 7
- AINV Preconditioner
- OSHA1stream functionality of RapidCFD needs to be updated to C++11 or many simulations don't run HOT 1
- Errors compiling solver HOT 2
- Custom preconditioner implemantation approach? HOT 7
- Running RapidCFD on openFoam9 HOT 18
- RapidCFD-container HOT 66
- rapid cfd not able to link the library HOT 12
- Probe function is not available HOT 11
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rapidcfd-dev.