Comments (3)
from cuda-programming.
感谢您的提问。书中没有说共享内存的层数一定是32层,只不过图8.1显示的是32层(因为是以数组转置为例进行讲解)。共享内存只规定每一层有32个bank,至于有多少层,那是由共享内存数组的数据量决定的。如果您仔细看,会发现书中第87页的第一段所举的例子就只有4层。
from cuda-programming.
再明确回答一下,如果定义了
#define TILE_DIM 32
// in the kernel:
__shared__ float S[TILE_DIM][TILE_DIM+1];
那么该共享内存数组 S
的数据需要使用33层,每层有32个4字节的bank。
from cuda-programming.
Related Issues (20)
- GeForce RTX2070或者RTX2080系列,双精度峰值很低,这个原因是什么? HOT 2
- 请问有pdf吗? HOT 4
- 关于error.cuh中的CHECK宏报错信息的疑问。 HOT 4
- 97页 印刷错误 HOT 2
- 34页缺字 HOT 1
- P55 因为我们的CPU程序中的计算是串行的,其性能基本上与数组长度无关。 HOT 2
- 请教一下第51页的算术强度-理论寄存器带宽-每个FMA的操作数 是怎么求得的? HOT 2
- BUG-73页 listing7.1 行12 HOT 1
- 是否有CUDA交流群? HOT 2
- P108 代码有错 HOT 4
- extern __shared__ real s_y[]; HOT 2
- block、thread、wrap间的执行顺序
- Github提供的代码与书本所示代码(p129)不一致 HOT 2
- 英文版第三章有错误 HOT 3
- 代码优化 HOT 3
- 书与GitHub代码不一致(及再勘误) HOT 1
- 关于cublas_gemm.cu代码的计算结果 HOT 2
- p39页上方第14行代码 HOT 4
- 44页的cuda-memcheck工具无输出 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cuda-programming.