slahiruk / cuda-tiled-matrix-multiplication Goto Github PK
View Code? Open in Web Editor NEWThis project forked from debowin/cuda-tiled-matrix-multiplication
Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memory within GPU thread blocks.