debowin / cuda-tiled-matrix-multiplication Goto Github PK
View Code? Open in Web Editor NEWOptimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memory within GPU thread blocks.