cuda's Introduction

CUDA

implementing several image processing functions using CUDA

Histogram Computation

used PRIVATIZATION via shared memory along with aggregation of atomic operations to account for repeated values in specific regions of the image. Histogram computation causes many race conditions which need atomic operations but these interensic functions leads to high memory latency because it serializes the work to solve race conditions which results in decreased throughput. by directing the atomic operations to shared memeory and then copying the results back to memory latency is reduced significantly.

Intensity Transformations

Histogram Equalization
Image Thresholding
Bit-plane Slicing: compute / memeory access ratio is 8 for 8-bit gray scale images using global memory

Sparse Matrix Computation

Graph Search

Merge Sort

Convolution

used cached for

Matrix Multiplication

comparison of matrix multiplication between CPU and GPU both using global memroy and shared memroy

Recommend Projects

kareimgazer / cuda Goto Github PK

cuda's Introduction

CUDA

Histogram Computation

Intensity Transformations

Sparse Matrix Computation

Graph Search

Merge Sort

Convolution

Matrix Multiplication

cuda's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent