Ashutosh Mishra's Projects
abc_bee_hotel
AMD's graph optimization engine.
Config files for my GitHub profile.
Some code that I have submitted as solutions to various programming problems
Checkpoint/Restore tool
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Simple dictionary-to-YAML converter for a fixed kind of application
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
The world's simplest facial recognition API for Python and the command line
Just some programming problem solutions
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
HIPIFY: Convert CUDA to Portable C++ Code
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
AMD's Machine Intelligence Library
Tiny, fast(ish), self-contained and fully loaded printf, sprintf etc. implementation, mainly for embedded systems.
Handcrafted little code snippets for testing various things during debugging
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ROCm Communication Collectives Library (RCCL)
ROCm CMake modules
ROCm OpenCL Runtime
Bandwidth test for ROCm
The ROCm Validation Suite is a system administrator’s and cluster manager's tool for detecting and troubleshooting common problems affecting AMD GPU(s) running in a high-performance computing environment, enabled using the ROCm software stack on a compatible platform.