StrongBob's Projects
Aaron: Compile-time Kernel Adaptation for Multi-DNN Inference Acceleration on Edge GPU [SenSys'22 Best Poster]
👀
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
ARCore SDK for Android Studio
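Real-time stock quote data API for A-shares: completely free Shanghai and Shenzhen securities data (** stock market). A minimal Python-wrapped API covering daily bars, historical K-lines, intraday lines, and minute bars, all collected in real time. The system uses Sina and Tencent as dual data sources with automatic failover, and returns stock data as a DataFrame for querying, research, quantitative analysis, and automated trading systems. It greatly reduces the data-acquisition workload for quantitative researchers, letting them focus on researching and implementing strategies and models.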
AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)
:computer: 🎉 An awesome & curated list of best applications and tools for Windows.
Foundation Model for X and X for Foundation Model
Awesome resources for GPUs
A curated list of image captioning and related area resources. :-)
A curated list of references for MLOps
Collection of awesome podcasts
This is a list of awesome works related to quantum computing applications (e.g. quantum computing for ML acceleration).
This is a list of awesome papers related to edge AI inference.
A list of awesome compiler projects and papers for tensor computation and deep learning.
BlockDrop: Dynamic Inference Paths in Residual Networks
Quantized Neural Networks (QNNs) on PYNQ
Training and evaluation pipeline for MEG and EEG brain signal encoding and decoding using deep learning. Code for our paper "Decoding speech perception from non-invasive brain recordings" published in Nature Machine Intelligence, 2023.
An MLIR-Based Ideas Landing Project
Vehicle detection using YOLO in Keras, running at 21 FPS
Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.
Code for <Confidence Regularized Self-Training> in ICCV19 (Oral)
An unofficial CUDA assembler, for all generations of SASS, hopefully :)
A tool for profiling CUDA kernels
This is a repo for my CUDA learning.
English version