Giter Club home page Giter Club logo

litianjian's Projects

blocksparse icon blocksparse

Efficient GPU kernels for block-sparse matrix multiplication and convolution

caffe icon caffe

Caffe: a fast open framework for deep learning.

caffe-model icon caffe-model

Caffe models (including classification, detection and segmentation) and deploy files for famouse networks

cuda-convnet2 icon cuda-convnet2

Automatically exported from code.google.com/p/cuda-convnet2

cuda-samples icon cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

cudabmk icon cudabmk

Source for Demystifying GPU Microarchitecture through Microbenchmarking

cutlass icon cutlass

CUDA Templates for Linear Algebra Subroutines

deepcore icon deepcore

一款基于CUDA的针对NVIDIA的GPU进行了深度优化的深度学习底层核心计算库,在相同的算法下矩阵乘法和卷积的性能同于甚至大于最新版本的cudnn,该项目仍在持续开发中!

deepdetect icon deepdetect

Deep Learning API and Server in C++11 support for Caffe, Caffe2, Dlib, Tensorflow, XGBoost and TSNE

deepperf icon deepperf

DeepPerf is a set of cuda assembling developing tools

digits icon digits

Deep Learning GPU Training System

dive-into-dl-pytorch icon dive-into-dl-pytorch

本项目将《动手学深度学习》原书中的MXNet代码实现改为PyTorch实现。

gpgpu-sim_simulations icon gpgpu-sim_simulations

A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments for simulations that complete in a reasonable amount of time on GPGPU-Sim.

hgemmtest icon hgemmtest

Trying to write a hgemm using opencl for tensor cores. Involves inline assembly

isaac icon isaac

Automatically-Tuned Input-Aware implementations of HPC/DNN primitives

leetcodeanimation icon leetcodeanimation

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

marlin icon marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

maxas icon maxas

Assembler for NVIDIA Maxwell architecture

maxdnn icon maxdnn

High Efficiency Convolution Kernel for Maxwell GPU Architecture

mvision icon mvision

机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶

ncnn icon ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

netron icon netron

Visualizer for deep learning and machine learning models

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.