
lvnwpu

followers: 0 following: 0 repos: 37 gists: 0

Type: User

lvnwpu's Projects

algorithm-visualizer

Interactive Online Platform that Visualizes Algorithms from Code

autogptq

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

caffe

Caffe: a fast open framework for deep learning.

chromium

The official GitHub mirror of the Chromium source

cnn

A MATLAB implementation of a convolutional neural network.

code

Learning

code-eval

Run evaluation on LLMs using human-eval benchmark

code1

learning

cuda-samples

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

cuda_gemm

A simple high performance CUDA GEMM implementation.

cuda_hgemm

Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.

cuda_op_benchmark

An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only

cutlass

CUDA Templates for Linear Algebra Subroutines

data_analysis

Python data analysis in practice, with accompanying materials

deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

deepspeed-mii

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

emopy

A deep neural net toolkit for emotion analysis via Facial Expression Recognition (FER)

gitkills

googletest

Google C++ Testing Framework

gp_cnn

GP_CNN

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

gtest

gtest

how_to_optimize_in_gpu

A series on GPU optimization that explains in detail how to optimize CUDA kernels. It covers several basic kernels, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.

leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

nn-cuda-example

Several simple examples for popular neural network toolkits calling custom CUDA operators.

open3gpp

Open Source 3GPP Protocol Stack

qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

rep

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

