
sirius93123's Projects

libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept GitHub pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

max-pool-cuda

Implements the max-pool filter in CUDA in two ways: using the built-in library and using shared memory.

maxas

Assembler for the NVIDIA Maxwell architecture

megatron-lm

Ongoing research training transformer language models at scale, including: BERT & GPT-2

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

mmcv

OpenMMLab Computer Vision Foundation

model-compression-deploy

Model compression and deployment. Compression: (1) quantization: quantization-aware training, 16/8/4/2-bit (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), ternary/binary (TWN/BNN/XNOR-Net); post-training quantization, 8-bit (TensorRT); (2) pruning: normal, regular, and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization folding for quantization. Deployment: TensorRT, FP32/FP16/INT8 (PTQ calibration), op-adapt (upsample), dynamic_shape.

natural-gradients

Collection of algorithms for approximating the Fisher Information Matrix for natural gradient (and second-order methods in general)

nbassembler

Assembler and decompiler for NVIDIA (Maxwell, Pascal, Volta, Turing, Ampere) GPUs.

nncf

PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.

nni

An open-source AutoML toolkit that automates the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.

nvcc-llvm-ir

Enables on-the-fly manipulation of the LLVM IR code of CUDA sources

once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

onnx

Open standard for machine learning interoperability

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
