sihouzi21c's Projects

ao

PyTorch native quantization and sparsity for training and inference

blis

BLAS-like Library Instantiation Software Framework

cpp-httplib

A C++ header-only HTTP/HTTPS server and client library

cpuinfo

CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)

cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API, with samples showing how to use it

cutlass

CUDA Templates for Linear Algebra Subroutines

executorch

On-device AI across mobile, embedded and edge for PyTorch

fbgemm

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

fbjni

A library designed to simplify the usage of the Java Native Interface

flatbuffers

FlatBuffers: Memory Efficient Serialization Library

fmt

A modern formatting library

fp16

Conversion to/from half-precision floating point formats

fxdiv

C99/C++ header-only library for division via fixed-point multiplication by inverse

gloo

Collective communications library with various primitives for multi-machine training.

googletest

GoogleTest - Google Testing and Mocking Framework

ideep

Intel® Optimization for Chainer*, a Chainer module providing a NumPy-like API and DNN acceleration using MKL-DNN.

ittapi

Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API

kineto

A CPU+GPU profiling library that provides access to timeline traces and hardware performance counters.

llama.onnx

LLaMA/RWKV ONNX models, quantization, and test cases

llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

llm4decompile

Reverse Engineering: Decompiling Binary Code with Large Language Models

mimalloc

mimalloc is a compact general purpose allocator with excellent performance.

nccl

Optimized primitives for collective multi-GPU communication

nnpack

Acceleration package for neural networks on multi-core CPUs

nvtx

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
