jiqing-feng Goto Github PK

followers: 5.0 following: 7.0 repos: 25.0 gists: 0.0

Type: User

jiqing-feng's Projects

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

autoawq

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

clipbert

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

The official implementation of the paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and efficient attack methods to generate toxic content for safety-driven diffusion models.

flexflow

A distributed deep learning framework.

gear

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

intel-extension-for-transformers

Extending Hugging Face transformers APIs for Transformer-based models and improve the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve the inference efficiency on Intel platforms.

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

lookaheaddecoding

lora

models

Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs

neural-compressor

Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.

jiqing-feng Goto Github PK

jiqing-feng's Projects

Recommend Projects

Recommend Topics

Recommend Org