jiqing-feng Goto Github PK
Type: User
Type: User
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Accessible large language models via k-bit quantization for PyTorch.
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
The official implementation of the paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and efficient attack methods to generate toxic content for safety-driven diffusion models.
A distributed deep learning framework.
GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Extending Hugging Face transformers APIs for Transformer-based models and improve the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve the inference efficiency on Intel platforms.
A framework for few-shot evaluation of autoregressive language models.
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools
Accelerate inference of 🤗 Transformers with Intel optimization tools
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Camera-ready repo for ProtST
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Graph Neural Network Library for PyTorch
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
Pipeline Parallelism for PyTorch
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A high-throughput and memory-efficient inference and serving engine for LLMs
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.