Young Jin Kim's Projects
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN)
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
A high-throughput and memory-efficient inference and serving engine for LLMs