yu-gyoung-yun Goto Github PK
Type: User
Type: User
Training and serving large-scale neural networks
RISCV Gem5 simulator flow for Architetture dei Sistemi di Elaborazione
A curated list of awesome projects and papers for distributed training or inference
A collection of research papers on efficient training of DNNs
Embedded and mobile deep learning research resources
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
A list of awesome compiler projects and papers for tensor computation and deep learning.
Source code examples from the Parallel Forall Blog
NVIDIA CUPTI samples mirror.
CUDA Templates for Linear Algebra Subroutines
DGIST, 참고 코드: 밑바닥 부터 시작하는 딥러닝
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Example models using DeepSpeed
Documentation for NVDLA.
A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data pointers
Transformer related optimization, including BERT, GPT
Hands-On GPU Programming with Python and CUDA, published by Packt
HeteroSim is a full system simulator supporting x86 multicore processors combined with a FPGA via bus-based architecture. Flexible design space exploration is enabled by a wide range of system configurations. A complete simulation flow with compiler support is provided so that a full system simulation can be performed with various performance metri
A retargetable MLIR-based machine learning compiler and runtime toolkit.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
LLM Systems Paper List
A framework for few-shot evaluation of autoregressive language models.
News and Paper Collections for Machine Learning Hardware
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.