Giter Club home page Giter Club logo

Shintaro Iwasaki's Projects

apex icon apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

bolt icon bolt

Official BOLT Repository

ecp-st-car-public icon ecp-st-car-public

The Exascale Computing Project Software Technologies Capability Assessment Report - Public Version

fbgemm icon fbgemm

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

fbtt-embedding icon fbtt-embedding

This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.

folly icon folly

An open-source C++ library developed and used at Facebook.

jekyll-action icon jekyll-action

A GitHub Action to publish Jekyll based content as a GitHub Pages site

kineto icon kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

llvm-project icon llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

ompi icon ompi

Open MPI main development repository

osu-abt icon osu-abt

OSU Micro-Benchmarks 5.7 + Argobots

p2s2-www icon p2s2-www

International Workshop on Parallel Programming Models and Systems Software for High-End Computing Website

pytorch icon pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

qthreads icon qthreads

Lightweight locality-aware user-level threading runtime.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.