Giter Club home page Giter Club logo

fast's Introduction

FaST-GShare: Enabling Efficient Spatio-Temporal GPU Sharing in Serverless Computing for Deep Learning Inference

Paper

Authors: Jianfeng Gu, Yichao Zhu, Puxuan Wang, Mohak Chadha, Michael Gerndt

1. Base Infrastructure Configuraiton and Deployment:

  • Install CUDA Driver. (both Master and Node)
sudo apt-get update
sudo apt-get install -y nvidia-driver-525
nvidia-smi
## if has problem, reboot the server
sudo reboot
  • Install CUDA toolkit (both Master and Node)
sudo apt install -y nvidia-cuda-toolkit
  • Install Kubernetes (Master Node)
bash install_k8s_master_node.sh

check if the master node's is under untiant

kubectl describe node <node_name> | grep -i taint

if not untaint and the master node is also regard as a computing node, untiant the node

kubectl taint node `hostname` node-role.kubernetes.io/control-plane:NoSchedule-
kubectl taint node `hostname` node-role.kubernetes.io/master:NoSchedule-

  • Install Kubernetes (Node)
bash install_k8s_node.sh

Then join the master node after install nvidia-container-toolkit shown below

# check the join command
kubeadm token create --print-join-command
# follow the command prompted and join the node to the master node
kubeadm join <ip:port> --token <the_join_token>
  • Install/configure nvidia-container-toolkit (both Master and Node)
bash install_nvidia_container_toolkit.sh
  • Install and deploy nvidia-device-plugin: (only the Master)
kubectl create -f https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.13.0/nvidia-device-plugin.yml

2. Deployment:

  • clone repo:
git clone [email protected]:YukioZzz/FaST.git
  • pull submodules:
git submodule init && git submodule update --recursive --remote
  • install prerequisites and deploy components:
bash ./deploy.sh
  • apply test function:
kubectl apply -f FaSTAutoscaler/config/samples/fastsvc_v1_fastsvc.yaml
  • load test(remember to edit gateway ip):
cd faas-share-test/MLPerf-based-workloads/resnet/client && k6 run k6.js 

Publication

If you use FaST-GShare, please cite us:

@inproceedings{gu2023fast,
  title={FaST-GShare: Enabling Efficient Spatio-Temporal GPU Sharing in Serverless Computing for Deep Learning Inference},
  author={Gu, Jianfeng and Zhu, Yichao and Wang, Puxuan and Chadha, Mohak and Gerndt, Michael},
  booktitle={Proceedings of the 52nd International Conference on Parallel Processing},
  pages={635--644},
  year={2023}
}

fast's People

Contributors

yukiozzz avatar kontongu avatar interstellarss avatar

Stargazers

Xiaorang Guo avatar Mohak Chadha avatar Isaac Núñez avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.