Giter Club home page Giter Club logo

aisystem's Issues

建议在《分布式并行》里面增加“ZeRO优化器”

Megatron-LM支持:数据并行+张量并行+流水并行。
DeepSpedd支持:数据并行+ZeRO优化器。
Megatron-DeepSpeed支持:数据并行+张量并行+流水并行+ZeRO优化器。
ZeRO优化器分为三种:ZeRO-stage1、ZeRO-stage1、ZeRO-stage3。
其中ZeRO-stage3也被称作FSDP。

ZeRO或者FSDP的应用示例:

  1. https://github.com/tatsu-lab/stanford_alpaca
  2. https://github.com/lm-sys/FastChat
  3. https://github.com/hiyouga/LLaMA-Efficient-Tuning/blob/main/README_zh.md

ZeRO相关参考资料:

  1. Samyam Rajbhandari, Jeff Rasley, Olatunji Ruwase, Yuxiong He. (2019) ZeRO: memory optimizations toward training trillion parameter models. arXiv:1910.02054 and In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC ‘20).
  2. ZeRO++: A leap in speed for LLM and chat model training with 4X less communication](https://www.microsoft.com/en-us/research/blog/deepspeed-zero-a-leap-in-speed-for-llm-and-chat-model-training-with-4x-less-communication/)
  3. FairScale FULLY SHARDED DATA PARALLEL DOC https://fairscale.readthedocs.io/en/stable/api/nn/fsdp.html

希望加入~

一直也有学习ai编译器的想法,就是希望有类似的项目可以参与,虽然我是新手,但也想尽份力,我目前是从事深度学习工作的,不知道可以做点什么

[视频文章补充计划] AI编译器 后端优化

文章补充内容与计划排期

AI编译原理

后端优化

章节标题 计划完成时间 实际完成时间 对应链接
内容介绍 2024/3/1 2024/2/28 #80
算子的计算与调度 2024/3/10
算子手工优化方式 2024/3/20
算子循环优化 2024/3/30 2024/3/13 #85
指令和内存优化 2024/4/10
AutoTuning原理 2024/4/20

Incorrect Bandwidth Units in NVLink 1.0 image for P100

Path of the image: 02Hardware/04NVIDIA/images/05DeepNvlink07.png

The image depicting NVLink 1.0 for the P100 shows the bandwidth as "20 Gbps" per link. This is incorrect and should be "20 GBps" (Gigabytes per second) per link.
Additionally, the total unidirectional bandwidth is listed as "80 Gbps" and the bidirectional bandwidth as "160 Gbps," which should be "80 GBps" and "160 GBps," respectively.
The image is referenced in 02Hardware/04NVIDIA/05DeepNvlink.md

image

希望优化一下自动微分部分

我在b站看到您关于AI芯片的介绍从而发现了这个repo,非常感谢您能够开源这些优秀的资料。
结合我之前学习的资料,希望能对自动微分部分进行一些修改,或者添加一个新的章节。

新的章节主要结合代码实现进行讲解,从基本的支持自动求导的Value类开始,最终实现一个MLP网络的训练,总代码量大概200行不到。

如果您觉得合适的话,我们可以讨论一下怎么修改。

修改主要参考:

  1. Andrej Karpathyyoutube 视频
  2. github repo
  3. jupyter notebook

另外文档中还有一些笔误,我看到的话也可以提交修改。
目前我正在学习cuda和ai编译相关的内容,如果有机会的话也可以看看有没有合适的内容可以贡献。

多交流

Hi Zomi,Thx for your job. I am a AI Infra owner in car maker company. Hopefully we can be connected. Wechat ID: hello_jijing

我是一名车企的AI基础架构,希望可以多多交流。微信号是:hello_jijing

希望加入

这个开源项目做的非常不错,随着LLM大火,也需要了解学习AIsys相关内容,才能完成企业内AI落地,因此希望加入开源项目一起学习讨论

Add some learning resources

我学校开过《智能计算系统》这门课,有一些视频链接和实验笔记以及其他资源,我可以放上来吗,额外放一个文件夹收集

资料(pdf,pptx)无法打开和下载

点开课程列表,选择任意一条.pdf文件点击进去,显示
Error rendering embedded code

Invalid PD
image

请问怎么解决,是我环境的问题吗?
谢谢作者

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.