Comments (5)
@Lissanro wouldn't it be killed by PCI-E latency?
from segmoe.
I think PCI-E latency is only relevant during training (and even then it could be quite good with PCI-E 4.0 or PCI-E 5.0 and a sufficient number of lanes, or NVLink in the case of a pair of 3090 cards).
For inference, PCI-E latency should not matter much: the experts do their work independently once they're fully loaded into VRAM. This is how, for example, running Mixtral (8x7B MoE) at 4-bit or higher quantization is possible with 24GB cards - since it cannot fit in the 24GB of a single card, it gets split across more than one GPU, and speed is comparable to running on a single GPU.
Potentially, it could be even better if parallelism across multiple GPUs is implemented (for the case when one expert is fully allocated on one GPU, another expert on a different GPU, and the gate network decides it needs both). In any case, even a naive sequential implementation (processing experts one by one even if they are on different GPUs) is still better than crashing with OOM, and in terms of speed should be at least comparable to running on a single GPU with more VRAM.
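The sequential dispatch described above can be sketched in a few lines. This is a toy illustration, not segmoe's actual implementation: the `Expert` class, `gate` function, and device labels are all hypothetical stand-ins (a real gate network computes routing logits from hidden states, and moving tensors between GPUs would use `x.to(expert.device)` in PyTorch).

```python
# Minimal sketch of naive sequential MoE expert dispatch across devices.
# All names (Expert, gate, moe_forward) are illustrative, not segmoe's API.

from dataclasses import dataclass


@dataclass
class Expert:
    device: str   # where the expert's weights live, e.g. "cuda:0" (label only here)
    scale: float  # stand-in for the expert's parameters


def expert_forward(expert, x):
    # In a real MoE each expert is a full sub-network; here a toy op.
    return [v * expert.scale for v in x]


def gate(x, num_experts, top_k=2):
    # Toy deterministic gating; a real gate network is learned.
    scores = [(sum(x) * (i + 1)) % num_experts for i in range(num_experts)]
    ranked = sorted(range(num_experts), key=lambda i: -scores[i])
    return ranked[:top_k]


def moe_forward(x, experts):
    # Naive sequential dispatch: process the chosen experts one after
    # another even when they sit on different GPUs. No cross-GPU
    # parallelism, but no OOM either, as the comment above argues.
    chosen = gate(x, len(experts))
    out = [0.0] * len(x)
    for idx in chosen:
        expert = experts[idx]
        # In PyTorch this is where x would be moved: x = x.to(expert.device)
        y = expert_forward(expert, x)
        out = [a + b / len(chosen) for a, b in zip(out, y)]
    return out


# Four experts split across two (labelled) GPUs.
experts = [Expert("cuda:0", 0.5), Expert("cuda:0", 1.5),
           Expert("cuda:1", 2.0), Expert("cuda:1", 3.0)]
print(moe_forward([1.0, 2.0], experts))  # averages the top-2 experts' outputs
```

The parallel variant the comment mentions would instead launch the chosen experts concurrently when they live on different GPUs and sum the results afterwards; the routing logic stays the same.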
Thanks for the suggestion, we are working on optimizing the memory usage, but feel free to create a PR for Multi-GPU usage.
@Warlord-K Hi Admin, would it be possible for the homepage README file to list the GPU requirements or specifications?
@g29times I have added the GPU requirements, thanks for the suggestion!
Related Issues (20)
- Minor mistake in readme HOT 1
- Thank you! + model suggestion HOT 2
- TypeError: SparseMoeBlock.forward() missing 1 required positional argument: 'scale' HOT 5
- Issue with Civitai downloads HOT 2
- Any benefit to implementing this with lycoris/lora instead of full models? HOT 2
- Support local safetensors file HOT 1
- Support Colab and Local Storage HOT 1
- TypeError: no_grad.__init__() on import HOT 3
- positive and negative keywords in .yaml files HOT 2
- Is torch 2.0 mandatory? HOT 2
- [feature] Support StableDiffusionImg2ImgPipeline HOT 1
- Does this work for Stable Cascade? HOT 1
- 77 token limit HOT 5
- Why using negative prompt hidden states as gate weight? HOT 2
- Could you explain the effect of Pos and Neg prompts of each experts? HOT 2
- How to choose the positive prompt and negative prompt? HOT 2
- How to finetune the segmoe and train lora HOT 1
- Got noise image sample
- MoE in the attn heads
- Can you support SD3? HOT 5