Comments (8)
Hi,
NCCL is an inter-GPU communication library ; it is therefore accelerating computation on multiple GPUs.
That means it won't make a difference with only one GPU.
from nccl.
Hello,I begin use one GPU, but then use four GPUs! But the result has no different.
from nccl.
The CIFAR-10 example is not very computationally intensive. The GPU(s) are probably limited by kernel launch latencies. Perhaps try something like AlexNet, or better ResNet-50, on ImageNet 1000.
from nccl.
OK!Thanks,I have a try the AlexNet!
from nccl.
@nluehr @sjeaugey When I use the caffe-0.15.9/build/tools/caffe time --model=./imagenet_winners/alexnet.prototxt --iterations=1000 --gpu all, the result said no use multi-gpu. Why?
I0303 04:52:48.248244 21608 caffe.cpp:334] Not using GPU #3 for single-GPU function
I0303 04:52:48.248602 21608 caffe.cpp:334] Not using GPU #2 for single-GPU function
I0303 04:52:48.248613 21608 caffe.cpp:334] Not using GPU #1 for single-GPU function
I0303 04:52:48.359235 21608 caffe.cpp:341] Use GPU with device ID 0
I0303 04:52:48.359807 21608 caffe.cpp:345] GPU 0: Tesla P100-SXM2-16GB
from nccl.
@getengqing, I'm not that familiar with Caffe. You probably want to direct this question to the caffe project.
from nccl.
@getengqing Please, see int time() function in tools/caffe.cpp. Multi-GPU performance measurement is not supported.
from nccl.
@sergey-serebryakov OK!Thanks!
from nccl.
Related Issues (20)
- question about a new single-node communication mode
- what does non-blocking communicator forοΌ HOT 4
- deadlock when using multiple communicators for Point-To-Point Communication within the same GPU Group
- Network IP setup and physical wiring
- Enabling read for P2p transport HOT 1
- How to tell nccl that those network communication is disabled? HOT 2
- Is it possible to swap the calling order of `initTransportsRank` and `ncclTunerPluginLoad` HOT 1
- NCCL Logs Communicator Query HOT 1
- work request complete err: status 5 and vendor err 249 HOT 7
- Is there someway to measure gpu i/o usage or allreduce waiting time? HOT 1
- About sync in nvls algorithm
- NCCL Tree allreduce test cannot reach the theoretical bus bandwidth on 2 nodes with 4 nics HOT 6
- how does NCCL support peer-to-peer connections across NUMA nodes without the features of NICs and NVLinks? HOT 2
- How can I test IB bandwidth when NCCL is running?
- Single or double ring HOT 1
- Missing header file HOT 3
- Why does NVLSTree Allreduce perform worse than Ring Allreduce?
- Encountering Random Segmentation Fault During NCCL-Tests HOT 7
- Ring broadcast
- inter-node nvls process when ib sharp not supported
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nccl.