Comments (3)
I'm using ROCm 6.1 RC on Ubuntu 22.04 (Linux kernel 6.8.1), with Radeon Pro VII and Radeon VII GPUs.
from clr.
I think I found a clue. In the clinfo output for all my GPUs (Radeon VII and Radeon Pro VII) I see:
  Queue on Host properties:
    Out-of-Order: No
    Profiling : Yes
  Queue on Device properties:
    Out-of-Order: Yes
    Profiling : Yes
So it seems that the host command queue does not implement out-of-order execution. OK.
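For anyone reading along: as far as I understand, those clinfo lines are decoded from the cl_command_queue_properties bitfields that clGetDeviceInfo returns for CL_DEVICE_QUEUE_ON_HOST_PROPERTIES and CL_DEVICE_QUEUE_ON_DEVICE_PROPERTIES. Below is a minimal sketch of that decoding. It does not call OpenCL at all; the two bit constants are copied from the Khronos cl.h header, while the function name and the sample values are hypothetical and simply chosen to match the output above.

```python
# Bit values of cl_command_queue_properties, as defined in the Khronos cl.h header.
CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE = 1 << 0
CL_QUEUE_PROFILING_ENABLE = 1 << 1


def describe_queue_properties(bits):
    """Decode a queue-properties bitfield into the two flags clinfo prints.

    Hypothetical helper for illustration; not part of any OpenCL binding.
    """
    return {
        "Out-of-Order": bool(bits & CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE),
        "Profiling": bool(bits & CL_QUEUE_PROFILING_ENABLE),
    }


# Assumed sample values that would produce the clinfo output quoted above.
host_bits = CL_QUEUE_PROFILING_ENABLE
device_bits = CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE | CL_QUEUE_PROFILING_ENABLE

for label, bits in (("Host", host_bits), ("Device", device_bits)):
    flags = describe_queue_properties(bits)
    print(f"Queue on {label} properties:")
    for name, enabled in flags.items():
        print(f"  {name}: {'Yes' if enabled else 'No'}")
```

In a real program the bitfield would come from the device query rather than from hard-coded values, and the out-of-order bit could be checked before requesting an out-of-order host queue.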
Why is that? Is it a limitation of the hardware (these particular GPU models) or of the ROCm version, is it simply not implemented in software yet, or is it something else?
Thanks anyway.
So my observation was correct (the out-of-order queue does not actually run out of order), but the cause is not a bug; it is by design.
from clr.
One more observation: the hardware is clearly capable of running multiple compute kernels in parallel, since it does so when the kernels are queued from multiple processes. So the missing out-of-order support can't be a hardware limitation; it is more likely just not implemented. Are there plans in that direction?
Is it implemented in HIP?
from clr.