Comments (1)
Hi @rajagond, thanks for the detailed report. I've confirmed the problem with -B 8:
input_tensor_2d: (16384, 24576), device id: <CUDA Device 0>
input_tensor_3d: (8, 2048, 24576), device id: <CUDA Device 0>
weights: (24576, 12288), device id: <CUDA Device 0>
output_2d: (16384, 12288), device id: <CUDA Device 0>
output_3d: (8, 2048, 12288), device id: <CUDA Device 0>
CuPy MatMul Time (2d): 51.69049 ms
CuPy MatMul Time (3d): 790.34568 ms
CuPy MatMul Time (% diff): 1429.00%
matmul has a fast path for the 2d case (which just uses cupy.dot), and it looks like there is room for improvement in the 3d case.
cupy/cupy/_core/_routines_linalg.pyx
Line 809 in 34f8edb
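Until the 3d path is improved, one possible workaround is to collapse the leading axes so the product goes through the 2d case. The following is only a minimal sketch, not something I have benchmarked: the helper name `matmul_via_2d` is hypothetical, it assumes a C-contiguous 3d input and 2d weights, and it uses NumPy with small shapes as a stand-in so it runs without a GPU (CuPy arrays expose the same `reshape` and `dot` methods).

```python
import numpy as np  # stand-in for CuPy so the sketch runs without a GPU


def matmul_via_2d(x3d, w):
    """Hypothetical helper: compute x3d @ w through a single 2d product."""
    b, s, k = x3d.shape
    if w.ndim != 2 or w.shape[0] != k:
        raise ValueError("expected w of shape (k, n) matching x3d's last axis")
    # Collapse the batch and sequence axes into one; for a C-contiguous
    # input this reshape does not need to copy the data.
    x2d = x3d.reshape(b * s, k)
    # Plain 2d product, i.e. the case the comment above says has a fast path.
    out2d = x2d.dot(w)
    # Restore the original leading axes on the result.
    return out2d.reshape(b, s, w.shape[1])


# Small shapes for illustration only; the report used (8, 2048, 24576).
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 16, 32)).astype(np.float32)
w = rng.standard_normal((32, 24)).astype(np.float32)
out = matmul_via_2d(x, w)
```

The result should match `matmul` on the original 3d input, since every batch shares the same 2d weights.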