Comments (4)
Hi @AtomicVar, thanks for the report. You're right, cuBLAS doesn't support it. We disabled CUTLASS because of extremely long compile times in the unit tests: even though the functionality and speed were both adequate, the compiles were taking over 40 minutes. We can add a message saying it's an unsupported type.
from matx.
@AtomicVar I've submitted MR #540 to resolve this
@AtomicVar According to the cuBLAS team, we're not meeting the requirements here: https://docs.nvidia.com/cuda/cublas/index.html#cublasltmatmul
Namely, A must be transposed.
If that still works for you, we can add support for it when those requirements are met.
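To illustrate the requirement mentioned above, here is a minimal sketch of a runtime check that rejects an integer matmul unless A is transposed. The names `Trans` and `check_int_matmul_layout` are hypothetical and are not part of MatX or cuBLAS; the linked cuBLASLt documentation may list further conditions that this sketch does not cover.

```cpp
#include <string>

// Hypothetical enum for this sketch: whether an operand is used as-is (N)
// or transposed (T).
enum class Trans { N, T };

// Hypothetical helper: returns an empty string when the layout is acceptable,
// otherwise a readable explanation. It only encodes the single condition
// stated in this thread (A must be transposed).
inline std::string check_int_matmul_layout(Trans op_a) {
  if (op_a != Trans::T) {
    return "int matmul: A must be transposed to meet the cuBLASLt requirements";
  }
  return {};
}
```

A caller could run this check before dispatching and surface the returned message as the error text.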
@cliffburdick Actually, I can use float matmul instead of int matmul to accomplish the same task, so it won't be a problem if int is not supported. I just think the unsupported conditions should be documented, and we need to throw and print clear error messages.
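As a rough sketch of what a clear error for an unsupported type could look like, the snippet below uses a `static_assert` with a readable message. The trait `is_supported_matmul_type_v`, the function `checked_matmul`, and the choice of `float` and `double` as the supported set are all assumptions for illustration, not MatX's actual API or type list; the loop is a naive CPU reference standing in for the real backend call.

```cpp
#include <cstdint>
#include <type_traits>

// Hypothetical trait: the element types this sketch treats as supported.
// The real supported set in MatX may differ.
template <typename T>
inline constexpr bool is_supported_matmul_type_v =
    std::is_same_v<T, float> || std::is_same_v<T, double>;

// Hypothetical entry point. Unsupported types fail at compile time with a
// readable message instead of a deep template error.
// a is m x k, b is k x n, c is m x n, all row-major.
template <typename T>
void checked_matmul(const T* a, const T* b, T* c, int m, int n, int k) {
  static_assert(is_supported_matmul_type_v<T>,
                "matmul: unsupported element type; use float or double");
  // Naive reference multiply standing in for the real GEMM backend.
  for (int i = 0; i < m; ++i) {
    for (int j = 0; j < n; ++j) {
      T acc = T(0);
      for (int p = 0; p < k; ++p) {
        acc += a[i * k + p] * b[p * n + j];
      }
      c[i * n + j] = acc;
    }
  }
}
```

With this in place, calling `checked_matmul` on `int` buffers would stop the build with the message above, while `float` buffers compile and run.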
Related Issues (20)
- [FEA] Better error messages when allocating memory
- [QST] MatX is around x15 slower than CuPy for the same task HOT 55
- [DOC] Add detailed descriptions of memory space options
- [BUG] update CUB Cache
- [FEA] add argminmax function
- [FEA] Support UINT random generation with API
- [FEA] Add Normalize function
- [FEA] Add argsort function
- [BUG] ‘::rsqrt’ has not been declared when building MatX HOT 1
- [BUG] Less than operator doesn't seem to work HOT 2
- [BUG] Building docs too aggressive with recursive folder searching
- [QST] How to use the `sort` function? HOT 1
- [QST] How to multiply two random tensors together? HOT 3
- [FEA] add ability to print shape only HOT 1
- [BUG] MATX_EN_CUTENSOR / MATX_ENABLE_CUTENSOR Unified Language
- [BUG] Warning Message for Host call in device function for Einsum
- [BUG] sum function produces incorrect results HOT 2
- [BUG] Transform Ops have incorrect rank and size
- [QST] How to use norm ops with repmat? HOT 6
- [BUG] Random op can't be casted to tensor_t HOT 2