Comments (4)
FYI: https://github.com/triton-lang/triton/blob/main/python/triton/ops/matmul.py
SPLIT-K should be able to solve such problems.
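For readers unfamiliar with the idea, here is a minimal pure-Python sketch of split-K, not the code from the linked matmul.py. It assumes the usual formulation: the reduction dimension K is cut into `split_k` slices, each slice produces a partial product, and the partials are summed into the output (on a GPU each slice would typically be its own program accumulating with an atomic add). The function names and the `split_k` parameter are illustrative only.

```python
def matmul_reference(a, b):
    """Plain matmul over nested lists: (M x K) @ (K x N)."""
    m, k, n = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(n)]
            for i in range(m)]


def matmul_split_k(a, b, split_k):
    """Same result, but the K dimension is reduced in `split_k` slices."""
    m, k, n = len(a), len(b), len(b[0])
    chunk = (k + split_k - 1) // split_k  # ceil(K / split_k)
    c = [[0] * n for _ in range(m)]
    for s in range(split_k):
        # Each slice is independent work; on a GPU it would run in parallel.
        lo, hi = s * chunk, min((s + 1) * chunk, k)
        for i in range(m):
            for j in range(n):
                partial = sum(a[i][p] * b[p][j] for p in range(lo, hi))
                c[i][j] += partial  # stands in for an atomic add
    return c


# M=1 case: a single row times a K x N matrix.
a = [[1, 2, 3, 4, 5, 6, 7, 8]]
b = [[p + j for j in range(3)] for p in range(8)]
result = matmul_split_k(a, b, split_k=4)
```

With M=1 there is only one row of output tiles, so slicing K is what creates extra parallel work; that is the reason split-K tends to help for this shape.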
from triton.
SPLIT-K
Great! It seems that the tutorial itself doesn't support split-K, so do we need to change it in order to test this with the 03 tutorial?
And after applying split-K, how much performance did you see compared with torch?
Thx~
from triton.
IMHO, I don't think we should change the 03 tutorial.
You can do the benchmarking on your own.
Because your M=1, this problem also falls within the GEMV domain.
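For the do-it-yourself benchmarking suggested above, a minimal timing helper might look like the sketch below. It is a generic wall-clock harness, not something taken from the tutorial; the name `bench` and its parameters are made up for illustration. Note that GPU kernel launches are asynchronous, so timing them this way would also require a call such as `torch.cuda.synchronize()` inside `fn`; `triton.testing.do_bench` is the usual tool for that case.

```python
import statistics
import time


def bench(fn, warmup=3, repeats=10):
    """Return the median wall-clock time of fn() in milliseconds."""
    for _ in range(warmup):
        fn()  # warm-up runs are not timed
    times_ms = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        times_ms.append((time.perf_counter() - start) * 1e3)
    return statistics.median(times_ms)


# Example: time a small CPU-side workload.
elapsed_ms = bench(lambda: sum(range(1000)))
```

Running the same helper over the split-K kernel and the torch baseline, with identical shapes and dtypes, gives a like-for-like comparison.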
from triton.
FYI: https://github.com/triton-lang/triton/blob/main/python/triton/ops/matmul.py
SPLIT-K should be able to solve such problems.
Hi, this file has been deleted. Why?
from triton.