Comments (4)
#1340 helps when the data types are scalar. But it does not help when the data types are vectors.
Passing `-O0` to clspv does not stop it from adding the two early InstCombinePass too :(
from clspv.
Yes `-O0` is not really used in clspv (#1228 (comment)).
Maybe we should consider adding something in https://github.com/google/clspv/blob/main/lib/UndoInstCombinePass.cpp?
@olvaffe interesting. Can you share perf numbers from the clpeak scalar tests with and without your patch, to get an idea of the expected speedups?
The perf numbers doubled for `short` and `char`. But I also needed to teach mesa to replace `(x + 1) * y` by `x * y + y`, for that to be identified as MAD.
It sounds like UndoInstCombinePass.cpp might be a better place to undo the combining. I can certainly look into that.
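The rewrite from `(x + 1) * y` to `x * y + y` mentioned above relies on both forms wrapping the same way in fixed-width integer arithmetic. As a quick sanity check, here is a minimal C sketch that compares the two forms exhaustively for 8-bit values, with `uint8_t` standing in for an OpenCL `uchar`. This is not code from clspv or mesa, and the function names are made up for illustration.

```c
#include <stdint.h>

/* Hypothetical helper: the combined form, (x + 1) * y, truncated to 8 bits. */
uint8_t combined_form(uint8_t x, uint8_t y) {
    uint8_t x_plus_one = (uint8_t)(x + 1u);   /* wraps 255 -> 0 */
    return (uint8_t)(x_plus_one * y);
}

/* Hypothetical helper: the multiply-add form, x * y + y, truncated to 8 bits. */
uint8_t mad_form(uint8_t x, uint8_t y) {
    return (uint8_t)(x * y + y);              /* computed in int, then truncated */
}

/* Count the input pairs for which the two forms disagree. */
int count_mismatches(void) {
    int mismatches = 0;
    for (int x = 0; x < 256; ++x) {
        for (int y = 0; y < 256; ++y) {
            if (combined_form((uint8_t)x, (uint8_t)y) !=
                mad_form((uint8_t)x, (uint8_t)y)) {
                ++mismatches;
            }
        }
    }
    return mismatches;
}
```

`count_mismatches()` returns 0 because both expressions are equal modulo 256. The same argument holds modulo 65536 for `short`, so the rewrite is safe for wrapping integer types.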