Comments (4)
Hello,
You were correct, this was indeed a bug that occurred when buffers were provided at launch time. I have fixed it in the last version and added an option to provide offsets at launch time.
from vkfft.
Thanks for addressing this so quickly! (I should've thought to point out that I was indeed setting buffers only at launch time.)
I'm now testing all combinations of zero and nonzero offsets for transforms, and I think I've found one bug. I observed that transforms that pass input and main buffers are incorrect when the input offset is zero but the main buffer offset is nonzero. I think these two lines of the diff are the culprit; commenting them out fixed this issue. Was there a separate reason for those offsets to be changed as such?
from vkfft.
This was more of a failsafe, that backfired when inputOffset is actually zero and bufferOffset is not zero. It is not really needed, as offsets are configured in the same layout as buffers, so should always use respective values.
from vkfft.
Great, then to the extent of my testing this issue can be closed. Thank you again!
from vkfft.
Related Issues (20)
- 1.3.1 and 1.2.12 performance comparison HOT 2
- WebGPU compute backend HOT 1
- Large errors using Intel OpenCL for CPUs HOT 8
- Allow complex transforms to use non-interleaved memory layouts (separate I and Q buffers) HOT 2
- Segmentation fault with 1.3.3 HOT 2
- Merge DCT and DST into DTT HOT 4
- [1.3.4] #include glslang_c_interface.h -> glslang/Include/glslang_c_interface.h HOT 1
- [1.3.4] Could you detect and use the system glslang via cmake HOT 1
- Support VK_KHR_push_descriptor to allow multiple invocations with different buffers in a single command buffer HOT 3
- initializeVkFFT throws an exception for small-size convolutions HOT 3
- Multiple systems - one kernel - multiple outputs convolution HOT 10
- in order to use VkFFT, my machine must have a Vulkan driver installed ? HOT 1
- how to dump the generated source kernel? HOT 6
- CUDA&HIP stream asynchronicity HOT 2
- Use enum for precision HOT 2
- Error module and new error handling HOT 2
- Allow HIP to be used with NVIDIA GPUs HOT 1
- [Metal] red herring
- Suggestion: add CITATION.cff HOT 1
- device function performing FFT/DCT of data in shared memory (feature request / feasability) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vkfft.