Comments (4)
Hi Chengwei, we're looking at this and will get back to you. Note, given the speed of the head turn, a 255 sample (~5 ms) alignment difference will be a very small angle - but we'll get back to you with an answer shortly so that this can be treated consistently. Jon
from clarity.
Thanks for your reply. I had tried to align the hr.wav file with audio files by both ways (neglect 255 points in the beginning or the end of audio files), and neglecting 255 points in the beginning performs better. However, the experiment is not rigorous enough, and I'm looking forward to your further confirmation. Thank you!
from clarity.
Hi Chengwei,
What you are seeing is an effect of the binaural rendering. The head rotation vector is generated at the same times as the ambisonic audio which is used as a mid-stage in the processing for the scene rendering (please see the function generate_hoa_signals(self, scene) within scene_renderer_cec2.py.
The ambisonic audio isn't saved to disk but is rendered using generate_binaural_signals().
The HRTFs that we use have a kernel size of 256. The 255 sample difference in length that you're observing is a property of the convolution operation. The length of the output of a convolution is:
len(conv(A,B)) == len(A) + len(B) - 1
In this case, you can just remove/ignore the last 255 samples of the audio.
Due to the nature of the impulse responses used, there is a group delay and interaural delay associated with these filters. However, these will be very small (by definition, no longer than the kernel length) and an intrinsic property of the kind of signal processing used.
If you have any other queries relating to this issue, please get in touch.
Will
from clarity.
Thanks for your patient! I think it solved my problem.
from clarity.
Related Issues (20)
- Cad1-Task1 - Rollback changes in score
- CAD1 - Task1 second baseline
- introducing dataclass to represent listener
- Build multiple versions of documentation HOT 7
- Alternative resampling methods
- write_signal should clip signals outside PCM_16 range
- Conflicts in torch devices in torchloudnorm amd ConvTasNet
- Bring CAD1 recipes improvements to main
- There is a typo in CAD 1 Task 2
- Release to ORDA HOT 4
- Working on Evaluation code CAD1-CPC2
- ICASSP 2024 Cadenza Baseline
- [BUG] smearer tests are failing when using numpy 1.25 or later
- [BUG] ICASSP 2024 is not generating all the data
- Close
- Tutorial notebooks not working with latest version [BUG]
- Forthcoming NumPy2.0 release
- Cleaning MSBG code
- Add precomputations to HAAQI HOT 4
- From HAAQI - HASPI - HASQI
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from clarity.