Comments (2)
Hi. Griffin-Lim is an iterative algorithm that creates an artificial phase spectrum derived from the magnitude spectrum alone. The MagPhase vocoder, by contrast, encodes both the magnitude and the phase spectra of speech. During reconstruction it therefore uses the phase spectrum extracted from natural speech (or predicted by the acoustic model, in the case of TTS), which results in more natural sound quality.
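To make the contrast concrete, here is a minimal NumPy sketch of the Griffin-Lim idea: start from a random phase and alternate between going back to the time domain and re-imposing the known magnitude. The helper names (`stft`, `istft`, `griffin_lim`, `spectral_error`) and the frame settings are assumptions chosen for illustration; this is not MagPhase code, and it is not a production implementation.

```python
import numpy as np

def stft(x, n_fft=256, hop=64):
    """Hann-windowed STFT; returns an array of shape (frames, n_fft // 2 + 1)."""
    win = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(S, n_fft=256, hop=64):
    """Least-squares overlap-add inverse of stft()."""
    win = np.hanning(n_fft + 1)[:-1]
    n_frames = S.shape[0]
    length = n_fft + hop * (n_frames - 1)
    x = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(S, n=n_fft, axis=1)
    for i in range(n_frames):
        x[i * hop:i * hop + n_fft] += frames[i] * win
        norm[i * hop:i * hop + n_fft] += win ** 2
    # Samples at the very edges are poorly conditioned; real code usually pads.
    return x / np.maximum(norm, 1e-8)

def griffin_lim(mag, n_iter=50, n_fft=256, hop=64, seed=0):
    """Estimate a waveform from a magnitude spectrogram only."""
    rng = np.random.default_rng(seed)
    phase = np.exp(2j * np.pi * rng.random(mag.shape))  # artificial starting phase
    for _ in range(n_iter):
        x = istft(mag * phase, n_fft, hop)              # back to the time domain
        phase = np.exp(1j * np.angle(stft(x, n_fft, hop)))  # keep phase, drop magnitude
    return istft(mag * phase, n_fft, hop)

def spectral_error(mag, x, n_fft=256, hop=64):
    """Relative mismatch between the target magnitude and that of x."""
    return np.linalg.norm(np.abs(stft(x, n_fft, hop)) - mag) / np.linalg.norm(mag)

# Toy input: two sinusoids standing in for a speech signal.
t = np.arange(4096) / 16000.0
signal = np.sin(2 * np.pi * 220 * t) + 0.5 * np.sin(2 * np.pi * 880 * t)
target_mag = np.abs(stft(signal))
estimate = griffin_lim(target_mag, n_iter=30)
```

The phase that comes out of this loop is only constrained to be consistent with the magnitude, so it is generally not the phase of the original recording.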
For your second question: as you mentioned, MagPhase encodes the magnitude and phase, so ideally a simple IFFT would be enough for reconstruction. In fact, that is what it does for lossless decoding (see demo_copy_synthesis_lossless.py). In acoustic modelling, however, the parameters are smoothed by the model (e.g., a DNN), which does not capture the aperiodicities ("randomness") in speech. You therefore need to recreate that "randomness" in certain parts of the signal, and for that MagPhase uses white noise, which is filtered and mixed with the components predicted by the acoustic model.
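Both points can be illustrated on a single frame with plain NumPy. The sketch below first shows that magnitude plus phase goes through the IFFT without loss, and then shows one simple way of shaping white noise with a magnitude envelope and mixing it with a deterministic component. The functions `shaped_noise` and `mix`, the hard cutoff bin, and the random stand-in frame are all hypothetical and chosen for illustration; MagPhase's actual filtering and mixing are more involved and are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
n_fft = 512
frame = rng.standard_normal(n_fft)  # stand-in for one windowed speech frame

# 1) Lossless case: magnitude and phase together determine the frame.
spec = np.fft.rfft(frame)
mag, phase = np.abs(spec), np.angle(spec)
rebuilt = np.fft.irfft(mag * np.exp(1j * phase), n=n_fft)
max_err = np.max(np.abs(rebuilt - frame))  # only floating-point rounding remains

# 2) Recreating "randomness": filter white noise with a magnitude envelope.
def shaped_noise(mag, n_fft, rng):
    """White noise whose spectrum is forced to the given magnitude (hypothetical helper)."""
    noise_spec = np.fft.rfft(rng.standard_normal(n_fft))
    noise_spec = noise_spec / np.maximum(np.abs(noise_spec), 1e-12)  # unit magnitude, random phase
    return np.fft.irfft(mag * noise_spec, n=n_fft)

def mix(deterministic, noise, cutoff_bin, n_fft):
    """Deterministic part below cutoff_bin, noise part at and above it (hypothetical helper)."""
    keep_det = np.arange(n_fft // 2 + 1) < cutoff_bin
    det_spec = np.fft.rfft(deterministic)
    noise_spec = np.fft.rfft(noise)
    return np.fft.irfft(np.where(keep_det, det_spec, noise_spec), n=n_fft)

noise = shaped_noise(mag, n_fft, rng)
mixed = mix(rebuilt, noise, cutoff_bin=64, n_fft=n_fft)
```

In this toy version the noise keeps the frame's magnitude envelope but has random phase, which is the sense in which it restores aperiodicity that a smoothed prediction would lack.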
Thank you for your reply!