ide8 / tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
License: BSD 3-Clause "New" or "Revised" License
After 750 epochs, we tested the trained Tacotron model via inference.ipynb and found that the same input text sequence produces a different output audio file on each run. Additionally, there is always a long stretch of silence, approximately 30 seconds, at the beginning of the audio file. For reference, the data was preprocessed as explained in the README. Sometimes the audio file contains only noise, and other times there is some speech at the end of the file.
Do you have any experience with this issue?
Hey, it would be good if you could share your pretrained model with proper alignment. I have been training from scratch for 6 days and am not getting any alignment.
Hi
First of all, thanks for the repository.
I am trying to train on a dataset in another language using this repository, and since I do not have a pretrained WaveGlow model I cannot train a new Tacotron 2 model... Is there any way to run Griffin-Lim on the inferred mel spectrograms? I am having some issues with tensor dimensionality and have not managed to get any audio...
Thanks in advance
Ander
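For reference, Griffin-Lim inversion of a mel spectrogram can be sketched roughly as below. This is a minimal NumPy-only sketch, not code from this repository. It assumes the mel spectrogram is a 2-D array of shape (n_mels, frames) that was compressed with a natural log, and that `mel_basis` is the same (n_mels, n_fft // 2 + 1) filterbank used during preprocessing. The function names and the defaults `n_fft=1024` and `hop=256` are illustrative and should be matched to your own hparams. If the model returns a batched tensor such as (1, n_mels, frames), dropping the batch dimension and converting to a NumPy array first is likely what resolves the dimensionality errors.

```python
import numpy as np


def stft(signal, n_fft, hop):
    """Hann-windowed STFT; returns complex array (n_fft // 2 + 1, frames)."""
    window = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    padded = np.pad(signal, n_fft // 2, mode="reflect")
    n_frames = 1 + (len(padded) - n_fft) // hop
    frames = np.stack(
        [padded[i * hop:i * hop + n_fft] * window for i in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames, axis=0)


def istft(spec, n_fft, hop):
    """Inverse of stft() via windowed overlap-add with squared-window normalization."""
    window = np.hanning(n_fft + 1)[:-1]
    n_frames = spec.shape[1]
    frames = np.fft.irfft(spec, n=n_fft, axis=0)
    length = n_fft + hop * (n_frames - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    for i in range(n_frames):
        out[i * hop:i * hop + n_fft] += frames[:, i] * window
        norm[i * hop:i * hop + n_fft] += window ** 2
    out /= np.maximum(norm, 1e-8)
    # Remove the reflect padding added in stft().
    return out[n_fft // 2:length - n_fft // 2]


def griffin_lim(magnitude, n_fft, hop, n_iter=60, seed=0):
    """Estimate a phase for a linear magnitude spectrogram by iterating STFT/ISTFT."""
    rng = np.random.default_rng(seed)
    angles = np.exp(2j * np.pi * rng.random(magnitude.shape))  # random initial phase
    for _ in range(n_iter):
        audio = istft(magnitude * angles, n_fft, hop)
        rebuilt = stft(audio, n_fft, hop)
        angles = np.exp(1j * np.angle(rebuilt))  # keep the phase, discard the magnitude
    return istft(magnitude * angles, n_fft, hop)


def mel_to_audio(log_mel, mel_basis, n_fft=1024, hop=256, n_iter=60):
    """Hypothetical helper: log-mel (n_mels, frames) -> waveform (1-D array)."""
    mel = np.exp(log_mel)  # assumption: natural-log compression was used
    # Approximate the linear magnitude with the pseudo-inverse of the filterbank.
    magnitude = np.maximum(np.linalg.pinv(mel_basis) @ mel, 1e-10)
    return griffin_lim(magnitude, n_fft, hop, n_iter)
```

The pseudo-inverse step is only an approximation of the linear spectrogram, so the result will sound noticeably rougher than a neural vocoder, but it is usually enough to check whether the Tacotron 2 output contains intelligible speech.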
What dataset did you use to train and test your network? I cannot find any information about it, only how to process the data.