Comments (8)
@r4nc0r Keep watch for the release of Mimic 3 (samples), which should be this month. You should get a 8-10x speedup with it; I typically get an RTF of 0.03, but I'm also on a Ryzen 5950X.
from thorsten-voice.
@domcross and i are working on new/better models using HifiGAN vocoder. Samples available on Thorsten-Voice project website. These models might be faster than the current one available. But maybe you should check work by @synesthesiam with larynx. My voice is available there too and it's really fast.
Did you test with "WaveGrad" or "Fullband-MelGAN" vocoder (Fullband-MelGAN is way faster).
from thorsten-voice.
Thanks for your quick reply and for pointing me in the right direction!
I just used your model with the parameters specified in the readme: tts-server --model_name tts_models/de/thorsten/tacotron2-DCA
from thorsten-voice.
I tried following pip3 instal tts==0.5.0
and run tts-server --model_name tts_models/de/thorsten/tacotron2-DCA
. Got an RTF around 0,6 - 1 on my notebook cpu which i think isn't too bad. What RTF do you have?
Just if you're interested in:
https://www.thorsten-voice.de/2022/03/20/vergleich-thorsten-aktuell-mit-dem-neuen-modell/
from thorsten-voice.
I just did that with the addition of --show_details SHOW_DETAILS
and my RTF is about 0,6:
> Processing time: 3.101564407348633
> Real-time factor: 0.5756691513639508
I use a 12 Core Ryzen 3000 Processor.
But the Processing time of 3s is extremly high given my use case of generating just in time responses for my voice Assistant.
I build a workaround wich caches most wav files, but if I generate Responses with variable in the text this doenst work.
Also i would love to use your new model, is there a way to use it?
from thorsten-voice.
The new model is not released yet. I'll keep community updated on release date on Twitter or my Youtube channel.
I'd recommend you taking a look larynx as it's designed for small compute power (like a raspberry) and my german voice is available too.
from thorsten-voice.
Also i would love to use your new model, is there a way to use it?
Hi @r4nc0r ,
you can download model and config on @coqui-ai prerelease 0.7.0 here: https://github.com/coqui-ai/TTS/releases
Easy pip
based installation will follow when final 0.7.0 will be released.
Keep watch for the release of Mimic 3
You can play around with beta of Mimic 3 with my german voice (and some more german voices) as mentioned by @synesthesiam: https://mycroft.ai/blog/mimic-3-preview/
from thorsten-voice.
As Mimic 3 is already released you can easily use this. You can watch this video on how to set it up and use it and/or check official doc.
If you want to use Coqui TTS (little bit slower, but better quality) you can do this by:
pip install tts==0.7.1
tts-server --model_name tts_models/de/thorsten/vits
I close this issue for now, but feel free top reopen if you have further questions.
from thorsten-voice.
Related Issues (20)
- Finetuning Tacotron2 on your pretrained model HOT 6
- Recommendation for Training/ HOT 2
- Question with Phonemes HOT 4
- Documenting the process of building an open voice model out of audio files HOT 2
- ValueError: Phonemizer is not defined in the TTS config. HOT 2
- Porting the German voice into RHVoice HOT 7
- Eigene TTS erstellen HOT 3
- 44khz 16 bit available? HOT 2
- Source of Text Prompts HOT 1
- Help for vocoder training for Coqui HOT 6
- Multispeaker-Finetuning on Single-Speaker-VITS-Model HOT 2
- NumPy (Torch) issues HOT 2
- training duration / female voice? HOT 2
- Request for an oobabooga extension HOT 2
- TTS-Models: Download-Links broken? HOT 2
- Made with Thorsten-Voice 😊 HOT 2
- Windows: tts_to_file ignoring German Umlauts HOT 5
- Request - "synthesize_csv.py" from YouTube "Coqui TTS Audio samples of all models (Version 0.7.1)" HOT 4
- Voz Português Brazil HOT 1
- Emphasis on syllables – How to choose? HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from thorsten-voice.