Comments (3)
Thanks for the question.
In general it is hard. The problem here is that there are too many English voices! I didn't want to decide which one was better and which one was worse, so I included them all. Maybe it wasn't a good strategy and I should have made some selection, but I didn't. Everything that is available in Piper is also available in Speech Note.
Piper LibriTTS and Piper LibriTTS-R are multi-speaker models, so one checkpoint file can generate many totally different voices. In Speech Note every "voice" is presented as a separate model, but under the hood all LibriTTS/LibriTTS-R voices use the same checkpoint file. The file is downloaded only once, so no worries.
The names "P7910" or "3615" come from the original speaker names in the training data. My initial idea was to add at least a male/female indication to the name, but I gave up because there are just too many of them. That's why you see these long and meaningless names :/
In this particular example, LibriTTS-R is a restored version of the LibriTTS corpus. The voices are similar, but LibriTTS-R should sound a bit better.
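To make the "many voices, one checkpoint" point concrete, here is a minimal sketch in Python. It is not Speech Note's actual code: the `VoiceEntry` class, the `files_to_download` helper and the file name `multi_speaker.ckpt` are all made up for illustration. It only shows how several list entries can point at one file that is fetched once.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceEntry:
    """One selectable "voice" in a model list (hypothetical structure)."""
    display_name: str  # name shown to the user, e.g. the training-data speaker name
    checkpoint: str    # file that actually has to be downloaded
    speaker: str       # which speaker inside the checkpoint to synthesize with


def files_to_download(voices):
    """Return each distinct checkpoint once, in first-seen order."""
    seen = []
    for voice in voices:
        if voice.checkpoint not in seen:
            seen.append(voice.checkpoint)
    return seen


# Hypothetical checkpoint name; both entries share the same file.
voices = [
    VoiceEntry("P7910", "multi_speaker.ckpt", "P7910"),
    VoiceEntry("3615", "multi_speaker.ckpt", "3615"),
]

print(files_to_download(voices))
```

Two entries appear in the list, but only one file ends up in the download queue.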
from dsnote.
My suggestion to fix this issue of many voices using one model is to offer the model as a single download. Then, within the main interface, a person can choose among the many different voices that are available, similar to how the Coqui XTTS model works.
That's a very good idea! 👍🏿
If you choose to do that, then I can go through some of the different voice models and add some metadata to them to indicate whether each voice is male or female.
That would be super great :) I will let you know when it is ready. Most likely I won't be able to implement this in an upcoming release, but later.
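For reference, the "one download, pick the voice afterwards" idea could look roughly like this. It is only a sketch: `group_by_model` and the model names are hypothetical, and the real model list in Speech Note may be organized quite differently.

```python
from collections import defaultdict


def group_by_model(entries):
    """Turn flat (model, speaker) pairs into {model: [speakers]}."""
    grouped = defaultdict(list)
    for model, speaker in entries:
        grouped[model].append(speaker)
    return dict(grouped)


# Flat list as it is presented today: one entry per voice.
# Model names here are placeholders, not real catalog entries.
flat = [
    ("multi_speaker_model", "P7910"),
    ("multi_speaker_model", "3615"),
    ("single_speaker_model", "default"),
]

models = group_by_model(flat)
for model, speakers in models.items():
    print(model, "->", speakers)
```

With this shape the download list would show one item per model, and the speaker list becomes a second selection made in the main interface.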
My suggestion to fix this issue of many voices using one model is to offer the model as a single download. Then, within the main interface, a person can choose among the many different voices that are available. This is similar to how the Coqui XTTS model works, where a person can choose different voice options from the main interface.
If you choose to do that, then I can go through some of the different voice models and add some metadata to them to indicate whether each voice is male or female.
I can submit this metadata as a text file here in the GitHub issue, or you can indicate your preferred format and I can provide that instead, so that adding it to the application will be as easy as adding the file and linking to its data.
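As one possible format, the metadata could be a small JSON file mapping speaker name to gender. The sketch below is an assumption, not an agreed format: the speaker names `0001` and `0002`, the gender values and the `label_for` helper are placeholders invented for illustration.

```python
import json

# Hypothetical metadata file content: speaker name -> gender.
# Names and values are placeholders, not real annotations.
METADATA_JSON = '{"0001": "female", "0002": "male"}'


def label_for(speaker, metadata):
    """Build a display label, falling back to the bare name if unannotated."""
    gender = metadata.get(speaker)
    return f"{speaker} ({gender})" if gender else speaker


metadata = json.loads(METADATA_JSON)

print(label_for("0001", metadata))
print(label_for("9999", metadata))
```

Speakers that have not been annotated yet simply keep their current name, so the file can be filled in gradually.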