Comments (2)
Thanks for letting me know about this model. Creating voice characteristics from a text prompt is very interesting and unique! Parler-TTS would be a great addition to Speech Note.
The only problem is that this model cannot generate long text. Therefore, you need to split the text into sentences and generate each sentence separately. Unfortunately, this causes each sentence to be generated in a different voice.
I'm waiting for Parler-TTS devs to fix this problem and then I will definitely integrate it into Speech Note.
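The sentence-by-sentence workaround described above can be sketched roughly as follows. This is only an illustration, not Speech Note's or Parler-TTS's actual code: `split_sentences`, `synthesize_long_text`, and the `synthesize` callback are hypothetical names, the splitter is a naive regex, and the stand-in synthesizer just returns silence.

```python
import re
from typing import Callable, List


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def synthesize_long_text(
    text: str, synthesize: Callable[[str], List[float]]
) -> List[float]:
    """Generate audio one sentence at a time and concatenate the samples.

    `synthesize` is a hypothetical callback standing in for a real TTS call.
    """
    audio: List[float] = []
    for sentence in split_sentences(text):
        audio.extend(synthesize(sentence))
    return audio


def fake_synthesize(sentence: str) -> List[float]:
    """Stand-in synthesizer (assumption): one silent sample per character."""
    return [0.0] * len(sentence)


if __name__ == "__main__":
    samples = synthesize_long_text("Hi. Ok.", fake_synthesize)
    print(len(samples))
```

Note that this only shows the chunking step. It does nothing to keep the voice consistent from one sentence to the next, which is the problem described above.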
from dsnote.
Valid points there - thanks for considering the model and finding this out!
I agree that switching voices between any two consecutive sentences is pretty awkward (IIRC, some included models - maybe WhisperSpeech? - already exhibit this to some extent) - particularly for my main use case (helping to quickly but still accurately skim scientific papers).
Feel free to close the issue unless you prefer to keep it open as a reminder.