Comments (2)
Best to post directly in the fork, but there is an issue for that already: idiap#65
The issue with the XTTS streaming code is that it relies a lot on internals of the transformers
library that can change a lot between versions. Is there any specific reason that you need both Parler and Coqui in the same environment?
from tts.
I'm building an app that uses a lot of different models, and I don't want to be restricted in what I can use, so I'm trying to stay on top of it. The original repo here (which I accidentally opened an issue on I guess) only goes up to 4.40, so at least your fork gets up to 4.42.4. I went to use Llama 3.1 recently in 4.42.4 and it had issues with configuration data not matching how 4.43 can understand (for rope scaling). I like in parler being able to describe the speaker, which I can't do in coqui, but parler isn't realtime on my machine and coqui is. I'm using streaming results for most of the things I'm doing so I don't really want to deal with the hassle of any overhead separating into more than one project. So I don't want to have two environments to maintain coqui being stuck in 4.42. So basically some things I want to use parler for what I think seems more natural but not real time, or coqui if real time is important given the use case. (I have use cases for both things). Sorry my train of thought is all over the place. I think I was just sort of hoping since I've seen some people apparently get past their issues where they have transformers 4.43 in their post, that maybe I'm missing a branch or patch that would let me install 4.43 and move on. Thank you for your work on this!
from tts.
Related Issues (20)
- [Feature request] faster load at startup
- [Feature request] Add progress bar for large text
- [Install Erroer]安装TTS库时反复出错 error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/ HOT 3
- [Bug] Can not install the library inside Docker container HOT 2
- [Feature request] Speed up voice cloning from the same speaker HOT 5
- [Feature request] Upgrade to Python3.12
- [Feature request] How should I use the server API in the source code? HOT 4
- How can I customize a speaker on server? HOT 1
- [Bug] occur a reference error when using it with flask_socketio HOT 1
- [Bug] Illegal Instruction (core dump) when running on Ubuntu 22.04 LTS on Raspberry Pi 4 hardware HOT 1
- [Bug] Lots of issues when installing, tried to fix it, says "No module named 'coqpit'" HOT 1
- [Bug] XTTS v2 - short utterances finetune doesn't work HOT 4
- [Bug] Assertion srcIndex < srcSelectDimSize HOT 1
- [Feature request] Adjust output audio speed in YourTTS HOT 5
- [Feature request] what causes the SHODAN effect?
- [Bug] ValueError: [!] Model file not found in the output path HOT 3
- [Feature request] Reducing logging (stdout) information when initializing TTS HOT 2
- [Bug?] TTS of "10. 9. 8. 7. 6. 5. 4. 3. 2. 1. Finished" seems to clog the system HOT 2
- why xtts v2 inference time used RAM double(or more 3x) then GPU or VRAM
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tts.