Comments (60)

AndrewBarfield avatar AndrewBarfield commented on May 16, 2024 17

I'm learning the code/API and performing experiments. I hope to contribute soon.

I'm also wondering if I can donate (money) to Coqui?

from tts.

lucascassiano avatar lucascassiano commented on May 16, 2024 7

great project! Excited to see this growing!

HoKim98 avatar HoKim98 commented on May 16, 2024 5

Hello, thanks for the great work! I'm a fan of Coqui TTS.

I'm porting parts of the project to Rust for the following reasons:

  • Predictable Performance
  • Static-typed Metadata & Model Management
  • Multithreaded Server Implementation
  • I just love Rust

The voice conversion (VC) from YourTTS has been implemented successfully. For this purpose, an example of saving/loading a pretrained VITS model has been added to the repo. I'm posting it here in the Milestones thread because I think my work can be helpful to others :)

windowshopr avatar windowshopr commented on May 16, 2024 5

Looking forward to the SSML implementation!

erogol avatar erogol commented on May 16, 2024 5

This roadmap issue is quite outdated. I'll keep it open to preserve the references to some of the issues and models we'd like to tackle, but I won't be updating it until a day officially becomes 48 hours.

csukuangfj avatar csukuangfj commented on May 16, 2024 4

Any plans to port the coqui-ai engine to Android? TTS on Android is very robotic (espeak, rhvoice, festival lite).

@paolo-caroni

Please take a look at
#3194

You can use sherpa-onnx to run VITS models from Coqui on Android and on embedded devices, e.g., a Raspberry Pi.

We have pre-built Android APKs for the VITS English models from Coqui.
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

kdavis-coqui avatar kdavis-coqui commented on May 16, 2024 3

I'm learning the code/API and performing experiments. I hope to contribute soon.

I'm also wondering if I can donate (money) to Coqui?

Wow! Thanks! Humbling.

We were setting up GitHub sponsors, but the tax implications were onerous.

We're currently exploring Patreon. So stay tuned!

a-froghyar avatar a-froghyar commented on May 16, 2024 3

@lucashueda Capacitron VAE: #510

erogol avatar erogol commented on May 16, 2024 3

@BillyBobQuebec I don't think I will implement these models anytime soon. But as they stand, contributions are welcome

Darth-Carrotpie avatar Darth-Carrotpie commented on May 16, 2024 2

Thumbs up for planning ONNX support. Hope it gets prioritized more!

agrinh avatar agrinh commented on May 16, 2024 1

@agrinh Why do you need to keep your own fork exactly? It'd be better to expand the conversation on gitter if you like.

Wow, thanks for the super fast reply. Sure, we can move the discussion to gitter.

Sadam1195 avatar Sadam1195 commented on May 16, 2024 1

Please add DC-TTS to the list of models.

A DC-TTS implementation with MIT-licensed code is available here
Paper: "Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention"
@erogol

erogol avatar erogol commented on May 16, 2024 1

@will-rice the plan is to mirror what we have in torch to TF as much as possible. It'd be great if you initiate the work

lucashueda avatar lucashueda commented on May 16, 2024 1

@lucashueda Capacitron VAE: #510

Oh nice, I hope to see Capacitron integrated soon. Then maybe in the future I'll be able to contribute some other expressive architectures.

hemath1001 avatar hemath1001 commented on May 16, 2024 1

Hi there! Thanks for your great work! I'm looking forward to training YourTTS on other languages. Will the training and fine-tuning code for YourTTS be published soon? I would be very grateful if you could give me an approximate time~ Have a nice day :-D

erogol avatar erogol commented on May 16, 2024 1

No immediate plans on that

erogol avatar erogol commented on May 16, 2024 1

@Darth-Carrotpie does "run in Unity" mean running it in the code, or integrating it into the Unity editor?

Also better to move this to a separate post under the Discussions

Darth-Carrotpie avatar Darth-Carrotpie commented on May 16, 2024 1

@Darth-Carrotpie does "run in Unity" mean running it in the code, or integrating it into the Unity editor?

Also better to move this to a separate post under the Discussions

Created a topic on ONNX at Discussions: #1479

MarkChrisE2091 avatar MarkChrisE2091 commented on May 16, 2024 1

Any new update?

csukuangfj avatar csukuangfj commented on May 16, 2024 1

Any plans to port the coqui-ai engine to Android? TTS on Android is very robotic (espeak, rhvoice, festival lite).

@paolo-caroni

We have supported it in k2-fsa/sherpa-onnx#508

The following is a YouTube video
https://www.youtube.com/watch?v=33QYuVzDORA

You can use all coqui-ai/TTS models and piper models listed in
https://github.com/k2-fsa/sherpa-onnx/releases/tag/tts-models
with k2-fsa/sherpa-onnx#508

agrinh avatar agrinh commented on May 16, 2024

@erogol Thanks for sharing the plans!

Do you have any thoughts on (or need help with) simplifying the dependencies a bit? I'm thinking that if TTS is used as a lib installed via pip, it might be nice to remove the visualisation dependencies only used in notebooks, remove the test/dev dependencies, and move e.g. tensorflow into extras to reduce the footprint. Personally, I would love to use this as a dependency rather than maintaining my own fork.
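
A minimal sketch of the extras idea suggested above, assuming setuptools-style `install_requires` / `extras_require`. The package names, group names, and the `resolve_requirements` helper are hypothetical and are not taken from the project's actual requirements files; the helper only illustrates which packages each install variant would pull in.

```python
# Hypothetical dependency groups; names are illustrative only and are
# not taken from the project's real requirements.
CORE_REQUIREMENTS = ["numpy", "torch", "soundfile"]

EXTRAS = {
    "tf": ["tensorflow"],                  # optional TensorFlow runtime
    "notebooks": ["matplotlib", "jupyter"],  # visualisation used only in notebooks
    "dev": ["pytest", "black"],            # test/dev tooling
}
# Convenience group that pulls in every optional dependency.
EXTRAS["all"] = sorted({pkg for group in EXTRAS.values() for pkg in group})


def resolve_requirements(selected_extras=()):
    """Return the packages that would be installed for the given extras."""
    resolved = list(CORE_REQUIREMENTS)
    for name in selected_extras:
        for pkg in EXTRAS[name]:
            if pkg not in resolved:
                resolved.append(pkg)
    return resolved


# In a setup.py these two objects would be passed to setuptools as:
#   setuptools.setup(..., install_requires=CORE_REQUIREMENTS, extras_require=EXTRAS)
```

With a layout like this, a plain `pip install <package>` would install only the core list, while `pip install "<package>[tf]"` would add the optional group on top of it.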

erogol avatar erogol commented on May 16, 2024

@agrinh Why do you need to keep your own fork exactly? It'd be better to expand the conversation on gitter if you like.

will-rice avatar will-rice commented on May 16, 2024

What were you thinking about the "TensorFlow run-time for training models"? Like giving the user the option of using TensorFlow or PyTorch? I wouldn't mind taking a stab at the TensorFlow part.

lucashueda avatar lucashueda commented on May 16, 2024

Are you guys planning to develop some expressive TTS architectures? I'm currently studying this topic and planning to implement some of them based on Coqui. Some of them just control the latent space using GST (Kwon et al. 2020) or RE (Sorin et al. 2020), and others actually change the architecture by adding a VAE, normalizing flows, or gradient reversal.

BillyBobQuebec avatar BillyBobQuebec commented on May 16, 2024

@erogol Looking forward to new end-to-end models being implemented, specifically Efficient-TTS! If the paper is accurate, it should blow most two-stage configurations out of the water: it seems to have a higher MOS than tacotron2+hifigan while also seeming significantly faster than glowtts plus the fastest vocoder. I have not seen a single repo replicating the EFTS-Wav architecture described in the paper, which was released 10 months ago. It would be amazing to see it in Coqui first!

WeberJulian avatar WeberJulian commented on May 16, 2024

@BillyBobQuebec but you can try VITS which is close to what you're describing :)

BillyBobQuebec avatar BillyBobQuebec commented on May 16, 2024

@BillyBobQuebec but you can try VITS which is close to what you're describing :)

Agreed, I am actually trying VITS right now. Unfortunately, I have some issues training with the Coqui implementation. I posted an issue about the bug today and hope I can get it resolved.

erogol avatar erogol commented on May 16, 2024

@kerryeon great work!! Thanks for sharing!

paolo-caroni avatar paolo-caroni commented on May 16, 2024

Any plans to port the coqui-ai engine to Android? TTS on Android is very robotic (espeak, rhvoice, festival lite).

erogol avatar erogol commented on May 16, 2024

@Darth-Carrotpie what is your use-case of ONNX? (Just want to get some feedback)

lexkoro avatar lexkoro commented on May 16, 2024

@Darth-Carrotpie what is your use-case of ONNX? (Just want to get some feedback)

Personally, it sounds like a good way to develop native Windows TTS applications without needing a Python runtime and/or big dependencies like PyTorch.

I tried exporting the VITS model to ONNX before, but didn't succeed.
There are also other obstacles besides executing the model, like phonemization. ^^

Currently I am using pythonnet to embed the required Python functions directly in my C# code. For Python I use the embedded version to make the app distributable.

Darth-Carrotpie avatar Darth-Carrotpie commented on May 16, 2024

@erogol I am trying to run models in Unity. Its environment is C#, .NET Standard 2.1. Having a universal model format also means that in the long run I can run models in an OS-agnostic manner. Of course, things like tokenization and phonemization are additional hurdles, but if there are open-source examples it's quite doable. For models needing tokenizers I've been using BlingFire successfully, so I reckon there are similar phonemizer helpers/libraries for languages other than Python, including C#.
Edit:
Things that embed Python into C#, like pythonnet, are convenient, though quite slow. In my case, I have multiple models loaded and running at the same time (~10), so needless interpreter overhead can become a critical bottleneck. Plus it might add unforeseen debugging issues.

desh-woes avatar desh-woes commented on May 16, 2024

Is there a Flutter package for using this TTS library? It might be an easy way to get this into real-world applications.

I am also very new to development but would like to contribute to this project. Can I work under someone?

erogol avatar erogol commented on May 16, 2024

@desh-woes there is no flutter package, unfortunately.

Can you DM me on Gitter or Element (our chat rooms) if you're willing to work on a particular thing?

omkarade avatar omkarade commented on May 16, 2024

How can I train a model using word embeddings as input?

erogol avatar erogol commented on May 16, 2024

@omkarade no support for that yet.

omkarade avatar omkarade commented on May 16, 2024

I want to train a custom YourTTS model on my dataset. Could you please share the detailed process?

WeberJulian avatar WeberJulian commented on May 16, 2024

I want to train a custom YourTTS model on my dataset. Could you please share the detailed process?

You can read the relevant documentation here: https://tts.readthedocs.io/en/latest/finetuning.html
Also, this is the roadmap thread; please ask for support here or open a new discussion/issue.

Kthulu120 avatar Kthulu120 commented on May 16, 2024

@erogol is the NaturalSpeech paper something you'd think about implementing? I could take a crack at it.

erogol avatar erogol commented on May 16, 2024

@Kthulu120 sure thing. Feel free to shoot a PR. We are always here to help.

EricTulowetzke avatar EricTulowetzke commented on May 16, 2024

Will there be a C API to this library like your STT library?

erogol avatar erogol commented on May 16, 2024

Not in the roadmap currently

erogol avatar erogol commented on May 16, 2024

@JediMaster25 you think the Roadmap is the right place for this convo?

stale avatar stale commented on May 16, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

NicoleKai avatar NicoleKai commented on May 16, 2024

It's good to see progress on a proper TTS project. I'm running Arch with no CUDA, and I'm going to see if I can convince it to use my CPU instead!
What would be really cool is if this could work with AVX512 on AMD chipsets.

nfaraji2002 avatar nfaraji2002 commented on May 16, 2024

Hi,
Thanks for the delightful code!
I want to use this version of TTS on a Raspberry Pi 4, but I think this version does not support real-time processing.
Are there TF utilities, as in Mozilla TTS, to convert trained models to tf-lite?
Could quantization work here for real-time processing?
I would appreciate some guidance on the roadmap in this regard.

Thanks
Neda
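
Whether a setup counts as "real-time" is usually judged by its real-time factor (RTF): synthesis time divided by the duration of the audio produced, with values below 1 meaning faster than real time. A minimal sketch of measuring it follows; `real_time_factor` and `fake_synthesize` are hypothetical names, and the placeholder synthesizer stands in for whatever model call is actually being benchmarked (quantized or not).

```python
import time


def real_time_factor(synthesize, text, sample_rate):
    """Return (seconds spent synthesizing) / (seconds of audio produced)."""
    start = time.perf_counter()
    samples = synthesize(text)
    elapsed = time.perf_counter() - start
    audio_seconds = len(samples) / sample_rate
    return elapsed / audio_seconds


def fake_synthesize(text):
    # Placeholder for a real model call: returns 0.1 s of silence
    # per input character at a 22050 Hz sample rate.
    return [0.0] * (len(text) * 2205)


rtf = real_time_factor(fake_synthesize, "hello world", sample_rate=22050)
print(f"RTF: {rtf:.6f}")
```

Running the same measurement on the target device before and after quantization would show whether the change actually brings the RTF below 1.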

jhj0517 avatar jhj0517 commented on May 16, 2024

Thank you for your great work for TTS.

Is there any progress on "Let the user pass a custom text cleaner function"?
If possible, I want to pass my own Korean cleaners.

erogol avatar erogol commented on May 16, 2024

You can currently do it by creating your own tokenizer or overloading the class.
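
A minimal sketch of the "overload the class" approach, under loud assumptions: `BaseTokenizer` and `KoreanTokenizer` below are hypothetical stand-ins written for illustration and are not Coqui TTS classes or its actual API. The point is only the pattern of subclassing a tokenizer and overriding its cleaning step.

```python
import re
import unicodedata


class BaseTokenizer:
    """Hypothetical stand-in for a library tokenizer (not Coqui's real class)."""

    def __init__(self, vocab):
        self.char_to_id = {ch: i for i, ch in enumerate(vocab)}

    def clean(self, text):
        # Default cleaner: lowercase and collapse runs of whitespace.
        return re.sub(r"\s+", " ", text.lower()).strip()

    def text_to_ids(self, text):
        # Characters missing from the vocabulary are dropped.
        return [self.char_to_id[ch] for ch in self.clean(text) if ch in self.char_to_id]


class KoreanTokenizer(BaseTokenizer):
    """Subclass that overrides only the cleaning step."""

    def clean(self, text):
        # NFKC folds compatibility forms, e.g. full-width digits to ASCII.
        text = unicodedata.normalize("NFKC", text)
        # Keep Hangul syllables, digits, spaces, and basic punctuation.
        text = re.sub(r"[^\uac00-\ud7a3 0-9.,!?]", "", text)
        return re.sub(r"\s+", " ", text).strip()


tokenizer = KoreanTokenizer("안녕 .")
print(tokenizer.text_to_ids("안녕  hello."))
```

The real class and method names to subclass would need to be checked against the library's source; only the overriding pattern carries over.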

stale avatar stale commented on May 16, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

MaxIakovliev avatar MaxIakovliev commented on May 16, 2024

Marvelous project.
Is there any way to donate to the core contributors?
I would prefer to use PayPal.

stale avatar stale commented on May 16, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

erogol avatar erogol commented on May 16, 2024

@MaxIakovliev you can use https://coqui.ai/ :)

jmlcoliveira avatar jmlcoliveira commented on May 16, 2024

Any update regarding SSML implementation?

erogol avatar erogol commented on May 16, 2024

We are not working on SSML currently; it is back on the list without a precise timeline.

offside609 avatar offside609 commented on May 16, 2024

Please do!!

stale avatar stale commented on May 16, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

violet17 avatar violet17 commented on May 16, 2024

Will you support bark-small? Thanks.

DmitryVN avatar DmitryVN commented on May 16, 2024

Please fix #3039 and #3282.
The problem persists, and because of it normal, correct use is not possible. Also, at the moment it cuts the phrase off at the end of each sentence, which results in jerky reading.

imevro avatar imevro commented on May 16, 2024

hi guys, why?

upd: found https://twitter.com/_josh_meyer_/status/1742522906041635166

Screenshot 2024-04-29 at 19 19 25

NicoleKai avatar NicoleKai commented on May 16, 2024

Their ability to exist and be profitable depended on how much better their tech was compared to everyone else's. It may not feel like it, but we are in the middle of an AI singularity. Coqui's business model might have stood a chance if they had started with this tech 5 years earlier, but it was probably too little, too late. ElevenLabs is probably eating their lunch :/
