there are new fast stt models from nvidia they claim to be better than whisper v3: on

new fast models about realtimestt HOT 4 CLOSED

koljab commented on September 16, 2024

new fast models

from realtimestt.

Comments (4)

francqz31 commented on September 16, 2024

i also know that coqui shut down , there is this really new tts model here https://github.com/PolyAI-LDN/pheme that claims to be really fast too , if both of this and parakeet got integrated into https://github.com/KoljaB/LocalAIVoiceChat i believe it will be a super boost better performance with faster speed , you can also apply some tricks to them to make them faster !!

from realtimestt.

KoljaB commented on September 16, 2024

The nvidia stt looks very promising. Word error rate better than whisper and if it's even faster it's for sure is a great candidate. Hope it does all languages well and not only english. I think currently it does not scale to low VRAM systems, Whisper offers tiny model...

pheme looks good, but tbh so do a lot of engines currently. For pure speed for example styletts2 is a really great engine. 6-7x faster than XTTS.

from realtimestt.

francqz31 commented on September 16, 2024

ok got it 👍 i just wanted to notify you , there is also a really new MIT licenced model that claims to be better than mistral 7B thus it mostly will be compatible with zypher! , it is only 2.7B so i bet it will be really fast https://huggingface.co/microsoft/phi-2
you might want to integrated into LocalAIVoiceChat for better speed while holding same accuracy!

from realtimestt.

francqz31 commented on September 16, 2024

now i will close the issue

from realtimestt.

Recommend Projects

new fast models about realtimestt HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent