Comments (6)
This sounds more like the behaviour of the on_realtime_transcription_update callback. Definitely should not occur with the default parameter set. My first guess would be you are maybe using the same callbacks for both the on_transcription_finished callback from the text method and the on_realtime_transcription_update callback from the AudioToTextRecorder constructor.
from realtimestt.
Some updates on this. Former faster-whisper version prob caused this (got somehow corrupted on pypi), I think it was 0.6.0. Neuer versions are fine.
from realtimestt.
Thanks! Should I use the 0.6.0 version of the faster-whisper instead of the latetest [v1.0.1]?(https://github.com/SYSTRAN/faster-whisper/releases/tag/v1.0.1)
Or just update the latest faster-whisper / RealtimeSTT version?
from realtimestt.
You can upgrade RealtimeSTT to newest version which uses latest faster-whisper 1.0.1 (this version is also referenced in the requirements file of RealtimeSTT) .
from realtimestt.
great! Another question is the latest v0.1.15 of RealtimeSTT has the parameter beam_size, it can be use to reduce the delay?
from realtimestt.
You trade-off accuracy vs speed: A larger beam_size yields better quality output because the model can explore more options and potentially avoid local minima in the search space. But also means slower performance because more sequences are evaluated at each step.
from realtimestt.
Related Issues (20)
- How to choose the CUDA version? HOT 2
- the on_realtime_transcription_update text issue HOT 3
- CUDA initialization error on current master HOT 1
- Porcupine integration on Mac HOT 1
- pyaudio Invalid number of channels HOT 1
- Float 16 to Float 32 quantization HOT 19
- recorder.text(process_text) does not stop recording HOT 6
- Support GPT-SoVITS TTS
- Passing audio bytes (Frames) to the AudioToTextRecorder HOT 10
- Multiple clients in browser-client code HOT 3
- How to calculate the latency of STT
- Syntax error line 520 audio_recorder.py HOT 3
- An attempt has been made to start a new process before the current process has finished its bootstrapping phase HOT 1
- how about the qulity of the batched faster-whisper? HOT 1
- Extract Phonemes from script.
- Interrupt the process uisng STT HOT 1
- packages for realtimestt cannot be found. HOT 1
- Noise reduction/Sensitivity HOT 9
- [MacOS Sonoma 14.5 - Intel] EOF ERROR in multiprocessing HOT 1
- STT: UnpicklingError: invalid load key, '\x00' HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from realtimestt.