Comments (7)
the link you send has "Verbose Json" section where the word timestamps are included.
Yes I have tried that, it does not work. Just try running that API via openai module and you won't get "words" key anymore, you will get "tokens" key instead which is basically the word embeddings of the words used.
To get words we need
response_format="verbose_json",
timestamp_granularities=["word"]
from whisper_streaming.
the link you send has "Verbose Json" section where the word timestamps are included.
Yes I have tried that, it does not work. Just try running that API via openai module and you won't get "words" key anymore, you will get "tokens" key instead which is basically the word embeddings of the words used.
To get words we need response_format="verbose_json", timestamp_granularities=["word"]
timestamp_granularities
is only available for transcription API, what about translation API?
I am using whisper_streaming
for translation task.
from whisper_streaming.
the link you send has "Verbose Json" section where the word timestamps are included.
from whisper_streaming.
no plan ahead. PR is welcome.
Thanks!
from whisper_streaming.
the link you send has "Verbose Json" section where the word timestamps are included.
Yes I have tried that, it does not work.
Just try running that API via openai module and you won't get "words" key anymore, you will get "tokens" key instead which is basically the word embeddings of the words used.
from whisper_streaming.
I also just encountered the timestamp_granularities
issue with translation. I am also not clear how to set target language for translation. Is it always translating back to English as the target language?
from whisper_streaming.
from whisper_streaming.
Related Issues (20)
- Server and Client for Web App HOT 1
- How to start the command correctly:whisper_online_server.py HOT 1
- Occasional Increasing Delay and Hallucination Issues HOT 5
- How to stream mic input on Windows? HOT 1
- can anyone tell me how to exactly run this project on windows from srarting as i dont know k=nothing how to run it HOT 1
- New Fork: Web client + WebSocket + own VAD impl. HOT 7
- Why does "whisper_online_server.py" close after I disconnect the client? HOT 3
- Getting this error on windows , any idea why ? HOT 1
- Cannot use finetuned model from huggingface HOT 4
- Unable to get sentences, only segments HOT 5
- t HOT 1
- Help to to run the program to transcirbe real time audio from mic HOT 1
- [BUG] Unnecessary socket re-creation inside with statement in whisper_online_server.py HOT 3
- Use of another backend HOT 2
- OpenAi Api not adding punctuation HOT 11
- Could this impletemented with micphone as voice input? HOT 1
- unexpected slow speed HOT 3
- [Quesion] about embedding whisper on deivce? HOT 1
- bilgi/ instructions notice learning HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisper_streaming.