Comments (4)
We do have different log levels, sounds like you're interested in logLevel: .info
rather than debug? For the CLI this is hardcoded at the moment, so we can add this as a new CLI argument. Anything specific you'd especially like to see in the info logs?
from whisperkit.
@quist00 Adding to Zach's point, if you are interested in a streaming application (as opposed to offlline processing of a file) and want to test/emulate the streaming performance on a file, you can use --stream-simulated
in the CLI.
from whisperkit.
It would be great if that could be added as a flag to the CLI. Streaming applications is not something we are really looking at currently. I work at a library and we want to use whisper internally to drastically reduce the time and expenditure to transcribe / translate items for oral history projects. I and many of my colleagues have Apple Silicon, so I really appreciate you all working on options for us that work more efficiently. I want to share it with other researchers around campus who also may have dozens or hundreds of hours of audio to contend with, so command line will really be the best options for most of them rather than a programmatic API approach given they are not programmers in most cases nor do they have any on staff.
As far as the output, I think the time stamps along with chunks of text as it goes is best. That way, novice users can get rough estimates of if I use this model with whisperkit, then I can estimate that I will get x minutes of output for a minute of processing. They can then grade the output and determine what is the right tradeoff of model verse processing time.
Thanks for you consideration.
from whisperkit.
@quist00 Could you perhaps give an example of the input/output pairs you're looking for? That way we can build toward a CLI flag that would result in an acceptable output for you.
from whisperkit.
Related Issues (20)
- how to fix "Ambiguous use of 'transcribe(audioPath:decodeOptions:callback:)'" HOT 3
- Experiencing crash on iPad8,8. HOT 2
- Add version support
- Is it possible to run turbo model on M1? HOT 1
- VAD: Finishes too early (almost empty transcript) with VAD enabled, completes successfully without. HOT 1
- VAD: Progress reporting doesn't report evenly when VAD is active
- VAD: First time loading a file it works, second and third time loading the same files it just blanks out HOT 1
- VAD issue with English-only models HOT 5
- Publish in CocoaPods HOT 2
- Is it possible to add a TranscriptionSegment callback? HOT 4
- Incorrect word timestamp when using VAD HOT 8
- Prompt string being returned as transcription result
- Segment order regression since 10mb chunking HOT 3
- Some advice for detecting song lyrics HOT 4
- Usage of `--prompt` drastically affects results HOT 4
- `detectLanguage` isn't working HOT 2
- Unexpected reduced timestamp tokens frequency in first 30s window
- Error calling whisperKit.prewarmModels() in iOS app HOT 3
- Only translating last 30s or so of the audio file. HOT 5
- WhisperAX demo doesn't copy with streaming HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisperkit.