Comments (1)
@jkrukowski has expressed interest in taking this on, here is a suggested approach:
Starting with CLI:
--audio-path
accepts an array of audio files now--batch-size
param which stands for maximum concurrent inference that can happen in the queue (default is unlimited). Open to suggestions on terminology, becausebatch
isn't exactly the same concept as concurrent async- Accuracy is within 1%, should not need to tradeoff here (whisperkittools can run this analysis)
- This code may help for reference https://github.com/ZachNagengast/similarity-search-kit/blob/ef737c4aad3af904a9925c476dc37c0dddc6ad8c/Sources/SimilaritySearchKit/Core/Index/SimilarityIndex.swift#L231-L244
from whisperkit.
Related Issues (20)
- Speculative decoding support with Eager streaming mode
- Disallow invalid `--language` values HOT 1
- Use `config.json` for device support filtering
- Incorrect timestamps (0.5sec off) HOT 7
- When transcribing non english audio files, I get results always translated in english :( Even though it's correct but not in the original language. HOT 1
- Clarify the translation capabilities in sample App
- Major difference with whisper.cpp? HOT 4
- How to use custom prompts? Couldn't find the usage from the examples. HOT 2
- English text normalization utilization for Eager Streaming Mode HOT 1
- @atiorh Today, I tested the latest version of the WhisperAX app consistently crashes when loading the 'openai_whisper-large-v3_947MB' file. HOT 1
- Implement test data-driven `unsupportedModelDeviceCombination` at init HOT 2
- Standard output while processing. HOT 4
- Can a local model be used without requesting the Hugging Face API? HOT 3
- How do I use a parameter like initial_prompt in Python's Whisper? HOT 1
- When my Mac connects to AirPods, starting recording fails. HOT 6
- Problems with "base" model HOT 4
- Audio input captures only the first channel HOT 1
- Instructions for running the cli version? HOT 3
- Medium models missing? HOT 4
- how to fix "Ambiguous use of 'transcribe(audioPath:decodeOptions:callback:)'" HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisperkit.