Comments (6)
Thanks for reporting, this issue has been noticed by some others as well. @cgfarmer4 any thoughts? We definitely need a way to fix this because it's a pretty common use case.
from whisperkit.
Decent idea if you want to flip the microphone while you're recording and a good improvement regardless. However, the App doesn't actually crash, it just causes AVAudioEngine to interpret the input in a way that is not compatible and throws warnings. Ive found it doesn't work right even if you start the app with Airpods sometimes.
from whisperkit.
Hey @ZachNagengast -- thanks for the tag here. I can reproduce. The issue doesn't seem isolated to Airpods, but for all bluetooth headsets. Or at least the 2 others I tried besides my Airpods.
From poking around the web, I see a lot of issues pointing to AVAudioEngine + Bluetooth problems but nothing sticks out as a way forward unfortunately.
from whisperkit.
One idea is that if there is some detection of a bluetooth device (not sure how to query this), we could omit the assign input function. 2 other options are:
- Remove the feature.
- Switch to using AVCaptureSession instead of AVAudioEngine.
from whisperkit.
@cgfarmer4 I have yet to try with a wired device yet but this might be related avaudioengine-crashes-when-plug-headphones-in-or-out
The thread recommends subscribing to the event AVAudioEngineConfigurationChangeNotification and updating the audio nodes. I think this should also be triggered for the bluetooth device getting connected. Thoughts ?
from whisperkit.
Ah I assumed the issue popped up when the app was already recording. I understand what you mean now. That's weird behaviour 🤔
from whisperkit.
Related Issues (20)
- Speculative decoding support with Eager streaming mode
- Disallow invalid `--language` values HOT 1
- Use `config.json` for device support filtering
- Incorrect timestamps (0.5sec off) HOT 7
- When transcribing non english audio files, I get results always translated in english :( Even though it's correct but not in the original language. HOT 1
- Clarify the translation capabilities in sample App
- Major difference with whisper.cpp? HOT 4
- How to use custom prompts? Couldn't find the usage from the examples. HOT 2
- English text normalization utilization for Eager Streaming Mode HOT 1
- @atiorh Today, I tested the latest version of the WhisperAX app consistently crashes when loading the 'openai_whisper-large-v3_947MB' file. HOT 1
- Implement test data-driven `unsupportedModelDeviceCombination` at init HOT 2
- Standard output while processing. HOT 4
- Can a local model be used without requesting the Hugging Face API? HOT 3
- How do I use a parameter like initial_prompt in Python's Whisper? HOT 1
- Problems with "base" model HOT 4
- Audio input captures only the first channel HOT 1
- Instructions for running the cli version? HOT 3
- Medium models missing? HOT 4
- how to fix "Ambiguous use of 'transcribe(audioPath:decodeOptions:callback:)'" HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisperkit.