Giter Club home page Giter Club logo

Comments (5)

mkiol avatar mkiol commented on May 9, 2024 1

Thank you for sharing the log.

I've managed to track the source of the problem. When you open a file from Google drive, the Portal (software responsible for showing "Open File" dialog window) passes empty file path to the app. This unexpected "empty file path" is handled terribly and that's why the application stops in an abnormal state.

Unfortunately I can't fix the main problem but I can fix this "abnormal state" issue.

from dsnote.

mkiol avatar mkiol commented on May 9, 2024 1

ab81df3

from dsnote.

mkiol avatar mkiol commented on May 9, 2024

Thanks for the report.

Flatpak app runs in the sandbox and doesn't have access to all files. Speech Note has permissions to unrestricted access to "Desktop", "Downloads", "Music", "Documents" and "Videos".

But.. when you use "File->Transcribe a file", so called "Portal" generates temporary link to the file outside sandbox and app should be able to use it. I'm testing this right now and it works well.

Can you please check the "beta" version? I've fixed few things. It can be downloaded from flathub-beta. After installing, please run the app with:

flatpak run --branch=beta net.mkiol.SpeechNote --verbose

and paste here the output log.

As a last resort measure, you can override the permissions and allow access to all files:

flatpak override net.mkiol.SpeechNote --filesystem=host

from dsnote.

chrisshaw avatar chrisshaw commented on May 9, 2024

Here's the output from flatpak run --branch=beta net.mkiol.SpeechNote --verbose, same thing occurs:

Log

[chrisshaw@chris-fedora ~]$ flatpak run --branch=beta net.mkiol.SpeechNote --verbose
QSocketNotifier: Can only be used with threads started with QThread
qt.qpa.qgnomeplatform: Could not find color scheme  ""
[I] 22:28:41.582 0x7f76eb60fd00 init:49 - logging to stderr enabled
[D] 22:28:41.582 0x7f76eb60fd00 () - translation: "en_US"
[W] 22:28:41.582 0x7f76eb60fd00 () - failed to install translation
[D] 22:28:41.582 0x7f76eb60fd00 () - starting standalone app
[D] 22:28:41.583 0x7f76eb60fd00 () - app: net.mkiol dsnote
[D] 22:28:41.583 0x7f76eb60fd00 () - config location: "/home/chrisshaw/.var/app/net.mkiol.SpeechNote/config"
[D] 22:28:41.583 0x7f76eb60fd00 () - data location: "/home/chrisshaw/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 22:28:41.583 0x7f76eb60fd00 () - cache location: "/home/chrisshaw/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 22:28:41.583 0x7f76eb60fd00 () - settings file: "/home/chrisshaw/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 22:28:41.583 0x7f76eb60fd00 () - platform: "wayland"
[D] 22:28:41.583 0x7f76eb60fd00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 22:28:41.583 0x7f76eb60fd00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 22:28:41.584 0x7f76eb60fd00 () - switching to style: "org.kde.desktop"
[D] 22:28:41.754 0x7f76eb60fd00 () - supported audio input devices:
ALSA lib ../../oss/pcm_oss.c:397:(_snd_pcm_oss_open) Cannot open device /dev/dsp
[D] 22:28:41.771 0x7f76eb60fd00 () - "pulse"
[D] 22:28:41.837 0x7f76eb60fd00 () - "upmix"
[D] 22:28:42.30 0x7f76eb60fd00 () - "default"
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
[D] 22:28:42.41 0x7f76eb60fd00 () - "alsa_input.usb-046d_HD_Pro_Webcam_C920_2AE889FF-02.analog-stereo"
[D] 22:28:42.41 0x7f76eb60fd00 () - "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
[D] 22:28:42.41 0x7f76eb60fd00 () - "alsa_input.pci-0000_00_1f.3.analog-stereo"
[D] 22:28:42.43 0x7f76eb60fd00 add_cuda_devices:227 - scanning for cuda devices
[D] 22:28:42.44 0x7f76eb60fd00 add_cuda_devices:236 - cuda version: driver=0, runtime=0
[W] 22:28:42.44 0x7f76eb60fd00 add_cuda_devices:241 - cudaGetDeviceCount returned: 35
[D] 22:28:42.44 0x7f76eb60fd00 add_hip_devices:264 - scanning for hip devices
[W] 22:28:42.44 0x7f76eb60fd00 hip_api:170 - failed to open hip lib: libamdhip64.so: cannot open shared object file: No such file or directory
[D] 22:28:42.44 0x7f76eb60fd00 add_opencl_devices:301 - scanning for opencl devices
[D] 22:28:42.138 0x7f76eb60fd00 add_opencl_devices:318 - opencl number of platforms: 2
[D] 22:28:42.138 0x7f76eb60fd00 add_opencl_devices:343 - opencl platform: 0, name=Clover, vendor=Mesa
[D] 22:28:42.138 0x7f76eb60fd00 add_opencl_devices:357 - opencl number of devices: 0
[D] 22:28:42.138 0x7f76eb60fd00 add_opencl_devices:343 - opencl platform: 1, name=AMD Accelerated Parallel Processing, vendor=Advanced Micro Devices, Inc.
[D] 22:28:42.138 0x7f76eb60fd00 add_opencl_devices:357 - opencl number of devices: 0
[D] 22:28:42.138 0x7f76d6fe5600 loop:56 - py executor loop started
[D] 22:28:42.170 0x7f76eb60fd00 () - starting service: app-standalone
[D] 22:28:42.185 0x7f76eb60fd00 () - module already unpacked: "rhvoicedata"
[D] 22:28:42.185 0x7f76c49ff600 () - config version: 41 41
[D] 22:28:42.185 0x7f76eb60fd00 () - module already unpacked: "rhvoiceconfig"
[D] 22:28:42.189 0x7f76c49ff600 () - checksum ok: "6571cb18" "en_whisper_base.ggml"
[D] 22:28:42.189 0x7f76c49ff600 () - found model: "en_whisper_base"
[D] 22:28:42.190 0x7f76c49ff600 () - checksum ok: "633a9f5b" "en_whisper_small.ggml"
[D] 22:28:42.190 0x7f76c49ff600 () - found model: "en_whisper_small"
[D] 22:28:42.201 0x7f76eb60fd00 () - module already unpacked: "espeakdata"
[D] 22:28:42.201 0x7f76eb60fd00 () - mbrola dir: "/app/bin"
[D] 22:28:42.201 0x7f76eb60fd00 () - espeak dir: "/app/bin"
[D] 22:28:42.201 0x7f76eb60fd00 () - default stt model not found: "en_whisper_small"
[D] 22:28:42.201 0x7f76eb60fd00 () - default tts model not found: "en"
[D] 22:28:42.202 0x7f76eb60fd00 () - default mnt lang not found: "en"
[D] 22:28:42.202 0x7f76eb60fd00 () - new default mnt lang: "en"
[D] 22:28:42.202 0x7f76eb60fd00 () - service refresh status, new state: busy
[D] 22:28:42.202 0x7f76eb60fd00 () - service state changed: unknown => busy
[D] 22:28:42.202 0x7f76eb60fd00 () - delaying features availability
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "am_espeak_am"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "bg_espeak_bg"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "bs_espeak_bs"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ca_espeak_ca"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "cs_espeak_cs"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "da_espeak_da"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "de_espeak_de"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "el_espeak_el"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "en_espeak_en"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "eo_espeak_eo"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "es_espeak_es"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "et_espeak_et"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "eu_espeak_eu"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "is_espeak_is"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "fa_espeak_fa"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "fi_espeak_fi"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "fr_espeak_fr"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "hi_espeak_hi"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "hr_espeak_hr"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "hu_espeak_hu"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "id_espeak_id"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "it_espeak_it"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ja_espeak_ja"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "kk_espeak_kk"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ko_espeak_ko"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "lv_espeak_lv"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "lt_espeak_lt"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "mk_espeak_mk"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ms_espeak_ms"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ne_espeak_ne"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "nl_espeak_nl"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "no_espeak_no"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "pt_espeak_pt"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "pt_espeak_pt_br"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ro_espeak_ro"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "ru_espeak_ru"
[D] 22:28:42.208 0x7f76c49ff600 () - found model: "sk_espeak_sk"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "sl_espeak_sl"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "sr_espeak_sr"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "sv_espeak_sv"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "sw_espeak_sw"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "th_espeak_th"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "tr_espeak_tr"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "uk_espeak_uk"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "ka_espeak_ka"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "ky_espeak_ky"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "la_espeak_la"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "tt_espeak_tt"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "sq_espeak_sq"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "uz_espeak_uz"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "vi_espeak_vi"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "zh_espeak_yue"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "zh_espeak_hak"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "zh_espeak_cmn"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "ga_espeak_ga"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "mt_espeak_mt"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "bn_espeak_bn"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "pl_espeak_pl"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "af_espeak_af"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "he_espeak_he"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "gu_espeak_gu"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "te_espeak_te"
[D] 22:28:42.209 0x7f76c49ff600 () - found model: "tn_espeak_tn"
[D] 22:28:42.219 0x7f76c49ff600 () - models changed
[D] 22:28:42.643 0x7f76eb60fd00 () - starting app: app-standalone
[D] 22:28:42.643 0x7f76eb60fd00 () - app service state: unknown => busy
[W] 22:28:42.644 0x7f76eb60fd00 () - hot keys are supported only under x11
logger error: invalid format string
qrc:/qml/main.qml:267:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:258:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:29:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[D] 22:28:43.185 0x7f76eb60fd00 onCompleted:153 - default font pixel size: 14
[D] 22:28:43.207 0x7f76eb60fd00 () - default tts model not found: "en"
[D] 22:28:43.207 0x7f76eb60fd00 () - default mnt lang not found: "en"
[D] 22:28:43.207 0x7f76eb60fd00 () - new default mnt lang: "en"
[D] 22:28:43.207 0x7f76eb60fd00 () - service refresh status, new state: idle
[D] 22:28:43.207 0x7f76eb60fd00 () - service state changed: busy => idle
[D] 22:28:43.207 0x7f76eb60fd00 () - service refresh status, new state: idle
[W] 22:28:43.257 0x7f76eb60fd00 ():164 - qrc:/qml/Translator.qml:164:9: QML ColumnLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "preferredWidth"
[D] 22:28:43.428 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:43.429 0x7f76eb60fd00 () - app service state: busy => idle
[D] 22:28:43.429 0x7f76eb60fd00 () - app stt available models: 0 => 2
[D] 22:28:43.430 0x7f76eb60fd00 () - update listen
[D] 22:28:43.430 0x7f76eb60fd00 () - app active stt model: "" => "en_whisper_small"
[D] 22:28:43.430 0x7f76eb60fd00 () - update listen
[W] 22:28:43.435 0x7f76eb60fd00 () - no available mnt langs
[W] 22:28:43.435 0x7f76eb60fd00 () - no available mnt out langs
[W] 22:28:43.435 0x7f76eb60fd00 () - no available tts models for in mnt
[W] 22:28:43.435 0x7f76eb60fd00 () - no available tts models for out mnt
[W] 22:28:43.435 0x7f76eb60fd00 () - invalid task, reseting task state
[D] 22:28:43.435 0x7f76eb60fd00 () - app stt configured: false => true
[D] 22:28:43.438 0x7f76eb60fd00 () - app busy: true => false
[D] 22:28:43.438 0x7f76eb60fd00 () - stt models changed
[D] 22:28:43.438 0x7f76eb60fd00 () - update listen
[D] 22:28:43.438 0x7f76eb60fd00 () - tts models changed
[D] 22:28:43.438 0x7f76eb60fd00 () - update listen
[W] 22:28:43.438 0x7f76eb60fd00 () - no available tts models for in mnt
[W] 22:28:43.438 0x7f76eb60fd00 () - no available tts models for out mnt
[D] 22:28:43.438 0x7f76eb60fd00 () - ttt models changed
[D] 22:28:43.443 0x7f76eb60fd00 () - mnt langs changed
[D] 22:28:43.443 0x7f76eb60fd00 () - update listen
[W] 22:28:43.443 0x7f76eb60fd00 () - no available mnt langs
[W] 22:28:43.443 0x7f76eb60fd00 () - no available mnt out langs
[D] 22:28:44.863 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:45.863 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:46.864 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:47.863 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:48.863 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:49.864 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:50.864 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:51.864 0x7f76eb60fd00 () - trying features availability update: false
[D] 22:28:52.437 0x7f76d6fe5600 libs_availability:147 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fr=true, gruut_it=true, gruut_ru=true, mecab=true, torch-cuda=false]
[D] 22:28:52.863 0x7f76eb60fd00 () - trying features availability update: true
[D] 22:28:52.863 0x7f76eb60fd00 () - features availability ready
[W] 22:28:52.864 0x7f76eb60fd00 has_cudnn:433 - failed to open cudnn lib: libcudnn.so: cannot open shared object file: No such file or directory
[W] 22:28:52.895 0x7f76eb60fd00 has_hip:79 - failed to open whisper-hipblas lib: libwhisper-hipblas.so: cannot open shared object file: No such file or directory
[D] 22:28:52.904 0x7f76eb60fd00 () - updating model using availability
[D] 22:28:52.904 0x7f76eb60fd00 () - updating model using availability internal
[D] 22:28:52.906 0x7f76eb60fd00 () - default tts model not found: "en"
[D] 22:28:52.906 0x7f76eb60fd00 () - default mnt lang not found: "en"
[D] 22:28:52.906 0x7f76eb60fd00 () - new default mnt lang: "en"
[D] 22:28:52.906 0x7f76eb60fd00 () - service refresh status, new state: idle
[D] 22:28:52.906 0x7f76eb60fd00 () - stt models changed
[D] 22:28:52.906 0x7f76eb60fd00 () - update listen
[D] 22:28:52.906 0x7f76eb60fd00 () - tts models changed
[D] 22:28:52.906 0x7f76eb60fd00 () - update listen
[W] 22:28:52.906 0x7f76eb60fd00 () - no available tts models for in mnt
[W] 22:28:52.906 0x7f76eb60fd00 () - no available tts models for out mnt
[D] 22:28:52.906 0x7f76eb60fd00 () - ttt models changed
[D] 22:28:52.911 0x7f76eb60fd00 () - mnt langs changed
[D] 22:28:52.911 0x7f76eb60fd00 () - update listen
[W] 22:28:52.911 0x7f76eb60fd00 () - no available mnt langs
[W] 22:28:52.911 0x7f76eb60fd00 () - no available mnt out langs
[D] 22:28:58.854 0x7f76eb60fd00 () - default tts model not found: "en"
[D] 22:28:58.854 0x7f76eb60fd00 () - default mnt lang not found: "en"
[D] 22:28:58.854 0x7f76eb60fd00 () - new default mnt lang: "en"
[D] 22:28:58.854 0x7f76eb60fd00 () - choosing model for id: "en_whisper_small" "en"
[D] 22:28:58.855 0x7f76eb60fd00 () - restart stt engine config: "lang=en, model-files=[model-file=/home/chrisshaw/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_small.ggml, scorer-file=, ttt-model-file=], speech-mode=automatic, vad-mode=aggressiveness-3, speech-started=0, use-gpu=0, gpu-device=[id=-1, api=opencl, name=, platform-name=]"
[D] 22:28:58.855 0x7f76eb60fd00 () - new stt engine required
[D] 22:28:58.856 0x7f76eb60fd00 open_whisper_lib:150 - using whisper-openblas
[D] 22:29:00.124 0x7f76eb60fd00 make_wparams:385 - cpu info: arch=x86_64, cores=4
[D] 22:29:00.124 0x7f76eb60fd00 make_wparams:387 - using threads: 4/4
[D] 22:29:00.124 0x7f76eb60fd00 make_wparams:389 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | COREML = 0 | OPENVINO = 0 | 
[D] 22:29:00.124 0x7f76eb60fd00 start:199 - starting engine
[D] 22:29:00.124 0x7f76eb60fd00 start:207 - engine started
[D] 22:29:00.124 0x7f75cc5e6600 start_processing:244 - processing started
[D] 22:29:00.124 0x7f76eb60fd00 () - creating audio source
[D] 22:29:00.124 0x7f75cc5e6600 set_processing_state:430 - processing state: idle => initializing
[D] 22:29:00.124 0x7f76eb60fd00 () - mic source created
[D] 22:29:00.124 0x7f75cc5e6600 set_processing_state:437 - speech detection status: no-speech => initializing (no-speech)
[D] 22:29:00.124 0x7f75cc5e6600 () - service refresh status, new state: idle
[D] 22:29:00.124 0x7f75cc5e6600 () - task state changed: 0 => 3
[D] 22:29:00.124 0x7f75cc5e6600 create_whisper_model:216 - creating whisper model
whisper_init_from_file_no_state: loading model from '/home/chrisshaw/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_small.ggml'
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 768
whisper_model_load: n_audio_head  = 12
whisper_model_load: n_audio_layer = 12
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 768
whisper_model_load: n_text_head   = 12
whisper_model_load: n_text_layer  = 12
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 9
whisper_model_load: qntvr         = 2
whisper_model_load: type          = 3
whisper_model_load: adding 1607 extra tokens
whisper_model_load: model ctx     =  180.95 MB
whisper_model_load: model size    =  180.71 MB
whisper_init_state: kv self size  =   15.75 MB
[D] 22:29:00.380 0x7f76eb60fd00 () - using audio input: "alsa_input.usb-046d_HD_Pro_Webcam_C920_2AE889FF-02.analog-stereo"
whisper_init_state: kv cross size =   52.73 MB
whisper_init_state: compute buffer (conv)   =   19.96 MB
whisper_init_state: compute buffer (encode) =  122.04 MB
whisper_init_state: compute buffer (cross)  =    5.86 MB
whisper_init_state: compute buffer (decode) =   36.17 MB
[D] 22:29:00.670 0x7f75cc5e6600 create_whisper_model:226 - whisper model created
[D] 22:29:00.670 0x7f75cc5e6600 set_processing_state:430 - processing state: initializing => idle
[D] 22:29:00.670 0x7f75cc5e6600 set_processing_state:437 - speech detection status: initializing => no-speech (no-speech)
[D] 22:29:00.670 0x7f75cc5e6600 () - service refresh status, new state: idle
[D] 22:29:00.670 0x7f75cc5e6600 () - task state changed: 3 => 0
[D] 22:29:00.698 0x7f76eb60fd00 () - audio state: IdleState
[D] 22:29:00.698 0x7f76eb60fd00 () - service refresh status, new state: listening-auto
[D] 22:29:00.698 0x7f76eb60fd00 () - service state changed: idle => listening-auto
[W] 22:29:00.699 0x7f76eb60fd00 () - ignore TaskStatePropertyChanged signal
[W] 22:29:00.699 0x7f76eb60fd00 () - ignore TaskStatePropertyChanged signal
[D] 22:29:00.699 0x7f76eb60fd00 () - app current task: -1 => 0
[W] 22:29:00.699 0x7f76eb60fd00 () - invalid task, reseting task state
[D] 22:29:00.699 0x7f76eb60fd00 () - app service state: idle => listening-auto
[W] 22:29:00.703 0x7f76eb60fd00 () - no available mnt langs
[W] 22:29:00.703 0x7f76eb60fd00 () - no available mnt out langs
[W] 22:29:00.703 0x7f76eb60fd00 () - no available tts models for in mnt
[W] 22:29:00.703 0x7f76eb60fd00 () - no available tts models for out mnt
[W] 22:29:00.703 0x7f76eb60fd00 () - invalid task, reseting task state
[D] 22:29:00.888 0x7f76eb60fd00 () - audio state: ActiveState
[D] 22:29:02.263 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=true, eof=false
[D] 22:29:02.293 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:03.663 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:03.674 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:05.264 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:05.295 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:06.664 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:06.697 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:08.263 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:08.295 0x7f75cc5e6600 process_buff:268 - vad: no speech

** (dsnote:2): WARNING **: 22:29:08.534: atk-bridge: get_device_events_reply: unknown signature
[D] 22:29:09.664 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:09.699 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:11.263 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:11.297 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:12.664 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:12.697 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:14.264 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:14.298 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:15.664 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:15.702 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:17.264 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:17.304 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:18.664 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:18.703 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:20.263 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:20.281 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:21.663 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:21.707 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:23.263 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:23.279 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:24.663 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:24.703 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:26.264 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:26.304 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:27.663 0x7f75cc5e6600 process_buff:236 - process samples buf: mode=automatic, in-buf size=24000, speech-buf size=0, sof=false, eof=false
[D] 22:29:27.698 0x7f75cc5e6600 process_buff:268 - vad: no speech
[D] 22:29:29.41 0x7f76eb60fd00 () - cancel
[D] 22:29:29.41 0x7f76eb60fd00 () - stop stt engine
[D] 22:29:29.41 0x7f76eb60fd00 stop:225 - stop requested
[D] 22:29:29.41 0x7f76eb60fd00 stop_processing_impl:207 - whisper cancel
[D] 22:29:29.41 0x7f75cc5e6600 flush:446 - flush: exit
[D] 22:29:29.41 0x7f75cc5e6600 reset_in_processing:356 - reset in processing
[D] 22:29:29.41 0x7f75cc5e6600 start_processing:279 - processing ended
[D] 22:29:29.41 0x7f76eb60fd00 stop:240 - stop completed
[D] 22:29:29.41 0x7f76eb60fd00 () - mic source dtor
[D] 22:29:29.41 0x7f76eb60fd00 () - audio state: SuspendedState
[D] 22:29:29.42 0x7f76eb60fd00 () - audio ended
[D] 22:29:29.44 0x7f76eb60fd00 () - service refresh status, new state: idle
[D] 22:29:29.44 0x7f76eb60fd00 () - service state changed: listening-auto => idle
[D] 22:29:29.44 0x7f76eb60fd00 () - service refresh status, new state: idle
[D] 22:29:29.45 0x7f76eb60fd00 () - app current task: 0 => -1
[W] 22:29:29.45 0x7f76eb60fd00 () - invalid task, reseting task state
[D] 22:29:29.45 0x7f76eb60fd00 () - app service state: listening-auto => idle
[W] 22:29:29.64 0x7f76eb60fd00 () - no available mnt langs
[W] 22:29:29.64 0x7f76eb60fd00 () - no available mnt out langs
[W] 22:29:29.64 0x7f76eb60fd00 () - no available tts models for in mnt
[W] 22:29:29.64 0x7f76eb60fd00 () - no available tts models for out mnt
[W] 22:29:29.64 0x7f76eb60fd00 () - invalid task, reseting task state
[D] 22:29:38.365 0x7f76eb60fd00 () - exiting

from dsnote.

mkiol avatar mkiol commented on May 9, 2024

This change is included in the recent 4.3.0 release.

from dsnote.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.