mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Mozilla Public License 2.0
588 stars 20 forks source link

The app is crashing when GPU acceleration is enabled using any Whisper model #137

Closed mediocretwo closed 2 weeks ago

mediocretwo commented 6 months ago

System Info

Distribution: Fedora Workstation 40 Desktop Environment: Gnome 46 GPU: AMD Radeon Rx 5700XT CPU: AMD Ryzen 5 3600 RAM: 16 GB DDR4

Hi, just installed the app and it works flawlessly on CPU processing. However, when I enable GPU acceleration and use any of the Whisper models, the app freezes for 2 seconds and crashes after pressing 'Listen' button. I have installed the AMD addon flatpak and selected the ROC GPU acceleration method. When I disable GPU acceleration it works flawlessly again. Would appreciate any help. Thanks!

mkiol commented 6 months ago

Hi. Thanks for the report.

Could you please run the app with --verbose option and paste here an output.

flatpak run net.mkiol.SpeechNote --verbose
mkiol commented 6 months ago

Most likely your GPU is not supported by ROCm.

You may try to enable "Override GPU version" in the settings ("Other" tab). Set the value to "10.3.0" and see if it helped after restarting the app.

image

mediocretwo commented 6 months ago

I applied the version change and it has fixed the crashes. However, when I record some audio and try to process it, the app is just processing it forever and not giving any text output. The processing animation in bottom left is going on and on but there is no output. If I check GPU usage with nvtop, as soon as I press 'Stop' after speaking the GPU usage spikes to 99% and is staying there forever. Pressing Cancel is also not stopping the loop. I have to close the app to get back the GPU to normal levels. This is happening with both Whisper Large and Whisper Tiny models.

Attaching the log again for you to check:

QSocketNotifier: Can only be used with threads started with QThread
QSocketNotifier: Can only be used with threads started with QThread
qt.qpa.qgnomeplatform: Could not find color scheme  ""
qt.qpa.qgnomeplatform: Could not find color scheme  ""
[I] 19:44:31.462 0x7fa801d81d00 init:49 - logging to stderr enabled
[D] 19:44:31.462 0x7fa801d81d00 () - version: 4.4.0
[I] 19:44:31.462 0x7fa801d81d00 init:49 - logging to stderr enabled
[D] 19:44:31.462 0x7fa801d81d00 () - version: 4.4.0
[D] 19:44:31.463 0x7fa801d81d00 parse_cpuinfo:117 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sev sev_es
[D] 19:44:31.463 0x7fa801d81d00 parse_cpuinfo:117 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sev sev_es
[D] 19:44:31.463 0x7fa801d81d00 parse_cpuinfo:125 - cpuinfo: processor-count=12, flags=[avx, avx2, fma, f16c, ]
[D] 19:44:31.463 0x7fa801d81d00 parse_cpuinfo:125 - cpuinfo: processor-count=12, flags=[avx, avx2, fma, f16c, ]
[D] 19:44:31.463 0x7fa801d81d00 () - translation: "en_US"
[W] 19:44:31.463 0x7fa801d81d00 () - failed to install translation
[D] 19:44:31.463 0x7fa801d81d00 () - starting standalone app
[D] 19:44:31.463 0x7fa801d81d00 () - translation: "en_US"
[W] 19:44:31.463 0x7fa801d81d00 () - failed to install translation
[D] 19:44:31.463 0x7fa801d81d00 () - starting standalone app
[D] 19:44:31.464 0x7fa801d81d00 () - app: net.mkiol dsnote
[D] 19:44:31.464 0x7fa801d81d00 () - config location: "/home/asish/.var/app/net.mkiol.SpeechNote/config"
[D] 19:44:31.464 0x7fa801d81d00 () - app: net.mkiol dsnote
[D] 19:44:31.464 0x7fa801d81d00 () - config location: "/home/asish/.var/app/net.mkiol.SpeechNote/config"
[D] 19:44:31.464 0x7fa801d81d00 () - data location: "/home/asish/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 19:44:31.464 0x7fa801d81d00 () - cache location: "/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 19:44:31.464 0x7fa801d81d00 () - settings file: "/home/asish/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 19:44:31.464 0x7fa801d81d00 () - platform: "wayland"
[D] 19:44:31.464 0x7fa801d81d00 () - data location: "/home/asish/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 19:44:31.464 0x7fa801d81d00 () - cache location: "/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 19:44:31.464 0x7fa801d81d00 () - settings file: "/home/asish/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 19:44:31.464 0x7fa801d81d00 () - platform: "wayland"
[D] 19:44:31.464 0x7fa801d81d00 () - amd addon exists
[D] 19:44:31.464 0x7fa801d81d00 () - enforcing num threads: 0
[D] 19:44:31.464 0x7fa801d81d00 () - amd addon exists
[D] 19:44:31.464 0x7fa801d81d00 () - enforcing num threads: 0
[D] 19:44:31.702 0x7fa801d81d00 () - supported audio input devices:
[D] 19:44:31.702 0x7fa801d81d00 () - supported audio input devices:
ALSA lib ../../oss/pcm_oss.c:397:(_snd_pcm_oss_open) Cannot open device /dev/dsp
ALSA lib ../../oss/pcm_oss.c:397:(_snd_pcm_oss_open) Cannot open device /dev/dsp
[D] 19:44:31.711 0x7fa801d81d00 () - "pulse"
[D] 19:44:31.711 0x7fa801d81d00 () - "pulse"
[D] 19:44:31.732 0x7fa801d81d00 () - "default"
[D] 19:44:31.732 0x7fa801d81d00 () - "default"
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
[D] 19:44:32.43 0x7fa801d81d00 () - "front:CARD=C615,DEV=0"
[D] 19:44:32.43 0x7fa801d81d00 () - "front:CARD=C615,DEV=0"
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.pci-0000_2a_00.4.analog-stereo.monitor"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.pci-0000_2a_00.4.analog-stereo"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.usb-046d_HD_Webcam_C615_940C6E50-00.mono-fallback"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.pci-0000_2a_00.4.analog-stereo.monitor"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.pci-0000_2a_00.4.analog-stereo"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_input.usb-046d_HD_Webcam_C615_940C6E50-00.mono-fallback"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.analog-stereo.monitor"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.pci-0000_28_00.1.hdmi-stereo-extra2.monitor"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.analog-stereo.monitor"
[D] 19:44:32.45 0x7fa801d81d00 () - "alsa_output.pci-0000_28_00.1.hdmi-stereo-extra2.monitor"
[D] 19:44:32.71 0x7fa801d81d00 () - starting service: app-standalone
[D] 19:44:32.71 0x7fa801d81d00 () - starting service: app-standalone
[D] 19:44:32.74 0x7fa801d81d00 () - mbrola dir: "/app/bin"
[D] 19:44:32.74 0x7fa801d81d00 () - espeak dir: "/app/bin"
[D] 19:44:32.74 0x7fa801d81d00 () - overrided gpu version: "10.3.0"
[D] 19:44:32.74 0x7fa801d81d00 () - HSA_OVERRIDE_GFX_VERSION: 10.3.0
[D] 19:44:32.74 0x7fa801d81d00 () - mbrola dir: "/app/bin"
[D] 19:44:32.74 0x7fa801d81d00 () - espeak dir: "/app/bin"
[D] 19:44:32.74 0x7fa801d81d00 () - overrided gpu version: "10.3.0"
[D] 19:44:32.74 0x7fa801d81d00 () - HSA_OVERRIDE_GFX_VERSION: 10.3.0
[D] 19:44:32.75 0x7fa7ef400600 loop:75 - py executor loop started
[D] 19:44:32.75 0x7fa7ef400600 loop:75 - py executor loop started
[D] 19:44:32.79 0x7fa801d81d00 () - module already unpacked: "rhvoicedata"
[D] 19:44:32.79 0x7fa801d81d00 () - module already unpacked: "rhvoicedata"
[D] 19:44:32.80 0x7fa801d81d00 () - module already unpacked: "rhvoiceconfig"
[D] 19:44:32.80 0x7fa801d81d00 () - module already unpacked: "rhvoiceconfig"
[D] 19:44:32.84 0x7fa7efe00600 () - config version: 65 65
[D] 19:44:32.84 0x7fa7efe00600 () - config version: 65 65
[D] 19:44:32.85 0x7fa801d81d00 () - module already unpacked: "espeakdata"
[D] 19:44:32.85 0x7fa801d81d00 () - module already unpacked: "espeakdata"
[D] 19:44:32.85 0x7fa801d81d00 () - default stt model not found: "en_whisper_tiny"
[D] 19:44:32.85 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - default stt model not found: "en_whisper_tiny"
[D] 19:44:32.85 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:32.85 0x7fa801d81d00 () - service refresh status, new state: busy
[D] 19:44:32.85 0x7fa801d81d00 () - service state changed: unknown => busy
[D] 19:44:32.85 0x7fa801d81d00 () - delaying features availability
[D] 19:44:32.85 0x7fa801d81d00 () - service refresh status, new state: busy
[D] 19:44:32.85 0x7fa801d81d00 () - service state changed: unknown => busy
[D] 19:44:32.85 0x7fa801d81d00 () - delaying features availability
[D] 19:44:32.88 0x7fa801d81d00 () - runtime prefix: "/app"
[D] 19:44:32.88 0x7fa801d81d00 () - runtime prefix: "/app"
[D] 19:44:32.88 0x7fa801d81d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 19:44:32.88 0x7fa801d81d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 19:44:32.88 0x7fa7ef400600 libs_availability:61 - checking: torch cuda
[D] 19:44:32.88 0x7fa7ef400600 libs_availability:61 - checking: torch cuda
[D] 19:44:32.88 0x7fa801d81d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 19:44:32.88 0x7fa801d81d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 19:44:32.88 0x7fa801d81d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 19:44:32.88 0x7fa801d81d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 19:44:32.88 0x7fa801d81d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 19:44:32.88 0x7fa801d81d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 19:44:32.89 0x7fa801d81d00 () - switching to style: "Plasma"
[D] 19:44:32.89 0x7fa801d81d00 () - switching to style: "Plasma"
[D] 19:44:32.111 0x7fa7efe00600 () - models changed
[D] 19:44:32.111 0x7fa7efe00600 () - models changed
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject::installEventFilter(): Cannot filter events for objects in a different thread.
[W] 19:44:32.114 0x7fa7ee600600 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc577a4b50), parent's thread is QThread(0x55571bb0b200), current thread is QThread(0x55571c1456b0)
[W] 19:44:32.114 0x7fa7ee600600 () - QObject::installEventFilter(): Cannot filter events for objects in a different thread.
[D] 19:44:32.740 0x7fa801d81d00 () - starting app: app-standalone
[D] 19:44:32.740 0x7fa801d81d00 () - starting app: app-standalone
[D] 19:44:32.740 0x7fa801d81d00 () - app service state: unknown => busy
[D] 19:44:32.740 0x7fa801d81d00 () - app service state: unknown => busy
[W] 19:44:32.740 0x7fa801d81d00 () - hot keys are supported only under x11
[W] 19:44:32.740 0x7fa801d81d00 () - hot keys are supported only under x11
logger error: invalid format string
qrc:/qml/main.qml:340:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:340:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:331:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:331:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:29:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:29:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/MainToolBar.qml:282:13: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/MainToolBar.qml:282:13: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[D] 19:44:33.30 0x7fa801d81d00 onCompleted:180 - default font pixel size: 14
[D] 19:44:33.30 0x7fa801d81d00 onCompleted:180 - default font pixel size: 14
[D] 19:44:33.39 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - service refresh status, new state: busy
[D] 19:44:33.39 0x7fa801d81d00 () - service refresh status, new state: busy
[D] 19:44:33.39 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:33.39 0x7fa801d81d00 () - service refresh status, new state: busy
[D] 19:44:33.39 0x7fa801d81d00 () - service refresh status, new state: busy
[W] 19:44:33.89 0x7fa801d81d00 ():219 - qrc:/qml/Translator.qml:219:9: QML ColumnLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "preferredWidth"
[W] 19:44:33.89 0x7fa801d81d00 ():219 - qrc:/qml/Translator.qml:219:9: QML ColumnLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "preferredWidth"
[D] 19:44:33.205 0x7fa801d81d00 () - stt models changed
[D] 19:44:33.205 0x7fa801d81d00 () - stt models changed
[D] 19:44:33.205 0x7fa801d81d00 () - update listen
[D] 19:44:33.205 0x7fa801d81d00 () - app stt configured: false => true
[D] 19:44:33.205 0x7fa801d81d00 () - update listen
[D] 19:44:33.205 0x7fa801d81d00 () - app stt configured: false => true
[D] 19:44:33.206 0x7fa801d81d00 () - app active stt model: "" => "en_whisper_tiny"
[D] 19:44:33.206 0x7fa801d81d00 () - app active stt model: "" => "en_whisper_tiny"
[D] 19:44:33.206 0x7fa801d81d00 () - update listen
[D] 19:44:33.206 0x7fa801d81d00 () - tts models changed
[D] 19:44:33.206 0x7fa801d81d00 () - update listen
[D] 19:44:33.206 0x7fa801d81d00 () - tts models changed
[D] 19:44:33.206 0x7fa801d81d00 () - update listen
[W] 19:44:33.206 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:33.206 0x7fa801d81d00 () - no available tts models for out mnt
[D] 19:44:33.206 0x7fa801d81d00 () - update listen
[W] 19:44:33.206 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:33.206 0x7fa801d81d00 () - no available tts models for out mnt
[D] 19:44:33.206 0x7fa801d81d00 () - ttt models changed
[D] 19:44:33.206 0x7fa801d81d00 () - ttt models changed
[D] 19:44:33.210 0x7fa801d81d00 () - mnt langs changed
[D] 19:44:33.210 0x7fa801d81d00 () - mnt langs changed
[D] 19:44:33.210 0x7fa801d81d00 () - update listen
[W] 19:44:33.210 0x7fa801d81d00 () - no available mnt langs
[D] 19:44:33.210 0x7fa801d81d00 () - update listen
[W] 19:44:33.210 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:33.210 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:33.210 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:33.215 0x7fa801d81d00 void KDirWatchPrivate::addEntry(KDirWatch*, const QString&, Entry*, bool, KDirWatch::WatchModes):797 - Cannot watch QRC-like path ":/icons/hicolor/index.theme"
[W] 19:44:33.215 0x7fa801d81d00 void KDirWatchPrivate::addEntry(KDirWatch*, const QString&, Entry*, bool, KDirWatch::WatchModes):797 - Cannot watch QRC-like path ":/icons/hicolor/index.theme"
[D] 19:44:33.381 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:33.381 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:33.573 0x7fa7ef400600 libs_availability:69 - checking: coqui tts
[D] 19:44:33.573 0x7fa7ef400600 libs_availability:69 - checking: coqui tts
[D] 19:44:33.573 0x7fa7ef400600 libs_availability:77 - checking: faster-whisper
[D] 19:44:33.573 0x7fa7ef400600 libs_availability:77 - checking: faster-whisper
[D] 19:44:33.975 0x7fa7ef400600 libs_availability:85 - checking: transformers
[D] 19:44:33.975 0x7fa7ef400600 libs_availability:87 - checking: accelerate
[D] 19:44:33.975 0x7fa7ef400600 libs_availability:85 - checking: transformers
[D] 19:44:33.975 0x7fa7ef400600 libs_availability:87 - checking: accelerate
[D] 19:44:34.180 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:34.180 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:34.430 0x7fa7ef400600 libs_availability:95 - checking: unikud
[D] 19:44:34.430 0x7fa7ef400600 libs_availability:95 - checking: unikud
[D] 19:44:34.430 0x7fa7ef400600 libs_availability:106 - checking: mimic3 tts
[D] 19:44:34.430 0x7fa7ef400600 libs_availability:106 - checking: mimic3 tts
[D] 19:44:35.79 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:35.79 0x7fa801d81d00 () - trying features availability update: false
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:114 - checking: gruut
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:118 - checking: gruut-de
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:114 - checking: gruut
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:118 - checking: gruut-de
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:126 - checking: gruut-es
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:126 - checking: gruut-es
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:134 - checking: gruut-fr
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:134 - checking: gruut-fr
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:142 - checking: gruut-it
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:142 - checking: gruut-it
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:150 - checking: gruut-ru
[D] 19:44:35.105 0x7fa7ef400600 libs_availability:150 - checking: gruut-ru
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:158 - checking: gruut-fa
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:158 - checking: gruut-fa
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:166 - checking: gruut-sw
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:166 - checking: gruut-sw
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:174 - checking: gruut-nl
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:174 - checking: gruut-nl
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:185 - checking: mecab
[D] 19:44:35.106 0x7fa7ef400600 libs_availability:185 - checking: mecab
[D] 19:44:35.108 0x7fa7ef400600 libs_availability:187 - checking: unidic-lite
[D] 19:44:35.108 0x7fa7ef400600 libs_availability:187 - checking: unidic-lite
[D] 19:44:35.109 0x7fa7ef400600 libs_availability:194 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=true]
[D] 19:44:35.109 0x7fa7ef400600 libs_availability:194 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=true]
[D] 19:44:36.81 0x7fa801d81d00 () - trying features availability update: true
[D] 19:44:36.81 0x7fa801d81d00 () - features availability ready
[D] 19:44:36.81 0x7fa801d81d00 () - trying features availability update: true
[D] 19:44:36.81 0x7fa801d81d00 () - features availability ready
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudart.so: libcudart.so: cannot open shared object file: No such file or directory
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudart.so: libcudart.so: cannot open shared object file: No such file or directory
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudnn.so: libcudnn.so: cannot open shared object file: No such file or directory
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudnn.so: libcudnn.so: cannot open shared object file: No such file or directory
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudnn.so.8: libcudnn.so.8: cannot open shared object file: No such file or directory
[W] 19:44:36.81 0x7fa801d81d00 has_lib:477 - failed to open libcudnn.so.8: libcudnn.so.8: cannot open shared object file: No such file or directory
[W] 19:44:36.87 0x7fa801d81d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[W] 19:44:36.87 0x7fa801d81d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[D] 19:44:36.96 0x7fa801d81d00 () - updating model using availability
[D] 19:44:36.96 0x7fa801d81d00 () - updating model using availability internal
[D] 19:44:36.96 0x7fa801d81d00 () - updating model using availability
[D] 19:44:36.96 0x7fa801d81d00 () - updating model using availability internal
[D] 19:44:36.97 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:36.97 0x7fa801d81d00 () - service refresh status, new state: idle
[D] 19:44:36.97 0x7fa801d81d00 () - service state changed: busy => idle
[D] 19:44:36.97 0x7fa801d81d00 () - service refresh status, new state: idle
[D] 19:44:36.97 0x7fa801d81d00 () - service state changed: busy => idle
[D] 19:44:36.97 0x7fa801d81d00 () - scan cuda: true
[D] 19:44:36.97 0x7fa801d81d00 () - scan hip: true
[D] 19:44:36.97 0x7fa801d81d00 () - scan cuda: true
[D] 19:44:36.97 0x7fa801d81d00 () - scan hip: true
[D] 19:44:36.97 0x7fa801d81d00 () - scan opencl: true false
[D] 19:44:36.97 0x7fa801d81d00 add_cuda_devices:281 - scanning for cuda devices
[D] 19:44:36.97 0x7fa801d81d00 () - scan opencl: true false
[D] 19:44:36.97 0x7fa801d81d00 add_cuda_devices:281 - scanning for cuda devices
[W] 19:44:36.97 0x7fa801d81d00 cuda_api:168 - failed to open cudart lib: libcudart.so: cannot open shared object file: No such file or directory
[W] 19:44:36.97 0x7fa801d81d00 cuda_api:168 - failed to open cudart lib: libcudart.so: cannot open shared object file: No such file or directory
[D] 19:44:36.97 0x7fa801d81d00 add_hip_devices:318 - scanning for hip devices
[D] 19:44:36.97 0x7fa801d81d00 add_hip_devices:318 - scanning for hip devices
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:327 - hip version: driver=50631062, runtime=50631062
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:336 - hip number of devices: 1
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:327 - hip version: driver=50631062, runtime=50631062
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:336 - hip number of devices: 1
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:345 - hip device: 0, name=AMD Radeon RX 5700 XT, gcn-arch=1030, gcn-arch-name=gfx1030
[D] 19:44:36.104 0x7fa801d81d00 add_hip_devices:345 - hip device: 0, name=AMD Radeon RX 5700 XT, gcn-arch=1030, gcn-arch-name=gfx1030
[D] 19:44:36.104 0x7fa801d81d00 () - service refresh status, new state: idle
[D] 19:44:36.104 0x7fa801d81d00 () - service refresh status, new state: idle
[D] 19:44:36.105 0x7fa801d81d00 () - app service state: busy => idle
[D] 19:44:36.105 0x7fa801d81d00 () - app service state: busy => idle
[W] 19:44:36.109 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:36.109 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:36.109 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:36.109 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:36.109 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:36.109 0x7fa801d81d00 () - no available tts models for out mnt
[W] 19:44:36.109 0x7fa801d81d00 () - invalid task, reseting task state
[W] 19:44:36.109 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:36.109 0x7fa801d81d00 () - no available tts models for out mnt
[W] 19:44:36.109 0x7fa801d81d00 () - invalid task, reseting task state
[D] 19:44:36.109 0x7fa801d81d00 () - app busy: true => false
[D] 19:44:36.109 0x7fa801d81d00 () - app busy: true => false
[D] 19:44:36.110 0x7fa801d81d00 () - stt models changed
[D] 19:44:36.110 0x7fa801d81d00 () - stt models changed
[D] 19:44:36.110 0x7fa801d81d00 () - update listen
[D] 19:44:36.110 0x7fa801d81d00 () - update listen
[D] 19:44:36.110 0x7fa801d81d00 () - tts models changed
[D] 19:44:36.110 0x7fa801d81d00 () - tts models changed
[D] 19:44:36.110 0x7fa801d81d00 () - update listen
[D] 19:44:36.110 0x7fa801d81d00 () - update listen
[W] 19:44:36.110 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:36.110 0x7fa801d81d00 () - no available tts models for out mnt
[W] 19:44:36.110 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:36.110 0x7fa801d81d00 () - no available tts models for out mnt
[D] 19:44:36.110 0x7fa801d81d00 () - ttt models changed
[D] 19:44:36.110 0x7fa801d81d00 () - ttt models changed
[D] 19:44:36.114 0x7fa801d81d00 () - mnt langs changed
[D] 19:44:36.114 0x7fa801d81d00 () - mnt langs changed
[D] 19:44:36.114 0x7fa801d81d00 () - update listen
[D] 19:44:36.114 0x7fa801d81d00 () - update listen
[W] 19:44:36.114 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:36.114 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:36.114 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:36.114 0x7fa801d81d00 () - no available mnt out langs
[D] 19:44:38.106 0x7fa801d81d00 () - stt start listen
[D] 19:44:38.106 0x7fa801d81d00 () - stt start listen
[D] 19:44:38.107 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - default tts model not found: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - default mnt lang not found: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - choosing model for id: "en_whisper_tiny" "en"
[D] 19:44:38.107 0x7fa801d81d00 () - new default mnt lang: "en"
[D] 19:44:38.107 0x7fa801d81d00 () - choosing model for id: "en_whisper_tiny" "en"
[D] 19:44:38.107 0x7fa801d81d00 () - gpu device str: ("ROCm", " 0", " AMD Radeon RX 5700 XT")
[D] 19:44:38.107 0x7fa801d81d00 () - gpu device str: ("ROCm", " 0", " AMD Radeon RX 5700 XT")
[D] 19:44:38.107 0x7fa801d81d00 () - restart stt engine config: "lang=en, lang_code=, model-files=[model-file=/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml, scorer-file=, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0, text-format=raw, options=, use-gpu=1, gpu-device=[id=0, api=rocm, name=AMD Radeon RX 5700 XT, platform-name=], sub-config=[min-segment-dur=4, min-line-length=0, max-line-length=0]"
[D] 19:44:38.107 0x7fa801d81d00 () - new stt engine required
[D] 19:44:38.107 0x7fa801d81d00 () - restart stt engine config: "lang=en, lang_code=, model-files=[model-file=/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml, scorer-file=, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0, text-format=raw, options=, use-gpu=1, gpu-device=[id=0, api=rocm, name=AMD Radeon RX 5700 XT, platform-name=], sub-config=[min-segment-dur=4, min-line-length=0, max-line-length=0]"
[D] 19:44:38.107 0x7fa801d81d00 () - new stt engine required
[D] 19:44:38.107 0x7fa801d81d00 open_whisper_lib:122 - using whisper-hipblas
[D] 19:44:38.107 0x7fa801d81d00 open_whisper_lib:122 - using whisper-hipblas
[D] 19:44:38.109 0x7fa801d81d00 make_wparams:429 - cpu info: arch=x86_64, cores=12
[D] 19:44:38.109 0x7fa801d81d00 make_wparams:429 - cpu info: arch=x86_64, cores=12
[D] 19:44:38.109 0x7fa801d81d00 make_wparams:431 - using threads: 5/12
[D] 19:44:38.109 0x7fa801d81d00 make_wparams:431 - using threads: 5/12
[D] 19:44:38.110 0x7fa801d81d00 make_wparams:433 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 1 | COREML = 0 | OPENVINO = 0 | 
[D] 19:44:38.110 0x7fa801d81d00 make_wparams:433 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 1 | COREML = 0 | OPENVINO = 0 | 
[D] 19:44:38.110 0x7fa801d81d00 start:225 - starting engine
[D] 19:44:38.110 0x7fa801d81d00 start:225 - starting engine
[D] 19:44:38.110 0x7fa801d81d00 start:234 - engine started
[D] 19:44:38.110 0x7fa801d81d00 start:234 - engine started
[D] 19:44:38.110 0x7fa801d81d00 () - creating audio source
[D] 19:44:38.110 0x7fa801d81d00 () - mic source created
[D] 19:44:38.110 0x7fa801d81d00 () - creating audio source
[D] 19:44:38.110 0x7fa801d81d00 () - mic source created
[D] 19:44:38.110 0x7fa507400600 start_processing:271 - processing started
[D] 19:44:38.110 0x7fa507400600 set_processing_state:457 - processing state: idle => initializing
[D] 19:44:38.110 0x7fa507400600 set_processing_state:464 - speech detection status: no-speech => initializing (no-speech)
[D] 19:44:38.110 0x7fa507400600 start_processing:271 - processing started
[D] 19:44:38.110 0x7fa507400600 set_processing_state:457 - processing state: idle => initializing
[D] 19:44:38.110 0x7fa507400600 set_processing_state:464 - speech detection status: no-speech => initializing (no-speech)
[D] 19:44:38.110 0x7fa507400600 () - service refresh status, new state: idle
[D] 19:44:38.110 0x7fa507400600 () - task state changed: 0 => 3
[D] 19:44:38.110 0x7fa507400600 () - service refresh status, new state: idle
[D] 19:44:38.110 0x7fa507400600 () - task state changed: 0 => 3
[D] 19:44:38.110 0x7fa507400600 create_whisper_model:239 - creating whisper model
whisper_init_from_file_with_params_no_state: loading model from '/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml'
[D] 19:44:38.110 0x7fa507400600 create_whisper_model:239 - creating whisper model
whisper_init_from_file_with_params_no_state: loading model from '/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml'
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 384
whisper_model_load: n_audio_head  = 6
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 384
whisper_model_load: n_audio_head  = 6
whisper_model_load: n_audio_layer = 4
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 384
whisper_model_load: n_text_head   = 6
whisper_model_load: n_text_layer  = 4
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 9
whisper_model_load: qntvr         = 2
whisper_model_load: type          = 1 (tiny)
whisper_model_load: n_audio_layer = 4
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 384
whisper_model_load: n_text_head   = 6
whisper_model_load: n_text_layer  = 4
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 9
whisper_model_load: qntvr         = 2
whisper_model_load: type          = 1 (tiny)
whisper_model_load: adding 1607 extra tokens
whisper_model_load: adding 1607 extra tokens
whisper_model_load: n_langs       = 99
whisper_model_load: n_langs       = 99
[D] 19:44:38.385 0x7fa801d81d00 () - using audio input: "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback"
[D] 19:44:38.385 0x7fa801d81d00 () - using audio input: "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback"
[D] 19:44:38.405 0x7fa801d81d00 () - audio state: IdleState
[D] 19:44:38.405 0x7fa801d81d00 () - audio state: IdleState
[D] 19:44:38.405 0x7fa801d81d00 set_speech_started:515 - speech started: false => true
[D] 19:44:38.405 0x7fa801d81d00 set_speech_detection_status:537 - speech detection status: initializing => initializing (speech-detected)
[D] 19:44:38.405 0x7fa801d81d00 set_speech_started:515 - speech started: false => true
[D] 19:44:38.405 0x7fa801d81d00 set_speech_detection_status:537 - speech detection status: initializing => initializing (speech-detected)
[D] 19:44:38.405 0x7fa801d81d00 () - service refresh status, new state: listening-single-sentence
[D] 19:44:38.405 0x7fa801d81d00 () - service state changed: idle => listening-single-sentence
[D] 19:44:38.405 0x7fa801d81d00 () - service refresh status, new state: listening-single-sentence
[D] 19:44:38.405 0x7fa801d81d00 () - service state changed: idle => listening-single-sentence
[W] 19:44:38.406 0x7fa801d81d00 () - ignore TaskStatePropertyChanged signal
[W] 19:44:38.406 0x7fa801d81d00 () - ignore TaskStatePropertyChanged signal
[D] 19:44:38.406 0x7fa801d81d00 () - app current task: -1 => 0
[D] 19:44:38.406 0x7fa801d81d00 () - app current task: -1 => 0
[D] 19:44:38.406 0x7fa801d81d00 () - app speech state: idle => initializing
[D] 19:44:38.406 0x7fa801d81d00 () - app speech state: idle => initializing
[D] 19:44:38.406 0x7fa801d81d00 () - app service state: idle => listening-single-sentence
[D] 19:44:38.406 0x7fa801d81d00 () - app service state: idle => listening-single-sentence
[W] 19:44:38.409 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:38.409 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:38.409 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:38.409 0x7fa801d81d00 () - no available mnt langs
[W] 19:44:38.409 0x7fa801d81d00 () - no available mnt out langs
[W] 19:44:38.409 0x7fa801d81d00 () - no available tts models for in mnt
[W] 19:44:38.409 0x7fa801d81d00 () - no available tts models for out mnt
[W] 19:44:38.409 0x7fa801d81d00 () - no available tts models for out mnt
[D] 19:44:38.628 0x7fa801d81d00 () - mic clear
[D] 19:44:38.628 0x7fa801d81d00 () - mic clear
[D] 19:44:38.628 0x7fa801d81d00 () - audio state: ActiveState
[D] 19:44:38.628 0x7fa801d81d00 () - audio state: ActiveState
[D] 19:44:38.829 0x7fa801d81d00 () - mic clear
[D] 19:44:38.829 0x7fa801d81d00 () - mic clear
[D] 19:44:39.47 0x7fa801d81d00 () - mic clear
[D] 19:44:39.47 0x7fa801d81d00 () - mic clear
ggml_init_cublas: GGML_CUDA_FORCE_MMQ:   no
ggml_init_cublas: CUDA_USE_TENSOR_CORES: yes
ggml_init_cublas: found 1 ROCm devices:
  Device 0: AMD Radeon RX 5700 XT, compute capability 10.3, VMM: no
ggml_init_cublas: GGML_CUDA_FORCE_MMQ:   no
ggml_init_cublas: CUDA_USE_TENSOR_CORES: yes
ggml_init_cublas: found 1 ROCm devices:
  Device 0: AMD Radeon RX 5700 XT, compute capability 10.3, VMM: no
[D] 19:44:39.246 0x7fa801d81d00 () - mic clear
[D] 19:44:39.246 0x7fa801d81d00 () - mic clear
whisper_backend_init: using CUDA backend
whisper_backend_init: using CUDA backend
whisper_model_load:     ROCm buffer size =    31.64 MB
whisper_model_load:     ROCm buffer size =    31.64 MB
whisper_model_load: model size    =   31.57 MB
whisper_model_load: model size    =   31.57 MB
whisper_backend_init: using CUDA backend
whisper_backend_init: using CUDA backend
whisper_init_state: kv self size  =    8.26 MB
whisper_init_state: kv self size  =    8.26 MB
whisper_init_state: kv cross size =    9.22 MB
whisper_init_state: kv cross size =    9.22 MB
whisper_init_state: compute buffer (conv)   =   12.17 MB
whisper_init_state: compute buffer (conv)   =   12.17 MB
whisper_init_state: compute buffer (encode) =   64.92 MB
whisper_init_state: compute buffer (encode) =   64.92 MB
whisper_init_state: compute buffer (cross)  =    4.01 MB
whisper_init_state: compute buffer (cross)  =    4.01 MB
whisper_init_state: compute buffer (decode) =   96.02 MB
whisper_init_state: compute buffer (decode) =   96.02 MB
[D] 19:44:39.420 0x7fa507400600 create_whisper_model:249 - whisper model created
[D] 19:44:39.420 0x7fa507400600 set_processing_state:457 - processing state: initializing => idle
[D] 19:44:39.420 0x7fa507400600 create_whisper_model:249 - whisper model created
[D] 19:44:39.420 0x7fa507400600 set_processing_state:457 - processing state: initializing => idle
[D] 19:44:39.420 0x7fa507400600 set_processing_state:464 - speech detection status: initializing => speech-detected (speech-detected)
[D] 19:44:39.420 0x7fa507400600 set_processing_state:464 - speech detection status: initializing => speech-detected (speech-detected)
[D] 19:44:39.421 0x7fa507400600 () - service refresh status, new state: listening-single-sentence
[D] 19:44:39.421 0x7fa507400600 () - service refresh status, new state: listening-single-sentence
[D] 19:44:39.421 0x7fa507400600 () - task state changed: 3 => 1
[D] 19:44:39.421 0x7fa507400600 () - task state changed: 3 => 1
[D] 19:44:39.421 0x7fa801d81d00 () - app task state: initializing => speech-detected
[D] 19:44:39.421 0x7fa801d81d00 () - app task state: initializing => speech-detected
[D] 19:44:40.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=0, sof=true, eof=false
[D] 19:44:40.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=0, sof=true, eof=false
[D] 19:44:40.919 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:40.919 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:42.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=24000, sof=false, eof=false
[D] 19:44:42.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=24000, sof=false, eof=false
[D] 19:44:42.319 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:42.319 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:43.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=47040, sof=false, eof=false
[D] 19:44:43.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=47040, sof=false, eof=false
[D] 19:44:43.902 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:43.902 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:45.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=60960, sof=false, eof=false
[D] 19:44:45.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=60960, sof=false, eof=false
[D] 19:44:45.304 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:45.304 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:46.871 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=84960, sof=false, eof=false
[D] 19:44:46.871 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=84960, sof=false, eof=false
[D] 19:44:46.898 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:46.898 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:48.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=108960, sof=false, eof=false
[D] 19:44:48.270 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=108960, sof=false, eof=false
[D] 19:44:48.298 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:48.298 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:49.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=125760, sof=false, eof=false
[D] 19:44:49.870 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=125760, sof=false, eof=false
[D] 19:44:49.907 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:49.907 0x7fa507400600 process_buff:279 - vad: speech detected
[D] 19:44:50.248 0x7fa801d81d00 () - stt stop listen
[D] 19:44:50.248 0x7fa801d81d00 () - stt stop listen
[D] 19:44:50.248 0x7fa801d81d00 () - stop stt engine gracefully
[D] 19:44:50.248 0x7fa801d81d00 set_speech_started:515 - speech started: true => false
[D] 19:44:50.248 0x7fa801d81d00 set_speech_detection_status:537 - speech detection status: speech-detected => no-speech (no-speech)
[D] 19:44:50.248 0x7fa801d81d00 () - stop stt engine gracefully
[D] 19:44:50.248 0x7fa801d81d00 set_speech_started:515 - speech started: true => false
[D] 19:44:50.248 0x7fa801d81d00 set_speech_detection_status:537 - speech detection status: speech-detected => no-speech (no-speech)
[D] 19:44:50.248 0x7fa801d81d00 () - service refresh status, new state: listening-single-sentence
[D] 19:44:50.248 0x7fa801d81d00 () - service refresh status, new state: listening-single-sentence
[D] 19:44:50.248 0x7fa801d81d00 () - task state changed: 1 => 0
[D] 19:44:50.248 0x7fa801d81d00 () - task state changed: 1 => 0
[D] 19:44:50.248 0x7fa801d81d00 () - mic source stop
[D] 19:44:50.248 0x7fa801d81d00 () - mic source stop
[D] 19:44:50.248 0x7fa801d81d00 () - audio state: SuspendedState
[D] 19:44:50.248 0x7fa801d81d00 () - audio state: SuspendedState
[D] 19:44:50.248 0x7fa801d81d00 () - audio ended
[D] 19:44:50.248 0x7fa801d81d00 () - audio ended
[D] 19:44:50.248 0x7fa801d81d00 () - app task state: speech-detected => idle
[D] 19:44:50.248 0x7fa801d81d00 () - app task state: speech-detected => idle
[D] 19:44:50.270 0x7fa801d81d00 () - audio state: ActiveState
[D] 19:44:50.270 0x7fa801d81d00 () - audio ended
[D] 19:44:50.270 0x7fa801d81d00 () - audio state: ActiveState
[D] 19:44:50.270 0x7fa801d81d00 () - audio ended
[D] 19:44:50.470 0x7fa801d81d00 () - mic source stop
[D] 19:44:50.470 0x7fa801d81d00 () - mic source stop
[D] 19:44:50.470 0x7fa801d81d00 () - audio state: SuspendedState
[D] 19:44:50.470 0x7fa801d81d00 () - audio ended
[D] 19:44:50.470 0x7fa801d81d00 () - audio state: SuspendedState
[D] 19:44:50.470 0x7fa801d81d00 () - audio ended
[D] 19:44:50.471 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=8000, speech-buf size=140640, sof=false, eof=true
[D] 19:44:50.471 0x7fa507400600 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=8000, speech-buf size=140640, sof=false, eof=true
[D] 19:44:50.480 0x7fa507400600 process_buff:294 - vad: no speech
[D] 19:44:50.480 0x7fa507400600 set_processing_state:457 - processing state: idle => decoding
[D] 19:44:50.480 0x7fa507400600 set_processing_state:464 - speech detection status: no-speech => decoding (no-speech)
[D] 19:44:50.480 0x7fa507400600 process_buff:294 - vad: no speech
[D] 19:44:50.480 0x7fa507400600 set_processing_state:457 - processing state: idle => decoding
[D] 19:44:50.480 0x7fa507400600 set_processing_state:464 - speech detection status: no-speech => decoding (no-speech)
[D] 19:44:50.480 0x7fa507400600 () - service refresh status, new state: listening-single-sentence
[D] 19:44:50.480 0x7fa507400600 () - task state changed: 0 => 2
[D] 19:44:50.480 0x7fa507400600 () - service refresh status, new state: listening-single-sentence
[D] 19:44:50.480 0x7fa507400600 () - task state changed: 0 => 2
[D] 19:44:50.480 0x7fa507400600 process_buff:362 - speech frame: samples=140640
[D] 19:44:50.480 0x7fa507400600 decode_speech:439 - speech decoding started
[D] 19:44:50.480 0x7fa507400600 process_buff:362 - speech frame: samples=140640
[D] 19:44:50.480 0x7fa507400600 decode_speech:439 - speech decoding started
[D] 19:44:50.480 0x7fa801d81d00 () - app task state: idle => processing
[D] 19:44:50.480 0x7fa801d81d00 () - app task state: idle => processing
[D] 19:44:50.683 0x7fa801d81d00 () - audio source ended successfuly
[D] 19:44:50.683 0x7fa801d81d00 () - audio source ended successfuly

** (dsnote:2): WARNING **: 19:44:58.395: atk-bridge: get_device_events_reply: unknown signature

** (dsnote:2): WARNING **: 19:44:58.395: atk-bridge: get_device_events_reply: unknown signature
[D] 19:46:05.946 0x7fa801d81d00 () - exiting
[D] 19:46:05.946 0x7fa801d81d00 () - exiting
mkiol commented 6 months ago

There is a new version 4.5.0 released with newer ROCm... but don't install it! I haven't updated the Flatpak GPU add-on yet.

I will let you know when it is ready for download.

mediocretwo commented 6 months ago

Ok, will that fix the issue I am facing?

mkiol commented 6 months ago

Maybe. This new version comes with updated ROCm libraries, so there is a chance that something will improve.

Problem with the add-ons is resolved, so please update both net.mkiol.SpeechNote (v4.5.0) and net.mkiol.SpeechNote.Addon.amd (v1.1.0).

mediocretwo commented 6 months ago

OK, I have tried again after updating both the app and AMD addon. Still the same result unfortunately. The GPU usage goes to 99% and the app is stuck in the processing state for ages. The cancel button still does not work, the only way to stop the loop is to close the app. Attaching the log again:

QSocketNotifier: Can only be used with threads started with QThread
QSocketNotifier: Can only be used with threads started with QThread
qt.qpa.qgnomeplatform: Could not find color scheme  ""
qt.qpa.qgnomeplatform: Could not find color scheme  ""
[I] 07:37:12.140727893.140 0x7effe8f13d00 init:49 - logging to stderr enabled
[I] 07:37:12.140727893.140 0x7effe8f13d00 init:49 - logging to stderr enabled
[D] 07:37:12.140785713.140 0x7effe8f13d00 () - version: 4.5.0
[D] 07:37:12.140785713.140 0x7effe8f13d00 () - version: 4.5.0
[D] 07:37:12.141125794.141 0x7effe8f13d00 parse_cpuinfo:117 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sev sev_es
[D] 07:37:12.141125794.141 0x7effe8f13d00 parse_cpuinfo:117 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sev sev_es
[D] 07:37:12.141397484.141 0x7effe8f13d00 parse_cpuinfo:125 - cpuinfo: processor-count=12, flags=[avx, avx2, fma, f16c, ]
[D] 07:37:12.141397484.141 0x7effe8f13d00 parse_cpuinfo:125 - cpuinfo: processor-count=12, flags=[avx, avx2, fma, f16c, ]
[D] 07:37:12.141434774.141 0x7effe8f13d00 () - translation: "en_US"
[W] 07:37:12.141439104.141 0x7effe8f13d00 () - failed to install translation
[D] 07:37:12.141442644.141 0x7effe8f13d00 () - starting standalone app
[D] 07:37:12.141434774.141 0x7effe8f13d00 () - translation: "en_US"
[W] 07:37:12.141439104.141 0x7effe8f13d00 () - failed to install translation
[D] 07:37:12.141442644.141 0x7effe8f13d00 () - starting standalone app
[D] 07:37:12.142127376.142 0x7effe8f13d00 () - app: net.mkiol dsnote
[D] 07:37:12.142134636.142 0x7effe8f13d00 () - config location: "/home/asish/.var/app/net.mkiol.SpeechNote/config"
[D] 07:37:12.142138756.142 0x7effe8f13d00 () - data location: "/home/asish/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 07:37:12.142127376.142 0x7effe8f13d00 () - app: net.mkiol dsnote
[D] 07:37:12.142134636.142 0x7effe8f13d00 () - config location: "/home/asish/.var/app/net.mkiol.SpeechNote/config"
[D] 07:37:12.142138756.142 0x7effe8f13d00 () - data location: "/home/asish/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 07:37:12.142142316.142 0x7effe8f13d00 () - cache location: "/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 07:37:12.142145436.142 0x7effe8f13d00 () - settings file: "/home/asish/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 07:37:12.142148376.142 0x7effe8f13d00 () - platform: "wayland"
[D] 07:37:12.142157436.142 0x7effe8f13d00 () - amd addon exists
[D] 07:37:12.142142316.142 0x7effe8f13d00 () - cache location: "/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 07:37:12.142145436.142 0x7effe8f13d00 () - settings file: "/home/asish/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 07:37:12.142148376.142 0x7effe8f13d00 () - platform: "wayland"
[D] 07:37:12.142157436.142 0x7effe8f13d00 () - amd addon exists
[D] 07:37:12.142168676.142 0x7effe8f13d00 () - enforcing num threads: 0
[D] 07:37:12.142168676.142 0x7effe8f13d00 () - enforcing num threads: 0
[D] 07:37:12.158521307.158 0x7effe8f13d00 () - starting service: app-standalone
[D] 07:37:12.158521307.158 0x7effe8f13d00 () - starting service: app-standalone
[D] 07:37:12.160529142.160 0x7effe8f13d00 () - mbrola dir: "/app/bin"
[D] 07:37:12.160529142.160 0x7effe8f13d00 () - mbrola dir: "/app/bin"
[D] 07:37:12.160549582.160 0x7effe8f13d00 () - espeak dir: "/app/bin"
[D] 07:37:12.160565452.160 0x7effe8f13d00 () - overrided gpu version: "10.3.0"
[D] 07:37:12.160573712.160 0x7effe8f13d00 () - HSA_OVERRIDE_GFX_VERSION: 10.3.0
[D] 07:37:12.160549582.160 0x7effe8f13d00 () - espeak dir: "/app/bin"
[D] 07:37:12.160565452.160 0x7effe8f13d00 () - overrided gpu version: "10.3.0"
[D] 07:37:12.160573712.160 0x7effe8f13d00 () - HSA_OVERRIDE_GFX_VERSION: 10.3.0
[D] 07:37:12.160686412.160 0x7effd9600680 loop:88 - py executor loop started
[D] 07:37:12.160686412.160 0x7effd9600680 loop:88 - py executor loop started
[D] 07:37:12.160709062.160 0x7effd9600680 set_env:84 - set env: PYTHONIOENCODING = utf-8
[D] 07:37:12.160716562.160 0x7effd9600680 set_env:84 - set env: HF_HUB_DISABLE_TELEMETRY = 1
[D] 07:37:12.160709062.160 0x7effd9600680 set_env:84 - set env: PYTHONIOENCODING = utf-8
[D] 07:37:12.160716562.160 0x7effd9600680 set_env:84 - set env: HF_HUB_DISABLE_TELEMETRY = 1
[D] 07:37:12.160722972.160 0x7effd9600680 set_env:84 - set env: HF_HUB_OFFLINE = 1
[D] 07:37:12.160722972.160 0x7effd9600680 set_env:84 - set env: HF_HUB_OFFLINE = 1
[D] 07:37:12.160729272.160 0x7effd9600680 set_env:84 - set env: HF_HUB_LOCAL_DIR_AUTO_SYMLINK_THRESHOLD = 100000000000
[D] 07:37:12.160729272.160 0x7effd9600680 set_env:84 - set env: HF_HUB_LOCAL_DIR_AUTO_SYMLINK_THRESHOLD = 100000000000
[D] 07:37:12.160780992.160 0x7effd9600680 set_env:84 - set env: HF_HUB_CACHE = /home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote
[D] 07:37:12.160780992.160 0x7effd9600680 set_env:84 - set env: HF_HUB_CACHE = /home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote
[D] 07:37:12.163047698.163 0x7effe8f13d00 () - module already unpacked: "rhvoicedata"
[D] 07:37:12.163047698.163 0x7effe8f13d00 () - module already unpacked: "rhvoicedata"
[D] 07:37:12.163075278.163 0x7effe8f13d00 () - module already unpacked: "rhvoiceconfig"
[D] 07:37:12.163075278.163 0x7effe8f13d00 () - module already unpacked: "rhvoiceconfig"
[D] 07:37:12.165933785.165 0x7effe8f13d00 () - module already unpacked: "espeakdata"
[D] 07:37:12.165933785.165 0x7effe8f13d00 () - module already unpacked: "espeakdata"
[D] 07:37:12.166106166.166 0x7effe8f13d00 () - default stt model not found: "en_whisper_tiny"
[D] 07:37:12.166112136.166 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:12.166115686.166 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:12.166106166.166 0x7effe8f13d00 () - default stt model not found: "en_whisper_tiny"
[D] 07:37:12.166112136.166 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:12.166115686.166 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:12.166118296.166 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:12.166122716.166 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:12.166125576.166 0x7effe8f13d00 () - service state changed: unknown => busy
[D] 07:37:12.166128376.166 0x7effe8f13d00 () - delaying features availability
[D] 07:37:12.166118296.166 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:12.166122716.166 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:12.166125576.166 0x7effe8f13d00 () - service state changed: unknown => busy
[D] 07:37:12.166128376.166 0x7effe8f13d00 () - delaying features availability
[D] 07:37:12.167415419.167 0x7effe8f13d00 () - runtime prefix: "/app"
[D] 07:37:12.167415419.167 0x7effe8f13d00 () - runtime prefix: "/app"
[D] 07:37:12.167638509.167 0x7effe8f13d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 07:37:12.167638509.167 0x7effe8f13d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 07:37:12.167698029.167 0x7effe8f13d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 07:37:12.167707150.167 0x7effe8f13d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 07:37:12.167698029.167 0x7effe8f13d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 07:37:12.167707150.167 0x7effe8f13d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 07:37:12.167710800.167 0x7effe8f13d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 07:37:12.167710800.167 0x7effe8f13d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 07:37:12.167978200.167 0x7effe8f13d00 () - switching to style: "Plasma"
[D] 07:37:12.167978200.167 0x7effe8f13d00 () - switching to style: "Plasma"
[D] 07:37:12.168154381.168 0x7effe8f13d00 () - desktop file: "net.mkiol.SpeechNote"
[D] 07:37:12.168154381.168 0x7effe8f13d00 () - desktop file: "net.mkiol.SpeechNote"
[D] 07:37:12.170300696.170 0x7effd9600680 libs_availability:62 - checking: torch cuda
[D] 07:37:12.170300696.170 0x7effd9600680 libs_availability:62 - checking: torch cuda
[D] 07:37:12.171443909.171 0x7effda000680 () - config version: 81 81
[D] 07:37:12.171443909.171 0x7effda000680 () - config version: 81 81
[W] 07:37:12.193188303.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193188303.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193291963.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193291963.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193546574.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193552344.193 0x7effcfe00680 () - QObject::installEventFilter(): Cannot filter events for objects in a different thread.
[W] 07:37:12.193546574.193 0x7effcfe00680 () - QObject: Cannot create children for a parent that is in a different thread.
(Parent is QApplication(0x7ffc16390a50), parent's thread is QThread(0x56070b62e6c0), current thread is QThread(0x56070ba1fc90)
[W] 07:37:12.193552344.193 0x7effcfe00680 () - QObject::installEventFilter(): Cannot filter events for objects in a different thread.
[D] 07:37:12.200683111.200 0x7effda000680 () - models changed
[D] 07:37:12.200683111.200 0x7effda000680 () - models changed
[D] 07:37:12.853214100.853 0x7effe8f13d00 state_pa_callback:30 - pa authorizing
[D] 07:37:12.853214100.853 0x7effe8f13d00 state_pa_callback:30 - pa authorizing
[D] 07:37:12.853432140.853 0x7effe8f13d00 state_pa_callback:33 - pa setting name
[D] 07:37:12.853432140.853 0x7effe8f13d00 state_pa_callback:33 - pa setting name
[D] 07:37:12.855827826.855 0x7effe8f13d00 state_pa_callback:36 - pa ready
[D] 07:37:12.855827826.855 0x7effe8f13d00 state_pa_callback:36 - pa ready
[D] 07:37:12.856329047.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.pci-0000_28_00.1.hdmi-stereo-extra2.monitor Monitor of Navi 10 HDMI Audio Digital Stereo (HDMI 3)
[D] 07:37:12.856329047.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.pci-0000_28_00.1.hdmi-stereo-extra2.monitor Monitor of Navi 10 HDMI Audio Digital Stereo (HDMI 3)
[D] 07:37:12.856394497.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.pci-0000_2a_00.4.analog-stereo.monitor Monitor of Starship/Matisse HD Audio Controller Analog Stereo
[D] 07:37:12.856394497.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.pci-0000_2a_00.4.analog-stereo.monitor Monitor of Starship/Matisse HD Audio Controller Analog Stereo
[D] 07:37:12.856451088.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.pci-0000_2a_00.4.analog-stereo Starship/Matisse HD Audio Controller Analog Stereo
[D] 07:37:12.856451088.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.pci-0000_2a_00.4.analog-stereo Starship/Matisse HD Audio Controller Analog Stereo
[D] 07:37:12.856503648.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.analog-stereo.monitor Monitor of HyperX Cloud Stinger 2 Wireless Analog Stereo
[D] 07:37:12.856503648.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_output.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.analog-stereo.monitor Monitor of HyperX Cloud Stinger 2 Wireless Analog Stereo
[D] 07:37:12.856554768.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:12.856554768.856 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:12.863979786.863 0x7effe8f13d00 () - starting app: app-standalone
[D] 07:37:12.863979786.863 0x7effe8f13d00 () - starting app: app-standalone
[D] 07:37:12.864285167.864 0x7effe8f13d00 () - app service state: unknown => busy
[D] 07:37:12.864291517.864 0x7effe8f13d00 () - app busy: false => true
[D] 07:37:12.864285167.864 0x7effe8f13d00 () - app service state: unknown => busy
[D] 07:37:12.864291517.864 0x7effe8f13d00 () - app busy: false => true
[D] 07:37:12.864295787.864 0x7effe8f13d00 () - app connected: false  = >  true
[D] 07:37:12.864295787.864 0x7effe8f13d00 () - app connected: false  = >  true
[W] 07:37:12.864435757.864 0x7effe8f13d00 () - hot keys are supported only under x11
[W] 07:37:12.864435757.864 0x7effe8f13d00 () - hot keys are supported only under x11
logger error: invalid format string
qrc:/qml/main.qml:372:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:372:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:363:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:363:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:30:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:30:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[D] 07:37:13.296077818.296 0x7effe8f13d00 onCompleted:168 - default font pixel size: 14
[D] 07:37:13.296077818.296 0x7effe8f13d00 onCompleted:168 - default font pixel size: 14
[D] 07:37:13.302706694.302 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:13.302720884.302 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:13.302724054.302 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:13.302728944.302 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:13.302734704.302 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:13.302706694.302 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:13.302720884.302 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:13.302724054.302 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:13.302728944.302 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:13.302734704.302 0x7effe8f13d00 () - service refresh status, new state: busy
[D] 07:37:13.323632716.323 0x7effe8f13d00 () - trying features availability update: false
[D] 07:37:13.323632716.323 0x7effe8f13d00 () - trying features availability update: false
[W] 07:37:13.351579835.351 0x7effe8f13d00 ():228 - qrc:/qml/Translator.qml:228:9: QML ColumnLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "preferredWidth"
[W] 07:37:13.351579835.351 0x7effe8f13d00 ():228 - qrc:/qml/Translator.qml:228:9: QML ColumnLayout (parent or ancestor of QQuickLayoutAttached): Binding loop detected for property "preferredWidth"
[D] 07:37:13.386318492.386 0x7effd9600680 libs_availability:70 - checking: coqui tts
[D] 07:37:13.386318492.386 0x7effd9600680 libs_availability:70 - checking: coqui tts
[D] 07:37:13.386706482.386 0x7effd9600680 libs_availability:78 - checking: whisperspeech tts
[D] 07:37:13.386706482.386 0x7effd9600680 libs_availability:78 - checking: whisperspeech tts
[D] 07:37:13.386863473.386 0x7effd9600680 libs_availability:86 - checking: faster-whisper
[D] 07:37:13.386863473.386 0x7effd9600680 libs_availability:86 - checking: faster-whisper
[D] 07:37:13.460807036.460 0x7effe8f13d00 () - stt models changed
[D] 07:37:13.460807036.460 0x7effe8f13d00 () - stt models changed
[D] 07:37:13.461013547.461 0x7effe8f13d00 () - update listen
[D] 07:37:13.461022207.461 0x7effe8f13d00 () - app stt configured: false => true
[D] 07:37:13.461013547.461 0x7effe8f13d00 () - update listen
[D] 07:37:13.461022207.461 0x7effe8f13d00 () - app stt configured: false => true
[D] 07:37:13.464035334.464 0x7effe8f13d00 () - app active stt model: "" => "en_whisper_tiny"
[D] 07:37:13.464035334.464 0x7effe8f13d00 () - app active stt model: "" => "en_whisper_tiny"
[D] 07:37:13.464169935.464 0x7effe8f13d00 () - update listen
[D] 07:37:13.464177165.464 0x7effe8f13d00 () - tts models changed
[D] 07:37:13.464169935.464 0x7effe8f13d00 () - update listen
[D] 07:37:13.464177165.464 0x7effe8f13d00 () - tts models changed
[D] 07:37:13.464247215.464 0x7effe8f13d00 () - update listen
[W] 07:37:13.464254865.464 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:13.464257735.464 0x7effe8f13d00 () - no available tts models for out mnt
[D] 07:37:13.464260325.464 0x7effe8f13d00 () - ttt models changed
[D] 07:37:13.464247215.464 0x7effe8f13d00 () - update listen
[W] 07:37:13.464254865.464 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:13.464257735.464 0x7effe8f13d00 () - no available tts models for out mnt
[D] 07:37:13.464260325.464 0x7effe8f13d00 () - ttt models changed
[D] 07:37:13.467427713.467 0x7effe8f13d00 () - mnt langs changed
[D] 07:37:13.467427713.467 0x7effe8f13d00 () - mnt langs changed
[D] 07:37:13.467456833.467 0x7effe8f13d00 () - update listen
[W] 07:37:13.467461523.467 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:13.467464443.467 0x7effe8f13d00 () - no available mnt out langs
[D] 07:37:13.467456833.467 0x7effe8f13d00 () - update listen
[W] 07:37:13.467461523.467 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:13.467464443.467 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:13.475001411.475 0x7effe8f13d00 void KDirWatchPrivate::addEntry(KDirWatch*, const QString&, Entry*, bool, KDirWatch::WatchModes):797 - Cannot watch QRC-like path ":/icons/hicolor/index.theme"
[W] 07:37:13.475001411.475 0x7effe8f13d00 void KDirWatchPrivate::addEntry(KDirWatch*, const QString&, Entry*, bool, KDirWatch::WatchModes):797 - Cannot watch QRC-like path ":/icons/hicolor/index.theme"
[D] 07:37:13.766040953.766 0x7effd9600680 libs_availability:94 - checking: transformers
[D] 07:37:13.766061423.766 0x7effd9600680 libs_availability:96 - checking: accelerate
[D] 07:37:13.766040953.766 0x7effd9600680 libs_availability:94 - checking: transformers
[D] 07:37:13.766061423.766 0x7effd9600680 libs_availability:96 - checking: accelerate
[D] 07:37:14.525647517.525 0x7effd9600680 libs_availability:104 - checking: unikud
[D] 07:37:14.525647517.525 0x7effd9600680 libs_availability:104 - checking: unikud
[D] 07:37:14.525848207.525 0x7effd9600680 libs_availability:115 - checking: mimic3 tts
[D] 07:37:14.525848207.525 0x7effd9600680 libs_availability:115 - checking: mimic3 tts
[D] 07:37:14.815375815.815 0x7effe8f13d00 () - trying features availability update: false
[D] 07:37:14.815375815.815 0x7effe8f13d00 () - trying features availability update: false
[D] 07:37:15.068329663.68 0x7effd9600680 libs_availability:123 - checking: gruut
[D] 07:37:15.068351853.68 0x7effd9600680 libs_availability:127 - checking: gruut-de
[D] 07:37:15.068329663.68 0x7effd9600680 libs_availability:123 - checking: gruut
[D] 07:37:15.068351853.68 0x7effd9600680 libs_availability:127 - checking: gruut-de
[D] 07:37:15.068539973.68 0x7effd9600680 libs_availability:135 - checking: gruut-es
[D] 07:37:15.068539973.68 0x7effd9600680 libs_availability:135 - checking: gruut-es
[D] 07:37:15.068714384.68 0x7effd9600680 libs_availability:143 - checking: gruut-fr
[D] 07:37:15.068714384.68 0x7effd9600680 libs_availability:143 - checking: gruut-fr
[D] 07:37:15.068920814.68 0x7effd9600680 libs_availability:151 - checking: gruut-it
[D] 07:37:15.068920814.68 0x7effd9600680 libs_availability:151 - checking: gruut-it
[D] 07:37:15.069085535.69 0x7effd9600680 libs_availability:159 - checking: gruut-ru
[D] 07:37:15.069085535.69 0x7effd9600680 libs_availability:159 - checking: gruut-ru
[D] 07:37:15.069245925.69 0x7effd9600680 libs_availability:167 - checking: gruut-fa
[D] 07:37:15.069245925.69 0x7effd9600680 libs_availability:167 - checking: gruut-fa
[D] 07:37:15.069406145.69 0x7effd9600680 libs_availability:175 - checking: gruut-sw
[D] 07:37:15.069406145.69 0x7effd9600680 libs_availability:175 - checking: gruut-sw
[D] 07:37:15.069574466.69 0x7effd9600680 libs_availability:183 - checking: gruut-nl
[D] 07:37:15.069574466.69 0x7effd9600680 libs_availability:183 - checking: gruut-nl
[D] 07:37:15.069766886.69 0x7effd9600680 libs_availability:194 - checking: mecab
[D] 07:37:15.069766886.69 0x7effd9600680 libs_availability:194 - checking: mecab
[D] 07:37:15.071600781.71 0x7effd9600680 libs_availability:196 - checking: unidic-lite
[D] 07:37:15.071600781.71 0x7effd9600680 libs_availability:196 - checking: unidic-lite
[D] 07:37:15.071949382.71 0x7effd9600680 libs_availability:203 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, whisperspeech-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=true]
[D] 07:37:15.071949382.71 0x7effd9600680 libs_availability:203 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, whisperspeech-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=true]
[D] 07:37:15.811620496.811 0x7effe8f13d00 () - trying features availability update: true
[D] 07:37:15.811636706.811 0x7effe8f13d00 () - features availability ready
[D] 07:37:15.811620496.811 0x7effe8f13d00 () - trying features availability update: true
[D] 07:37:15.811636706.811 0x7effe8f13d00 () - features availability ready
[W] 07:37:15.811837707.811 0x7effe8f13d00 has_lib:683 - failed to open libcudart.so: libcudart.so: cannot open shared object file: No such file or directory
[W] 07:37:15.811837707.811 0x7effe8f13d00 has_lib:683 - failed to open libcudart.so: libcudart.so: cannot open shared object file: No such file or directory
[W] 07:37:15.811899357.811 0x7effe8f13d00 has_lib:683 - failed to open libcudnn.so: libcudnn.so: cannot open shared object file: No such file or directory
[W] 07:37:15.811899357.811 0x7effe8f13d00 has_lib:683 - failed to open libcudnn.so: libcudnn.so: cannot open shared object file: No such file or directory
[W] 07:37:15.811951607.811 0x7effe8f13d00 has_lib:683 - failed to open libcudnn.so.8: libcudnn.so.8: cannot open shared object file: No such file or directory
[W] 07:37:15.811951607.811 0x7effe8f13d00 has_lib:683 - failed to open libcudnn.so.8: libcudnn.so.8: cannot open shared object file: No such file or directory
[W] 07:37:15.835088624.835 0x7effe8f13d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[W] 07:37:15.835088624.835 0x7effe8f13d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[W] 07:37:15.835226724.835 0x7effe8f13d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[W] 07:37:15.835226724.835 0x7effe8f13d00 has_cuda:56 - failed to open whisper-cublas lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[D] 07:37:15.858303662.858 0x7effe8f13d00 () - updating models using availability: tts_coqui, tts_mimic3, tts_mimic3_de, tts_mimic3_es, tts_mimic3_fr, tts_mimic3_it, tts_mimic3_ru, tts_mimic3_sw, tts_mimic3_fa, tts_mimic3_nl, tts_rhvoice, tts_whisperspeech, stt_fasterwhisper, stt_ds, stt_vosk, mnt_bergamot, ttt_hftc option_r,
[D] 07:37:15.858303662.858 0x7effe8f13d00 () - updating models using availability: tts_coqui, tts_mimic3, tts_mimic3_de, tts_mimic3_es, tts_mimic3_fr, tts_mimic3_it, tts_mimic3_ru, tts_mimic3_sw, tts_mimic3_fa, tts_mimic3_nl, tts_rhvoice, tts_whisperspeech, stt_fasterwhisper, stt_ds, stt_vosk, mnt_bergamot, ttt_hftc option_r,
[D] 07:37:15.858365472.858 0x7effe8f13d00 () - updating model using availability internal
[D] 07:37:15.858365472.858 0x7effe8f13d00 () - updating model using availability internal
[D] 07:37:15.860233897.860 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:15.860233897.860 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:15.860273647.860 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:15.860273647.860 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:15.860299927.860 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:15.860299927.860 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:15.860361487.860 0x7effe8f13d00 () - service refresh status, new state: idle
[D] 07:37:15.860361487.860 0x7effe8f13d00 () - service refresh status, new state: idle
[D] 07:37:15.860386467.860 0x7effe8f13d00 () - service state changed: busy => idle
[D] 07:37:15.860386467.860 0x7effe8f13d00 () - service state changed: busy => idle
[D] 07:37:15.860422737.860 0x7effe8f13d00 () - scan cuda: true
[D] 07:37:15.860422737.860 0x7effe8f13d00 () - scan cuda: true
[D] 07:37:15.860447737.860 0x7effe8f13d00 () - scan hip: true
[D] 07:37:15.860447737.860 0x7effe8f13d00 () - scan hip: true
[D] 07:37:15.860491047.860 0x7effe8f13d00 () - scan opencl: true
[D] 07:37:15.860491047.860 0x7effe8f13d00 () - scan opencl: true
[D] 07:37:15.860515047.860 0x7effe8f13d00 () - scan opencl legacy: false
[D] 07:37:15.860515047.860 0x7effe8f13d00 () - scan opencl legacy: false
[D] 07:37:15.860538457.860 0x7effe8f13d00 () - gpu feature flags: stt-whispercpp-hip, stt-whispercpp-opencl, tts-coqui-cuda, tts-whisperspeech-cuda,
[D] 07:37:15.860538457.860 0x7effe8f13d00 () - gpu feature flags: stt-whispercpp-hip, stt-whispercpp-opencl, tts-coqui-cuda, tts-whisperspeech-cuda,
[D] 07:37:15.860565927.860 0x7effe8f13d00 add_hip_devices:523 - scanning for hip devices
[D] 07:37:15.860565927.860 0x7effe8f13d00 add_hip_devices:523 - scanning for hip devices
[D] 07:37:15.867235904.867 0x7effe8f13d00 add_hip_devices:533 - hip version: driver=50731921, runtime=50731921
[D] 07:37:15.867235904.867 0x7effe8f13d00 add_hip_devices:533 - hip version: driver=50731921, runtime=50731921
[D] 07:37:15.867273804.867 0x7effe8f13d00 add_hip_devices:542 - hip number of devices: 1
[D] 07:37:15.867273804.867 0x7effe8f13d00 add_hip_devices:542 - hip number of devices: 1
[D] 07:37:15.867297474.867 0x7effe8f13d00 add_hip_devices:551 - hip device: 0, name=AMD Radeon RX 5700 XT, gcn-arch=1030, gcn-arch-name=gfx1030
[D] 07:37:15.867297474.867 0x7effe8f13d00 add_hip_devices:551 - hip device: 0, name=AMD Radeon RX 5700 XT, gcn-arch=1030, gcn-arch-name=gfx1030
[D] 07:37:15.867424874.867 0x7effe8f13d00 add_cuda_devices:500 - scanning for cuda devices
[D] 07:37:15.867424874.867 0x7effe8f13d00 add_cuda_devices:500 - scanning for cuda devices
[D] 07:37:15.867447854.867 0x7effe8f13d00 add_cuda_dev_devices:403 - scanning for cuda devices
[W] 07:37:15.867565295.867 0x7effe8f13d00 cuda_dev_api:182 - failed to open cuda lib: libcuda.so: cannot open shared object file: No such file or directory
[W] 07:37:15.867590665.867 0x7effe8f13d00 add_cuda_devices:507 - failed to open cuda lib
[D] 07:37:15.867598775.867 0x7effe8f13d00 add_opencl_devices:563 - scanning for opencl devices
[D] 07:37:15.867447854.867 0x7effe8f13d00 add_cuda_dev_devices:403 - scanning for cuda devices
[W] 07:37:15.867565295.867 0x7effe8f13d00 cuda_dev_api:182 - failed to open cuda lib: libcuda.so: cannot open shared object file: No such file or directory
[W] 07:37:15.867590665.867 0x7effe8f13d00 add_cuda_devices:507 - failed to open cuda lib
[D] 07:37:15.867598775.867 0x7effe8f13d00 add_opencl_devices:563 - scanning for opencl devices
[D] 07:37:15.911241363.911 0x7effe8f13d00 add_opencl_devices:580 - opencl number of platforms: 2
[D] 07:37:15.911241363.911 0x7effe8f13d00 add_opencl_devices:580 - opencl number of platforms: 2
[D] 07:37:15.911297373.911 0x7effe8f13d00 add_opencl_devices:605 - opencl platform: 0, name=AMD Accelerated Parallel Processing, vendor=Advanced Micro Devices, Inc.
[D] 07:37:15.911303733.911 0x7effe8f13d00 add_opencl_devices:619 - opencl number of devices: 1
[D] 07:37:15.911307193.911 0x7effe8f13d00 add_opencl_devices:643 - opencl device: 0, platform name=AMD Accelerated Parallel Processing, device name=gfx1030, types=[GPU, ]
[D] 07:37:15.911310333.911 0x7effe8f13d00 add_opencl_devices:605 - opencl platform: 1, name=Clover, vendor=Mesa
[D] 07:37:15.911312513.911 0x7effe8f13d00 add_opencl_devices:619 - opencl number of devices: 1
[D] 07:37:15.911315733.911 0x7effe8f13d00 add_opencl_devices:643 - opencl device: 0, platform name=Clover, device name=AMD Radeon RX 5700 XT (radeonsi, navi10, LLVM 17.0.6, DRM 3.57, 6.8.10-300.fc40.x86_64), types=[GPU, ]
[D] 07:37:15.911505694.911 0x7effe8f13d00 () - service refresh status, new state: idle
[D] 07:37:15.911297373.911 0x7effe8f13d00 add_opencl_devices:605 - opencl platform: 0, name=AMD Accelerated Parallel Processing, vendor=Advanced Micro Devices, Inc.
[D] 07:37:15.911303733.911 0x7effe8f13d00 add_opencl_devices:619 - opencl number of devices: 1
[D] 07:37:15.911307193.911 0x7effe8f13d00 add_opencl_devices:643 - opencl device: 0, platform name=AMD Accelerated Parallel Processing, device name=gfx1030, types=[GPU, ]
[D] 07:37:15.911310333.911 0x7effe8f13d00 add_opencl_devices:605 - opencl platform: 1, name=Clover, vendor=Mesa
[D] 07:37:15.911312513.911 0x7effe8f13d00 add_opencl_devices:619 - opencl number of devices: 1
[D] 07:37:15.911315733.911 0x7effe8f13d00 add_opencl_devices:643 - opencl device: 0, platform name=Clover, device name=AMD Radeon RX 5700 XT (radeonsi, navi10, LLVM 17.0.6, DRM 3.57, 6.8.10-300.fc40.x86_64), types=[GPU, ]
[D] 07:37:15.911505694.911 0x7effe8f13d00 () - service refresh status, new state: idle
[D] 07:37:15.914544021.914 0x7effe8f13d00 () - app service state: busy => idle
[D] 07:37:15.914544021.914 0x7effe8f13d00 () - app service state: busy => idle
[W] 07:37:15.918023840.918 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:15.918032930.918 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:15.918023840.918 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:15.918032930.918 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:15.918036550.918 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:15.918039440.918 0x7effe8f13d00 () - no available tts models for out mnt
[W] 07:37:15.918042160.918 0x7effe8f13d00 () - invalid task, reseting task state
[W] 07:37:15.918036550.918 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:15.918039440.918 0x7effe8f13d00 () - no available tts models for out mnt
[W] 07:37:15.918042160.918 0x7effe8f13d00 () - invalid task, reseting task state
[D] 07:37:15.918090480.918 0x7effe8f13d00 () - app busy: true => false
[D] 07:37:15.918090480.918 0x7effe8f13d00 () - app busy: true => false
[D] 07:37:15.918356131.918 0x7effe8f13d00 () - stt models changed
[D] 07:37:15.918356131.918 0x7effe8f13d00 () - stt models changed
[D] 07:37:15.918404821.918 0x7effe8f13d00 () - update listen
[D] 07:37:15.918404821.918 0x7effe8f13d00 () - update listen
[D] 07:37:15.918418011.918 0x7effe8f13d00 () - tts models changed
[D] 07:37:15.918418011.918 0x7effe8f13d00 () - tts models changed
[D] 07:37:15.918450041.918 0x7effe8f13d00 () - update listen
[D] 07:37:15.918450041.918 0x7effe8f13d00 () - update listen
[W] 07:37:15.918538661.918 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:15.918543531.918 0x7effe8f13d00 () - no available tts models for out mnt
[W] 07:37:15.918538661.918 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:15.918543531.918 0x7effe8f13d00 () - no available tts models for out mnt
[D] 07:37:15.918546291.918 0x7effe8f13d00 () - ttt models changed
[D] 07:37:15.918546291.918 0x7effe8f13d00 () - ttt models changed
[D] 07:37:15.921631919.921 0x7effe8f13d00 () - mnt langs changed
[D] 07:37:15.921631919.921 0x7effe8f13d00 () - mnt langs changed
[D] 07:37:15.921653759.921 0x7effe8f13d00 () - update listen
[D] 07:37:15.921653759.921 0x7effe8f13d00 () - update listen
[W] 07:37:15.921665309.921 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:15.921668799.921 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:15.921665309.921 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:15.921668799.921 0x7effe8f13d00 () - no available mnt out langs
[D] 07:37:17.137418554.137 0x7effe8f13d00 () - stt start listen
[D] 07:37:17.137418554.137 0x7effe8f13d00 () - stt start listen
[D] 07:37:17.138245836.138 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:17.138257266.138 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:17.138245836.138 0x7effe8f13d00 () - default tts model not found: "en"
[D] 07:37:17.138257266.138 0x7effe8f13d00 () - default mnt lang not found: "en"
[D] 07:37:17.138262386.138 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:17.138269926.138 0x7effe8f13d00 () - choosing model for id: "en_whisper_tiny" "en"
[D] 07:37:17.138262386.138 0x7effe8f13d00 () - new default mnt lang: "en"
[D] 07:37:17.138269926.138 0x7effe8f13d00 () - choosing model for id: "en_whisper_tiny" "en"
[D] 07:37:17.138285886.138 0x7effe8f13d00 () - gpu device str: ("ROCm", " 0", " AMD Radeon RX 5700 XT")
[D] 07:37:17.138285886.138 0x7effe8f13d00 () - gpu device str: ("ROCm", " 0", " AMD Radeon RX 5700 XT")
[D] 07:37:17.138315816.138 0x7effe8f13d00 () - restart stt engine config: "lang=en, lang_code=, model-files=[model-file=/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml, scorer-file=, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0, text-format=raw, options=, use-gpu=1, gpu-device=[id=0, api=rocm, name=AMD Radeon RX 5700 XT, platform-name=], sub-config=[min-segment-dur=4, min-line-length=0, max-line-length=0]"
[D] 07:37:17.138321536.138 0x7effe8f13d00 () - new stt engine required
[D] 07:37:17.138315816.138 0x7effe8f13d00 () - restart stt engine config: "lang=en, lang_code=, model-files=[model-file=/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml, scorer-file=, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0, text-format=raw, options=, use-gpu=1, gpu-device=[id=0, api=rocm, name=AMD Radeon RX 5700 XT, platform-name=], sub-config=[min-segment-dur=4, min-line-length=0, max-line-length=0]"
[D] 07:37:17.138321536.138 0x7effe8f13d00 () - new stt engine required
[D] 07:37:17.138400506.138 0x7effe8f13d00 open_whisper_lib:122 - using whisper-hipblas
[D] 07:37:17.138400506.138 0x7effe8f13d00 open_whisper_lib:122 - using whisper-hipblas
[D] 07:37:17.140287571.140 0x7effe8f13d00 make_wparams:429 - cpu info: arch=x86_64, cores=12
[D] 07:37:17.140287571.140 0x7effe8f13d00 make_wparams:429 - cpu info: arch=x86_64, cores=12
[D] 07:37:17.140310881.140 0x7effe8f13d00 make_wparams:431 - using threads: 5/12
[D] 07:37:17.140310881.140 0x7effe8f13d00 make_wparams:431 - using threads: 5/12
[D] 07:37:17.140327401.140 0x7effe8f13d00 make_wparams:433 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 1 | COREML = 0 | OPENVINO = 0 | 
[D] 07:37:17.140327401.140 0x7effe8f13d00 make_wparams:433 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 1 | COREML = 0 | OPENVINO = 0 | 
[D] 07:37:17.140342711.140 0x7effe8f13d00 start:224 - stt start
[D] 07:37:17.140342711.140 0x7effe8f13d00 start:224 - stt start
[D] 07:37:17.140388751.140 0x7effe8f13d00 start:234 - stt start completed
[D] 07:37:17.140388751.140 0x7effe8f13d00 start:234 - stt start completed
[D] 07:37:17.140399111.140 0x7effe8f13d00 () - creating audio source
[D] 07:37:17.140405191.140 0x7effe8f13d00 () - mic source created
[D] 07:37:17.140399111.140 0x7effe8f13d00 () - creating audio source
[D] 07:37:17.140405191.140 0x7effe8f13d00 () - mic source created
[D] 07:37:17.140690202.140 0x7efc49600680 process:283 - stt processing started
[D] 07:37:17.140716252.140 0x7efc49600680 set_state:469 - stt state: idle => initializing
[D] 07:37:17.140690202.140 0x7efc49600680 process:283 - stt processing started
[D] 07:37:17.140716252.140 0x7efc49600680 set_state:469 - stt state: idle => initializing
[D] 07:37:17.140721872.140 0x7efc49600680 set_state:476 - speech detection status: no-speech => initializing (no-speech)
[D] 07:37:17.140733922.140 0x7efc49600680 create_model:239 - creating whisper model
[D] 07:37:17.140721872.140 0x7efc49600680 set_state:476 - speech detection status: no-speech => initializing (no-speech)
[D] 07:37:17.140733922.140 0x7efc49600680 create_model:239 - creating whisper model
whisper_init_from_file_with_params_no_state: loading model from '/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml'
whisper_init_from_file_with_params_no_state: loading model from '/home/asish/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/en_whisper_tiny.ggml'
whisper_model_load: loading model
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 384
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 384
whisper_model_load: n_audio_head  = 6
whisper_model_load: n_audio_layer = 4
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 384
whisper_model_load: n_text_head   = 6
whisper_model_load: n_text_layer  = 4
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 9
whisper_model_load: qntvr         = 2
whisper_model_load: type          = 1 (tiny)
whisper_model_load: n_audio_head  = 6
whisper_model_load: n_audio_layer = 4
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 384
whisper_model_load: n_text_head   = 6
whisper_model_load: n_text_layer  = 4
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 9
whisper_model_load: qntvr         = 2
whisper_model_load: type          = 1 (tiny)
whisper_model_load: adding 1607 extra tokens
whisper_model_load: adding 1607 extra tokens
whisper_model_load: n_langs       = 99
whisper_model_load: n_langs       = 99
ggml_init_cublas: GGML_CUDA_FORCE_MMQ:   no
ggml_init_cublas: CUDA_USE_TENSOR_CORES: yes
ggml_init_cublas: found 1 ROCm devices:
  Device 0: AMD Radeon RX 5700 XT, compute capability 10.3, VMM: no
ggml_init_cublas: GGML_CUDA_FORCE_MMQ:   no
ggml_init_cublas: CUDA_USE_TENSOR_CORES: yes
ggml_init_cublas: found 1 ROCm devices:
  Device 0: AMD Radeon RX 5700 XT, compute capability 10.3, VMM: no
[D] 07:37:17.324612018.324 0x7effe8f13d00 () - using audio input: "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback" (preferred was  "" )
[D] 07:37:17.324612018.324 0x7effe8f13d00 () - using audio input: "alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback" (preferred was  "" )
[D] 07:37:17.347536425.347 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:17.347536425.347 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:17.347818086.347 0x7effe8f13d00 () - audio state: IdleState
[D] 07:37:17.347818086.347 0x7effe8f13d00 () - audio state: IdleState
[D] 07:37:17.347830456.347 0x7effe8f13d00 set_speech_started:527 - speech started: false => true
[D] 07:37:17.347834516.347 0x7effe8f13d00 set_speech_detection_status:549 - speech detection status: initializing => initializing (speech-detected)
[D] 07:37:17.347830456.347 0x7effe8f13d00 set_speech_started:527 - speech started: false => true
[D] 07:37:17.347834516.347 0x7effe8f13d00 set_speech_detection_status:549 - speech detection status: initializing => initializing (speech-detected)
[D] 07:37:17.347871546.347 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:17.347871546.347 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:17.348006436.348 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.348012856.348 0x7effe8f13d00 () - service state changed: idle => listening-single-sentence
[D] 07:37:17.348006436.348 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.348012856.348 0x7effe8f13d00 () - service state changed: idle => listening-single-sentence
[D] 07:37:17.348018986.348 0x7effe8f13d00 () - task state changed: 0 => 3
[D] 07:37:17.348018986.348 0x7effe8f13d00 () - task state changed: 0 => 3
[D] 07:37:17.348602208.348 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.348602208.348 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.348616638.348 0x7effe8f13d00 () - app current task: -1 => 0
[D] 07:37:17.348622678.348 0x7effe8f13d00 () - app task state: idle => initializing
[D] 07:37:17.348616638.348 0x7effe8f13d00 () - app current task: -1 => 0
[D] 07:37:17.348622678.348 0x7effe8f13d00 () - app task state: idle => initializing
[D] 07:37:17.349172749.349 0x7effe8f13d00 () - app service state: idle => listening-single-sentence
[D] 07:37:17.349172749.349 0x7effe8f13d00 () - app service state: idle => listening-single-sentence
[W] 07:37:17.352577237.352 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:17.352587497.352 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:17.352591117.352 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:17.352593687.352 0x7effe8f13d00 () - no available tts models for out mnt
[W] 07:37:17.352577237.352 0x7effe8f13d00 () - no available mnt langs
[W] 07:37:17.352587497.352 0x7effe8f13d00 () - no available mnt out langs
[W] 07:37:17.352591117.352 0x7effe8f13d00 () - no available tts models for in mnt
[W] 07:37:17.352593687.352 0x7effe8f13d00 () - no available tts models for out mnt
whisper_backend_init: using CUDA backend
whisper_backend_init: using CUDA backend
whisper_model_load:     ROCm buffer size =    31.64 MB
whisper_model_load:     ROCm buffer size =    31.64 MB
whisper_model_load: model size    =   31.57 MB
whisper_model_load: model size    =   31.57 MB
whisper_backend_init: using CUDA backend
whisper_backend_init: using CUDA backend
whisper_init_state: kv self size  =    8.26 MB
whisper_init_state: kv self size  =    8.26 MB
whisper_init_state: kv cross size =    9.22 MB
whisper_init_state: kv cross size =    9.22 MB
whisper_init_state: compute buffer (conv)   =   12.17 MB
whisper_init_state: compute buffer (conv)   =   12.17 MB
whisper_init_state: compute buffer (encode) =   64.92 MB
whisper_init_state: compute buffer (encode) =   64.92 MB
whisper_init_state: compute buffer (cross)  =    4.01 MB
whisper_init_state: compute buffer (cross)  =    4.01 MB
whisper_init_state: compute buffer (decode) =   96.02 MB
whisper_init_state: compute buffer (decode) =   96.02 MB
[D] 07:37:17.383630464.383 0x7efc49600680 create_model:249 - whisper model created
[D] 07:37:17.383643304.383 0x7efc49600680 set_state:469 - stt state: initializing => idle
[D] 07:37:17.383646124.383 0x7efc49600680 set_state:476 - speech detection status: initializing => speech-detected (speech-detected)
[D] 07:37:17.383630464.383 0x7efc49600680 create_model:249 - whisper model created
[D] 07:37:17.383643304.383 0x7efc49600680 set_state:469 - stt state: initializing => idle
[D] 07:37:17.383646124.383 0x7efc49600680 set_state:476 - speech detection status: initializing => speech-detected (speech-detected)
[D] 07:37:17.387925695.387 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.387949075.387 0x7effe8f13d00 () - task state changed: 3 => 1
[D] 07:37:17.387925695.387 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:17.387949075.387 0x7effe8f13d00 () - task state changed: 3 => 1
[D] 07:37:17.393759189.393 0x7effe8f13d00 () - app task state: initializing => speech-detected
[D] 07:37:17.393759189.393 0x7effe8f13d00 () - app task state: initializing => speech-detected
[D] 07:37:17.569713746.569 0x7effe8f13d00 () - audio state: ActiveState
[D] 07:37:17.569713746.569 0x7effe8f13d00 () - audio state: ActiveState
[D] 07:37:19.019660532.19 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=0, sof=true, eof=false
[D] 07:37:19.019660532.19 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=0, sof=true, eof=false
[D] 07:37:19.068091112.68 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:19.068091112.68 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:20.416758136.416 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=23520, sof=false, eof=false
[D] 07:37:20.416758136.416 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=23520, sof=false, eof=false
[D] 07:37:20.465553677.465 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:20.465553677.465 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:22.020203273.20 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=47520, sof=false, eof=false
[D] 07:37:22.020203273.20 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=47520, sof=false, eof=false
[D] 07:37:22.068820923.68 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:22.068820923.68 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:23.421849448.421 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=63360, sof=false, eof=false
[D] 07:37:23.421849448.421 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=63360, sof=false, eof=false
[D] 07:37:23.471305091.471 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:23.471305091.471 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:25.022435678.22 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=92160, sof=false, eof=false
[D] 07:37:25.022435678.22 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=92160, sof=false, eof=false
[D] 07:37:25.072340591.72 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:25.072340591.72 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:26.418791121.418 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=116160, sof=false, eof=false
[D] 07:37:26.418791121.418 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=116160, sof=false, eof=false
[D] 07:37:26.468650394.468 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:26.468650394.468 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:28.020537123.20 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=140160, sof=false, eof=false
[D] 07:37:28.020537123.20 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=140160, sof=false, eof=false
[D] 07:37:28.069845305.69 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:28.069845305.69 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:29.421963918.421 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=149280, sof=false, eof=false
[D] 07:37:29.421963918.421 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=24000, speech-buf size=149280, sof=false, eof=false
[D] 07:37:29.471044580.471 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:29.471044580.471 0x7efc49600680 process_buff:279 - vad: speech detected
[D] 07:37:29.670767505.670 0x7effe8f13d00 () - stt stop listen
[D] 07:37:29.670792325.670 0x7effe8f13d00 () - stop stt engine gracefully
[D] 07:37:29.670798885.670 0x7effe8f13d00 set_speech_started:527 - speech started: true => false
[D] 07:37:29.670767505.670 0x7effe8f13d00 () - stt stop listen
[D] 07:37:29.670792325.670 0x7effe8f13d00 () - stop stt engine gracefully
[D] 07:37:29.670798885.670 0x7effe8f13d00 set_speech_started:527 - speech started: true => false
[D] 07:37:29.670803755.670 0x7effe8f13d00 set_speech_detection_status:549 - speech detection status: speech-detected => no-speech (no-speech)
[D] 07:37:29.670811935.670 0x7effe8f13d00 () - mic source stop
[D] 07:37:29.670820795.670 0x7effe8f13d00 () - audio state: SuspendedState
[D] 07:37:29.670825225.670 0x7effe8f13d00 () - audio ended
[D] 07:37:29.670803755.670 0x7effe8f13d00 set_speech_detection_status:549 - speech detection status: speech-detected => no-speech (no-speech)
[D] 07:37:29.670811935.670 0x7effe8f13d00 () - mic source stop
[D] 07:37:29.670820795.670 0x7effe8f13d00 () - audio state: SuspendedState
[D] 07:37:29.670825225.670 0x7effe8f13d00 () - audio ended
[D] 07:37:29.671781438.671 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:29.671781438.671 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:29.672116788.672 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:29.672116788.672 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:29.676917680.676 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:29.676933440.676 0x7effe8f13d00 () - task state changed: 1 => 0
[D] 07:37:29.676917680.676 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:29.676933440.676 0x7effe8f13d00 () - task state changed: 1 => 0
[D] 07:37:29.682868915.682 0x7effe8f13d00 () - app task state: speech-detected => idle
[D] 07:37:29.682868915.682 0x7effe8f13d00 () - app task state: speech-detected => idle
[D] 07:37:29.805463409.805 0x7effe8f13d00 () - audio state: ActiveState
[D] 07:37:29.805478469.805 0x7effe8f13d00 () - audio ended
[D] 07:37:29.805463409.805 0x7effe8f13d00 () - audio state: ActiveState
[D] 07:37:29.805478469.805 0x7effe8f13d00 () - audio ended
[D] 07:37:30.005742336.5 0x7effe8f13d00 () - mic source stop
[D] 07:37:30.005742336.5 0x7effe8f13d00 () - mic source stop
[D] 07:37:30.005760336.5 0x7effe8f13d00 () - audio state: SuspendedState
[D] 07:37:30.005766396.5 0x7effe8f13d00 () - audio ended
[D] 07:37:30.005760336.5 0x7effe8f13d00 () - audio state: SuspendedState
[D] 07:37:30.005766396.5 0x7effe8f13d00 () - audio ended
[D] 07:37:30.006063757.6 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=4000, speech-buf size=176160, sof=false, eof=true
[D] 07:37:30.006063757.6 0x7efc49600680 process_buff:259 - process samples buf: mode=single-sentence, in-buf size=4000, speech-buf size=176160, sof=false, eof=true
[D] 07:37:30.015149170.15 0x7efc49600680 process_buff:294 - vad: no speech
[D] 07:37:30.015158940.15 0x7efc49600680 set_state:469 - stt state: idle => decoding
[D] 07:37:30.015163880.15 0x7efc49600680 set_state:476 - speech detection status: no-speech => decoding (no-speech)
[D] 07:37:30.015149170.15 0x7efc49600680 process_buff:294 - vad: no speech
[D] 07:37:30.015158940.15 0x7efc49600680 set_state:469 - stt state: idle => decoding
[D] 07:37:30.015163880.15 0x7efc49600680 set_state:476 - speech detection status: no-speech => decoding (no-speech)
[D] 07:37:30.015177580.15 0x7efc49600680 process_buff:362 - speech frame: samples=176160
[D] 07:37:30.015181790.15 0x7efc49600680 decode_speech:439 - speech decoding started
[D] 07:37:30.015177580.15 0x7efc49600680 process_buff:362 - speech frame: samples=176160
[D] 07:37:30.015181790.15 0x7efc49600680 decode_speech:439 - speech decoding started
[D] 07:37:30.015328560.15 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:30.015346040.15 0x7effe8f13d00 () - task state changed: 0 => 2
[D] 07:37:30.015328560.15 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:37:30.015346040.15 0x7effe8f13d00 () - task state changed: 0 => 2
[D] 07:37:30.015372690.15 0x7effe8f13d00 () - app task state: idle => processing
[D] 07:37:30.015372690.15 0x7effe8f13d00 () - app task state: idle => processing
[D] 07:37:30.216954630.216 0x7effe8f13d00 () - audio source ended successfuly
[D] 07:37:30.216954630.216 0x7effe8f13d00 () - audio source ended successfuly
[D] 07:37:34.680195719.680 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:34.680195719.680 0x7efe0ea00680 subscription_pa_callback:149 - pa source changed: 56
[D] 07:37:34.680569510.680 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:37:34.680569510.680 0x7efe0ea00680 source_info_pa_callback:200 - pa source: alsa_input.usb-HP__Inc_HyperX_Cloud_Stinger_2_Wireless_0-00.mono-fallback HyperX Cloud Stinger 2 Wireless Mono
[D] 07:38:36.427608803.427 0x7effe8f13d00 () - cancel
[D] 07:38:36.427608803.427 0x7effe8f13d00 () - cancel
[D] 07:38:36.427658743.427 0x7effe8f13d00 request_stop:254 - stt stop requested
[D] 07:38:36.427663453.427 0x7effe8f13d00 stop_processing_impl:230 - whisper cancel
[D] 07:38:36.427658743.427 0x7effe8f13d00 request_stop:254 - stt stop requested
[D] 07:38:36.427663453.427 0x7effe8f13d00 stop_processing_impl:230 - whisper cancel
[D] 07:38:36.427806664.427 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:38:36.427806664.427 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:38:36.427855464.427 0x7effe8f13d00 () - stt engine stopping
[D] 07:38:36.427855464.427 0x7effe8f13d00 () - stt engine stopping
[D] 07:38:36.427893664.427 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:38:36.427900274.427 0x7effe8f13d00 () - task state changed: 2 => 6
[D] 07:38:36.427893664.427 0x7effe8f13d00 () - service refresh status, new state: listening-single-sentence
[D] 07:38:36.427900274.427 0x7effe8f13d00 () - task state changed: 2 => 6
[D] 07:38:36.427915654.427 0x7effe8f13d00 () - app task state: processing => cancelling
[D] 07:38:36.427915654.427 0x7effe8f13d00 () - app task state: processing => cancelling
[D] 07:38:48.307073390.307 0x7effe8f13d00 () - exiting
[D] 07:38:48.307073390.307 0x7effe8f13d00 () - exiting
mkiol commented 6 months ago

Thanks for the log.

It is weird a bit, but in the log every line is doubled. How it could happen? It looks like you started in the same time two instances of Speech Note 🤔

Apart from "doubled lines", it simply looks like your GPU is not well supported in ROCm. This is not too surprising, as AMD ROCm officially supports only a few graphics cards. Unfortunately, I don't think I can do anything more.

mediocretwo commented 6 months ago

I'm not sure why the logs are doubled, I just ran the app with flatpak run.

Apart from "doubled lines", it simply looks like your GPU is not well supported in ROCm. This is not too surprising, as AMD ROCm officially supports only a few graphics cards. Unfortunately, I don't think I can do anything more.

Maybe you can create an issue in the ROCm repository. Apart from that, nothing else to do so thanks for everything.

mkiol commented 6 months ago

Maybe you can create an issue in the ROCm repository.

It seems that requests for RX 5700XT support in ROCm were made repeatedly in the past and finally support was added.... but according to this, it was withdrawn starting with ROCm 5.3. There is also open ticket which says that in ROCm 6.2 support will be re-enabled. What a story :)

mediocretwo commented 6 months ago

Oh wow... that is such a lack of support from AMD. 5700 XT was the flagship model in its generation, and it has lost support! Unbelievable.

Edit: Not sure if I should close the issue, since its not in your hands. Let me know.

Kentoseth commented 5 months ago

I'm somewhat surprised by the number of reported issues with AMD GPUs. I thought being Open Source guaranteed a little better outcome than NVidia, but it doesn't seem to be the case.

mkiol commented 5 months ago

Yeah... This could be AMD's strategy. They are very late in the field of GPU computing. Right now they are focusing ROCm only on a few graphics cards sold to data centers. They want to capture some of the money that goes to NVIDIA. Broad support for ROCm is not their priority. At least that's how I understand it.

mediocretwo commented 3 weeks ago

Hi, @mkiol just want to see if there's been an update in this issue. Seems like ROCm version 6.2 is released. Has the support for Rx 5700 XT been added?

mkiol commented 3 weeks ago

Hi,

I've updated ROCm to version 6.2.2 for the upcoming Speech Note v4.7, but I haven't checked if anything has changed for the Rx 5700 XT.

Perhaps ROCm won't be needed after all, since the latest whisper.cpp has support for GPU acceleration with Vulkan. I have already implemented it and it works flawlessly. STT decoding is even faster than ROCm and you don't need to install the huge ROCm framework (included in the add-on).

I plan to release a beta version to flatpak-beta over the weekend. I would appreciate it if you could test it on your computer.

mediocretwo commented 3 weeks ago

Perhaps ROCm won't be needed after all, since the latest whisper.cpp has support for GPU acceleration with Vulkan.

This sounds like amazing news! I'm really excited for it to start working on my GPU finally. I hope this can be implemented for supporting older 5000 series GPUs.

I plan to release a beta version to flatpak-beta over the weekend. I would appreciate it if you could test it on your computer.

I'm happy to help, but I'm no software engineer just a hobbyist. You will have to guide me through the advanced steps. Also I will be free next week after Tuesday, so will try to help then.

mkiol commented 2 weeks ago

Beta version is for regular user. You don't need to have any "advanced" skills. It may contain bugs but it is perfectly usable.

To install Beta you need to add "flathub-beta" repository to your system. You can do it simply by entering this command:

flatpak remote-add --if-not-exists flathub-beta https://flathub.org/beta-repo/flathub-beta.flatpakrepo

and install "Speech Note" from flathub-beta.

Full changelog of what you could find in 4.7.0 Beta 1 is here. If you are using GPU acceleration add-on, you need to update it as well (from flathub-beta).

To test Vulkan, set Hardware accelerator to Vulkan in the Speech to Text->WhisperCpp settings.

image

mediocretwo commented 2 weeks ago

Just tested using the Beta version, its working on GPU acceleration using Vulkan! Its wonderful. Thanks. Can I uninstall the AMD addon flatpak if I use Vulkan ?

mkiol commented 2 weeks ago

its working on GPU acceleration using Vulkan! Its wonderful

I'm super glad it works!

Can I uninstall the AMD addon flatpak if I use Vulkan ?

The addon contains ROCm, so if you use Speech Note only for STT, you can uninstall it. On the other hand, ROCm is also used in Coqui TTS and WhisperSpeech TTS, so if ROCm acceleration works with your GPU and you are interested in TTS as well, you can keep it.