Closed devSJR closed 9 months ago
Thank you for the report.
Would you be able to provide a log when this problem occurs?
You can enable logging with the --verbose option. Start the app with the following command:
flatpak run net.mkiol.SpeechNote --verbose
flatpak run net.mkiol.SpeechNote --verbose
Qt: Session management error: Could not open network socket
[I] 08:39:38.420 0x7ff9f8e12d00 init:49 - logging to stderr enabled
[D] 08:39:38.420 0x7ff9f8e12d00 () - version: 4.3.0
[D] 08:39:38.420 0x7ff9f8e12d00 () - translation: "en_US"
[W] 08:39:38.420 0x7ff9f8e12d00 () - failed to install translation
[D] 08:39:38.420 0x7ff9f8e12d00 () - starting standalone app
[D] 08:39:38.421 0x7ff9f8e12d00 () - app: net.mkiol dsnote
[D] 08:39:38.421 0x7ff9f8e12d00 () - config location: "/home/randomuser/.var/app/net.mkiol.SpeechNote/config"
[D] 08:39:38.421 0x7ff9f8e12d00 () - data location: "/home/randomuser/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 08:39:38.421 0x7ff9f8e12d00 () - cache location: "/home/randomuser/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 08:39:38.421 0x7ff9f8e12d00 () - settings file: "/home/randomuser/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 08:39:38.421 0x7ff9f8e12d00 () - platform: "xcb"
[D] 08:39:38.456 0x7ff9f8e12d00 () - supported audio input devices:
ALSA lib ../../oss/pcm_oss.c:397:(_snd_pcm_oss_open) Cannot open device /dev/dsp
[D] 08:39:38.464 0x7ff9f8e12d00 () - "pulse"
[D] 08:39:38.567 0x7ff9f8e12d00 () - "default"
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
[D] 08:39:38.568 0x7ff9f8e12d00 () - "alsa_input.pci-0000_00_1f.3.analog-stereo"
[D] 08:39:38.568 0x7ff9f8e12d00 () - "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
[D] 08:39:38.637 0x7ff9f8e12d00 () - starting service: app-standalone
[D] 08:39:38.641 0x7ff9f8e12d00 () - mbrola dir: "/app/bin"
[D] 08:39:38.641 0x7ff9f8e12d00 () - espeak dir: "/app/bin"
[D] 08:39:38.641 0x7ff9ddffe600 loop:56 - py executor loop started
[D] 08:39:38.646 0x7ff9f8e12d00 () - module already unpacked: "rhvoicedata"
[D] 08:39:38.646 0x7ff9f8e12d00 () - module already unpacked: "rhvoiceconfig"
[D] 08:39:38.648 0x7ff9de7ff600 () - config version: 51 51
[D] 08:39:38.649 0x7ff9f8e12d00 () - module already unpacked: "espeakdata"
[D] 08:39:38.649 0x7ff9f8e12d00 () - default stt model not found: "de_fasterwhisper_large2"
[D] 08:39:38.649 0x7ff9f8e12d00 () - default tts model not found: "en_piper_us_ryan_high"
[D] 08:39:38.649 0x7ff9f8e12d00 () - default mnt lang not found: "de"
[D] 08:39:38.649 0x7ff9f8e12d00 () - new default mnt lang: "de"
[D] 08:39:38.649 0x7ff9f8e12d00 () - service refresh status, new state: busy
[D] 08:39:38.649 0x7ff9f8e12d00 () - service state changed: unknown => busy
[D] 08:39:38.649 0x7ff9f8e12d00 () - delaying features availability
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[D] 08:39:38.650 0x7ff9f8e12d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 08:39:38.650 0x7ff9f8e12d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[D] 08:39:38.650 0x7ff9f8e12d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 08:39:38.650 0x7ff9f8e12d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[D] 08:39:38.650 0x7ff9f8e12d00 () - switching to style: "org.kde.desktop"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.650 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.651 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.652 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.653 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.654 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.654 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.654 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.654 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[W] 08:39:38.654 0x7ff9de7ff600 () - checksum mismatch: "dfa47af8" (expected: "b4157ea9" ) "multilang_whisper_large.ggml"
[D] 08:39:38.663 0x7ff9de7ff600 () - models changed
[D] 08:39:39.257 0x7ff9f8e12d00 () - starting app: app-standalone
[D] 08:39:39.258 0x7ff9f8e12d00 () - app service state: unknown => busy
logger error: invalid format string
qrc:/qml/main.qml:269:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(
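The same checksum warning is repeated many times in the log above, which makes the other lines hard to spot. As a rough sketch (not part of Speech Note), a small Python helper could collapse repeated messages before a log is posted. The line layout in the regular expression is an assumption inferred from the log above, and the names `summarize` and `format_summary` are made up for illustration.

```python
import re
from collections import Counter

# Assumed layout, inferred from the log above (not from any official spec):
# [LEVEL] HH:MM:SS.mmm 0xTHREAD location - message
LINE_RE = re.compile(
    r"^\[(?P<level>[A-Z])\] \d{2}:\d{2}:\d{2}\.\d{3} 0x[0-9a-f]+ \S+ - (?P<msg>.*)$"
)


def summarize(lines):
    """Count identical (level, message) pairs, ignoring timestamp and thread id."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match:  # lines printed by Qt or ALSA do not follow the layout and are skipped
            counts[(match.group("level"), match.group("msg"))] += 1
    return counts


def format_summary(counts, min_count=2):
    """Return one text row per message seen at least min_count times, most frequent first."""
    return [
        f"{n}x [{level}] {msg}"
        for (level, msg), n in counts.most_common()
        if n >= min_count
    ]
```

If the output of the --verbose run were saved to a file (for example a hypothetical speechnote.log), `format_summary(summarize(open("speechnote.log")))` would list each repeated message once together with its count.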
Methinks the issue starts as soon as I invoke the STT a second time via the shortcut (shift+ctrl+alt+L). The issue does not occur when I use the listen button.
@devSJR Many thanks for the log and for catching this bug.
The problem occurred because you are using the Press and hold option in Listening mode. This listening mode is not handled correctly with keyboard shortcuts.
Fix: 4258fa0d52fa8016c9309aaa36ce4fe539695ce7
I just tested it. Am I right that this is not yet in beta 4.4.0?
Indeed, not yet. I will try to push new beta next week.
PS: The new beta might be delayed because I split the Flatpak package into 3 smaller sub-packages (Add-ons): the base one, NVIDIA-only and AMD-only. I need to request separate Flathub repositories for all add-ons, which might be complicated. Thanks to this modular approach, the main package will be much smaller.
Fix is included in v4.4.0.
Whenever I invoke dsnote 4.3 (Linux, Flatpak, CUDA, 4 GB VRAM) for STT (German, Whisper, large, v2) via shortcuts (it also happens from the 'listen' button), it does not really stop listening. It indicates 'busy …'. However, it is still possible to do STT in this state. Pushing cancel is not possible (greyed out). If I keep it in this state, I often get very long text in the notepad, possibly things I said while in the room talking to others. To really stop it, I have to restart dsnote.