mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Mozilla Public License 2.0

SpeechNote crashes #8

Closed: snden closed this issue 1 year ago

snden commented 1 year ago

I have installed and run SpeechNote from flatpak. It starts up fine, but as soon as I press Listen, it loads the speech model and crashes.

$ flatpak run net.mkiol.SpeechNote
Gtk-Message: 13:44:53.142: Failed to load module "xapp-gtk3-module"
Qt: Session management error: Authentication Rejected, reason : None of the authentication protocols specified are supported and host-based authentication failed

I select the Speech to text model and press Listen

whisper_init_from_file_no_state: loading model from '/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/multilang_whisper_base.ggml'
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51865
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 512
whisper_model_load: n_audio_head  = 8
whisper_model_load: n_audio_layer = 6
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 512
whisper_model_load: n_text_head   = 8
whisper_model_load: n_text_layer  = 6
whisper_model_load: n_mels        = 80
whisper_model_load: f16           = 1
whisper_model_load: type          = 2
whisper_model_load: mem required  =  218,00 MB (+    6,00 MB per decoder)
whisper_model_load: adding 1608 extra tokens
whisper_model_load: model ctx     =  140,60 MB

And SpeechNote crashes. The same situation occurs for each selected speech model.

(Linux Mint 21.1, Xfce 4.16)

mkiol commented 1 year ago

Thank you for the report.

Could you please start the app in the following way:

$ flatpak run --command=sh net.mkiol.SpeechNote 
[📦 net.mkiol.SpeechNote ~]$ dsnote --verbose

When the app crashes, what are the last few output lines?

snden commented 1 year ago

Thank you for the answer. I ran the application in the terminal:

$ dsnote --verbose
Gtk-Message: 20:11:48.678: Failed to load module "xapp-gtk3-module"
Qt: Session management error: Authentication Rejected, reason : None of the authentication protocols specified are supported and host-based authentication failed
[I] 20:11:48.738 0x7fc2a6d8ad00 init:49 - logging to stderr enabled
[D] 20:11:48.738 0x7fc2a6d8ad00 () - translation: "cs_CZ"
[D] 20:11:48.738 0x7fc2a6d8ad00 () - launch mode: app-standalone
[D] 20:11:48.738 0x7fc2a6d8ad00 () - app: net.mkiol dsnote
[D] 20:11:48.738 0x7fc2a6d8ad00 () - config location: "/home/user/.var/app/net.mkiol.SpeechNote/config"
[D] 20:11:48.738 0x7fc2a6d8ad00 () - data location: "/home/user/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 20:11:48.738 0x7fc2a6d8ad00 () - cache location: "/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 20:11:48.738 0x7fc2a6d8ad00 () - settings file: "/home/user/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 20:11:48.738 0x7fc2a6d8ad00 () - starting standalone app
[D] 20:11:48.738 0x7fc2a24cc600 loop:57 - py executor loop started
[D] 20:11:48.780 0x7fc2a6d8ad00 () - starting service: app-standalone
[D] 20:11:48.786 0x7fc2a0a0e600 () - config version: 16 16
[D] 20:11:48.788 0x7fc2a0a0e600 () - checksum ok: "2e243a59" "multilang_whisper_tiny.ggml"
[D] 20:11:48.788 0x7fc2a0a0e600 () - found model: "multilang_whisper_tiny"
[D] 20:11:48.788 0x7fc2a0a0e600 () - checksum ok: "5dc0e88e" "multilang_whisper_base.ggml"
[D] 20:11:48.788 0x7fc2a6d8ad00 () - module already unpacked: "rhvoicedata"
[D] 20:11:48.788 0x7fc2a0a0e600 () - found model: "multilang_whisper_base"
[D] 20:11:48.788 0x7fc2a6d8ad00 () - module already unpacked: "rhvoiceconfig"
[D] 20:11:48.788 0x7fc2a0a0e600 () - found model: "es_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.788 0x7fc2a0a0e600 () - found model: "es_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "it_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "it_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "pt_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "pt_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "de_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "de_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "jp_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "jp_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.789 0x7fc2a0a0e600 () - found model: "ru_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "ru_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "nl_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "nl_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "id_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "id_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "ca_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "ca_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.790 0x7fc2a0a0e600 () - found model: "fr_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "fr_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "tr_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "tr_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "sv_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "sv_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "uk_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "uk_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.791 0x7fc2a0a0e600 () - found model: "ms_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "ms_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "no_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "no_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "fi_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "fi_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "cs_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.792 0x7fc2a0a0e600 () - found model: "cs_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.793 0x7fc2a0a0e600 () - checksum ok: "11841031" "cs_whisper_small_ft.ggml"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "cs_whisper_small_ft"
[D] 20:11:48.793 0x7fc2a0a0e600 () - checksum ok: "c6404ae3" "cs_whisper_medium_ft.ggml"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "cs_whisper_medium_ft"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "sk_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "sk_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "zh_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.793 0x7fc2a0a0e600 () - found model: "zh_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "vi_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "vi_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "th_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "th_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "el_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "el_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "hr_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "hr_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.794 0x7fc2a0a0e600 () - found model: "bg_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "bg_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "ro_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "ro_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "sl_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "sl_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "bs_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "bs_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "mk_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.795 0x7fc2a0a0e600 () - found model: "mk_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "sr_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "sr_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.796 0x7fc2a6d8ad00 () - module already unpacked: "espeakdata"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "hu_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "hu_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "hi_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.796 0x7fc2a6d8ad00 () - model or lang id: "en"
[D] 20:11:48.796 0x7fc2a6d8ad00 () - default stt model not found: "cs_whisper_base"
[D] 20:11:48.796 0x7fc2a6d8ad00 () - default tts model not found: "en"
[D] 20:11:48.796 0x7fc2a6d8ad00 () - no models available
[D] 20:11:48.796 0x7fc2a6d8ad00 () - state changed: unknown => busy
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "hi_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "fa_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.796 0x7fc2a0a0e600 () - found model: "fa_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "da_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "da_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "et_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "et_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "eu_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "eu_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "is_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "is_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.797 0x7fc2a0a0e600 () - found model: "ka_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "ka_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "kk_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "kk_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "ko_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "ko_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "lt_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "lt_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "lv_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "lv_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.798 0x7fc2a0a0e600 () - found model: "mn_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "mn_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "mt_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "mt_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "ne_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "ne_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "sq_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "sq_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "sw_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.799 0x7fc2a0a0e600 () - found model: "sw_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "tl_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "tl_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "tt_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "tt_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "uz_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "uz_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "yo_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "yo_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.800 0x7fc2a0a0e600 () - found model: "am_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "am_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "ar_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "ar_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "bn_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "bn_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "pl_whisper_tiny" alias of "multilang_whisper_tiny"
[D] 20:11:48.801 0x7fc2a0a0e600 () - found model: "pl_whisper_base" alias of "multilang_whisper_base"
[D] 20:11:48.808 0x7fc2a0a0e600 () - checksum ok: "b8b03e8a" "multilang_hftc_kredor"
[D] 20:11:48.808 0x7fc2a0a0e600 () - found model: "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "en_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "de_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "fr_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "es_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "it_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "pl_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "nl_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "cs_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "pt_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "sl_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "el_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "sv_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "hu_hftc_kredor" alias of "multilang_hftc_kredor"
[D] 20:11:48.809 0x7fc2a0a0e600 () - found model: "ro_hftc_kredor" alias of "multilang_hftc_kredor"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "am_espeak_am"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "ar_espeak_ar"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "bg_espeak_bg"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "bs_espeak_bs"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "ca_espeak_ca"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "cs_espeak_cs"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "da_espeak_da"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "de_espeak_de"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "el_espeak_el"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "en_espeak_en"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "eo_espeak_eo"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "es_espeak_es"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "et_espeak_et"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "eu_espeak_eu"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "is_espeak_is"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "fa_espeak_fa"
[W] 20:11:48.810 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.810 0x7fc2a0a0e600 () - found model: "fi_espeak_fi"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "fr_espeak_fr"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "hi_espeak_hi"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "hr_espeak_hr"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "hu_espeak_hu"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "id_espeak_id"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "it_espeak_it"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "jp_espeak_jp"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "kk_espeak_kk"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "ko_espeak_ko"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "lv_espeak_lv"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "lt_espeak_lt"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "mk_espeak_mk"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "ms_espeak_ms"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "ne_espeak_ne"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "nl_espeak_nl"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "no_espeak_no"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "pt_espeak_pt"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "pt_espeak_pt_br"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "ro_espeak_ro"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "ru_espeak_ru"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "sk_espeak_sk"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "sl_espeak_sl"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "sr_espeak_sr"
[W] 20:11:48.811 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.811 0x7fc2a0a0e600 () - found model: "sv_espeak_sv"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "sw_espeak_sw"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "th_espeak_th"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "tr_espeak_tr"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "uk_espeak_uk"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "ka_espeak_ka"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "ky_espeak_ky"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "tt_espeak_tt"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "sq_espeak_sq"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "uz_espeak_uz"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "vi_espeak_vi"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "zh_CN_espeak_yue"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "zh_CN_espeak_hak"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "zh_CN_espeak_cmn"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "ga_espeak_ga"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "mt_espeak_mt"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "bn_espeak_bn"
[W] 20:11:48.812 0x7fc2a0a0e600 () - QDir::exists: Empty or null file name
[D] 20:11:48.812 0x7fc2a0a0e600 () - found model: "pl_espeak_pl"
[D] 20:11:48.814 0x7fc2a0a0e600 () - models changed
[W] 20:11:49.54 0x7fc2a6d8ad00 () - WARNING: Cannot find style "org.kde.desktop" - fallback: "/usr/lib/qml/QtQuick/Controls/Styles/Base"
[W] 20:11:49.86 0x7fc2a6d8ad00 ():64 - qrc:/qml/main.qml:64:13: QML TextArea: ScrollBar must be attached to a Flickable or ScrollView
[D] 20:11:49.89 0x7fc2a6d8ad00 () - starting app: app-standalone
[D] 20:11:49.89 0x7fc2a6d8ad00 () - app service state: unknown => busy
logger error: invalid format string
logger error: invalid format string
[D] 20:11:49.265 0x7fc2a6d8ad00 () - model or lang id: "en"
[D] 20:11:49.266 0x7fc2a6d8ad00 () - default tts model not found: "en"
[D] 20:11:49.266 0x7fc2a6d8ad00 () - state changed: busy => idle
[D] 20:11:49.813 0x7fc2a6d8ad00 () - app service state: busy => idle
[D] 20:11:49.813 0x7fc2a6d8ad00 () - app stt available models: 0 => 4
[D] 20:11:49.815 0x7fc2a6d8ad00 () - update listen
[D] 20:11:49.815 0x7fc2a6d8ad00 () - cancel
[W] 20:11:49.815 0x7fc2a6d8ad00 () - no current task
[D] 20:11:49.815 0x7fc2a6d8ad00 () - app active stt model: "" => "cs_whisper_base"
[D] 20:11:49.815 0x7fc2a6d8ad00 () - update listen
[D] 20:11:49.816 0x7fc2a6d8ad00 () - cancel
[W] 20:11:49.816 0x7fc2a6d8ad00 () - no current task
[W] 20:11:49.816 0x7fc2a6d8ad00 () - invalid task, reseting speech state
[D] 20:11:49.816 0x7fc2a6d8ad00 () - app stt configured: false => true
[D] 20:11:49.817 0x7fc2a6d8ad00 () - app busy: true => false
[D] 20:11:49.817 0x7fc2a6d8ad00 () - stt models changed
[D] 20:11:49.817 0x7fc2a6d8ad00 () - update listen
[D] 20:11:49.817 0x7fc2a6d8ad00 () - cancel
[W] 20:11:49.817 0x7fc2a6d8ad00 () - no current task
[D] 20:11:49.817 0x7fc2a6d8ad00 () - tts models changed
[D] 20:11:49.817 0x7fc2a6d8ad00 () - update listen
[D] 20:11:49.817 0x7fc2a6d8ad00 () - cancel
[W] 20:11:49.817 0x7fc2a6d8ad00 () - no current task
[D] 20:11:49.817 0x7fc2a6d8ad00 () - ttt models changed
[D] 20:11:49.817 0x7fc2a6d8ad00 () - app ttt configured: false => true

The app is running and waiting for speech recognition to start. When I press 'Listen' the report in the terminal continues:

[D] 20:12:53.8 0x7f449e6d4d00 () - stt start listen
[D] 20:12:53.8 0x7f449e6d4d00 () - model or lang id: "cs_whisper_base"
[D] 20:12:53.9 0x7f449e6d4d00 () - default tts model not found: "en"
[D] 20:12:53.9 0x7f449e6d4d00 () - restart stt engine config: "lang=cs, model-files=[model-file=/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/multilang_whisper_base.ggml, scorer-file=/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/multilang_whisper_base.scorer, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0"
[D] 20:12:53.9 0x7f449e6d4d00 () - new stt engine required
[D] 20:12:53.27 0x7f449e6d4d00 make_wparams:200 - cpu info: arch=x86_64, cores=4, neon=false
[D] 20:12:53.27 0x7f449e6d4d00 make_wparams:203 - using threads: 2/4
[D] 20:12:53.27 0x7f449e6d4d00 make_wparams:205 - system info: AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | VSX = 0 | COREML = 0 | 
[D] 20:12:53.27 0x7f449e6d4d00 start:177 - starting engine
[D] 20:12:53.27 0x7f449e6d4d00 start:185 - engine started
[D] 20:12:53.27 0x7f449e6d4d00 () - creating audio source
[D] 20:12:53.28 0x7f449e6d4d00 () - mic source created
[D] 20:12:53.28 0x7f446e17c600 start_processing:222 - processing started
[D] 20:12:53.28 0x7f446e17c600 set_processing_state:408 - processing state: idle => initializing
[D] 20:12:53.28 0x7f446e17c600 set_processing_state:415 - speech detection status: no-speech => initializing (no-speech)
[D] 20:12:53.28 0x7f446e17c600 () - speech state changed: 0 => 3
[D] 20:12:53.28 0x7f446e17c600 create_whisper_model:59 - creating whisper model
whisper_init_from_file_no_state: loading model from '/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/multilang_whisper_base.ggml'
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51865
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 512
whisper_model_load: n_audio_head  = 8
whisper_model_load: n_audio_layer = 6
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 512
whisper_model_load: n_text_head   = 8
whisper_model_load: n_text_layer  = 6
whisper_model_load: n_mels        = 80
whisper_model_load: f16           = 1
whisper_model_load: type          = 2
whisper_model_load: mem required  =  218,00 MB (+    6,00 MB per decoder)
[D] 20:12:53.73 0x7f449e6d4d00 () - audio state: IdleState
[D] 20:12:53.73 0x7f449e6d4d00 set_speech_started:464 - speech started: false => true
[D] 20:12:53.73 0x7f449e6d4d00 set_speech_detection_status:486 - speech detection status: initializing => initializing (speech-detected)
[D] 20:12:53.73 0x7f449e6d4d00 () - state changed: idle => listening-single-sentence
[W] 20:12:53.73 0x7f449e6d4d00 () - ignore SpeechPropertyChanged signal
[D] 20:12:53.73 0x7f449e6d4d00 () - app current task: -1 => 0
[D] 20:12:53.73 0x7f449e6d4d00 () - app speech state: no-speech => initializing
[D] 20:12:53.76 0x7f449e6d4d00 () - app service state: idle => listening-single-sentence
whisper_model_load: adding 1608 extra tokens
whisper_model_load: model ctx     =  140,60 MB
Illegal instruction (SIGILL) (core dumped)

... and at this moment the application crashes.

mkiol commented 1 year ago

What cpu are you using?

Could you please paste output of:

cat /proc/cpu_info

snden commented 1 year ago

It is an old Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz. Do you mean $ cat /proc/cpuinfo ?

mkiol commented 1 year ago

Do you mean $ cat /proc/cpuinfo ?

Yes. Sorry for the typo.

I just want to know which flags your CPU supports. The Illegal instruction (SIGILL) error occurs because the program tries to execute a CPU instruction that is not available on your machine. Whisper-cpp (the library where the crash occurs) is heavily optimized and uses many CPU extensions to make code execution faster, and most likely one of those extensions is missing on your CPU.
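
A minimal sketch of such a flag check, assuming a Linux system where `/proc/cpuinfo` contains a `flags` line. The helper names `cpu_flags` and `missing_flags` and the list of wanted extensions are illustrative assumptions, not part of dsnote or whisper.cpp:

```python
def cpu_flags(cpuinfo_text):
    """Return the set of flags from the first 'flags' line of /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def missing_flags(cpuinfo_text, wanted):
    """Return the wanted flags that the CPU does not report, sorted by name."""
    return sorted(set(wanted) - cpu_flags(cpuinfo_text))


if __name__ == "__main__":
    # Assumption for illustration: the extensions we want to check for.
    wanted = ["avx", "avx2", "fma", "f16c"]
    try:
        with open("/proc/cpuinfo") as f:
            text = f.read()
    except OSError:
        text = ""  # not Linux, or /proc is unavailable
    print("missing:", missing_flags(text, wanted) or "none")
```

Any flag printed as missing is an extension that a binary compiled to use it could not run on this CPU.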

snden commented 1 year ago

$ cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 23
model name : Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz
stepping : 10
microcode : 0xa0e
cpu MHz : 1998.000
cache size : 3072 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 4
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 13
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl cpuid aperfmperf pni dtes64 monitor ds_cpl smx est tm2 ssse3 cx16 xtpr pdcm sse4_1 xsave lahf_lm pti dtherm
bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown
bogomips : 5319.81
clflush size : 64
cache_alignment : 64
address sizes : 36 bits physical, 48 bits virtual
power management:

processor : 1 vendor_id : GenuineIntel cpu family : 6 model : 23 model name : Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz stepping : 10 microcode : 0xa0e cpu MHz : 1998.000 cache size : 3072 KB physical id : 0 siblings : 4 core id : 1 cpu cores : 4 apicid : 1 initial apicid : 1 fpu : yes fpu_exception : yes cpuid level : 13 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl cpuid aperfmperf pni dtes64 monitor ds_cpl smx est tm2 ssse3 cx16 xtpr pdcm sse4_1 xsave lahf_lm pti dtherm bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown bogomips : 5319.81 clflush size : 64 cache_alignment : 64 address sizes : 36 bits physical, 48 bits virtual power management:

processor : 2 vendor_id : GenuineIntel cpu family : 6 model : 23 model name : Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz stepping : 10 microcode : 0xa0e cpu MHz : 1994.929 cache size : 3072 KB physical id : 0 siblings : 4 core id : 3 cpu cores : 4 apicid : 3 initial apicid : 3 fpu : yes fpu_exception : yes cpuid level : 13 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl cpuid aperfmperf pni dtes64 monitor ds_cpl smx est tm2 ssse3 cx16 xtpr pdcm sse4_1 xsave lahf_lm pti dtherm bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown bogomips : 5319.81 clflush size : 64 cache_alignment : 64 address sizes : 36 bits physical, 48 bits virtual power management:

processor : 3 vendor_id : GenuineIntel cpu family : 6 model : 23 model name : Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz stepping : 10 microcode : 0xa0e cpu MHz : 1998.000 cache size : 3072 KB physical id : 0 siblings : 4 core id : 2 cpu cores : 4 apicid : 2 initial apicid : 2 fpu : yes fpu_exception : yes cpuid level : 13 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl cpuid aperfmperf pni dtes64 monitor ds_cpl smx est tm2 ssse3 cx16 xtpr pdcm sse4_1 xsave lahf_lm pti dtherm bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown bogomips : 5319.81 clflush size : 64 cache_alignment : 64 address sizes : 36 bits physical, 48 bits virtual power management:

mkiol commented 1 year ago

It looks like your CPU doesn't support the fma and f16c extensions, which the whisper-cpp library requires. This is the cause of the crash.

The need for fma and f16c can be disabled at compile time, so I could build an app that won't crash on your CPU, but performance would then be much worse on every other CPU.

The only solution I see is to disable Whisper models on systems without fma and f16c. You will not see the crash because you will not be able to use Whisper in the first place. Sorry :/
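
As a rough illustration of that kind of check, here is a minimal Python sketch that compares the flags line of /proc/cpuinfo with the two extensions named above. It is not the code used in the app; the function names and the REQUIRED_FLAGS constant are made up for this example.

```python
# Extensions named in this thread as required by the optimized whisper-cpp build.
# (Assumption for illustration: only these two are checked.)
REQUIRED_FLAGS = {"fma", "f16c"}


def parse_cpu_flags(cpuinfo_text: str) -> set:
    """Return the flags reported for the first processor in /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()  # no "flags" line found


def missing_flags(cpuinfo_text: str, required=REQUIRED_FLAGS) -> set:
    """Return the required extensions that the CPU does not report."""
    return set(required) - parse_cpu_flags(cpuinfo_text)
```

On Linux it could be called as missing_flags(open("/proc/cpuinfo").read()). An empty result means both extensions are reported; a non-empty result lists the extensions whose absence would lead to the SIGILL crash described here.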

snden commented 1 year ago

Thank you for your message. I then downloaded the DeepSpeech Comodoro and Vosk Small models. With DeepSpeech Comodoro, the application crashes just as it does with the Whisper models, except that no model-loading message appears in the terminal. With the Vosk Small model, SpeechNote works, but the speech recognition is very poor. Since the app crashes even when loading the DeepSpeech Comodoro model, could the error be due to another cause?

mkiol commented 1 year ago

Since the app crashes even when loading the DeepSpeech Comodoro model, could the error be due to another cause?

That's not good :/

Could you help me debug this problem? If yes, please follow these instructions:

$ flatpak install runtime/org.kde.Sdk/x86_64/5.15-22.08
$ flatpak run --command=sh --devel net.mkiol.SpeechNote
[📦 net.mkiol.SpeechNote ~]$ gdb --eval-command=r --eval-command=bt --eval-command=q --args dsnote --verbose

I would be very grateful if you could help me with this.

snden commented 1 year ago

Honestly, I don't know what I'm doing right now; I'm not a software developer, but I'll try to help. I selected the DeepSpeech Comodoro model and pressed Listen:

[D] 13:09:56.974 0x7fffef172d00 () - stt start listen
[D] 13:09:56.974 0x7fffef172d00 () - model or lang id: "cs"
[D] 13:09:56.975 0x7fffef172d00 () - default tts model not found: "en"
[D] 13:09:56.975 0x7fffef172d00 () - restart stt engine config: "lang=cs, model-files=[model-file=/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/cs.tflite, scorer-file=/home/user/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote/speech-models/cs.scorer, ttt-model-file=], speech-mode=single-sentence, vad-mode=aggressiveness-3, speech-started=0"
[D] 13:09:56.975 0x7fffef172d00 () - new stt engine required

Thread 1 "dsnote" received signal SIGILL, Illegal instruction.
0x00007fffd02795a7 in ?? () from /app/lib/libkenlm.so
#0  0x00007fffd02795a7 in  () at /app/lib/libkenlm.so
#1  0x00007ffff7fcbfff in call_init () at /lib64/ld-linux-x86-64.so.2
#2  0x00007ffff7fcc0fd in _dl_init () at /lib64/ld-linux-x86-64.so.2
#3  0x00007ffff2a51f18 in _dl_catch_exception () at /usr/lib/x86_64-linux-gnu/libc.so.6
--Type <RET> for more, q to quit, c to continue without paging--

'c' confirmed

#4  0x00007ffff7fd33bc in dl_open_worker () at /lib64/ld-linux-x86-64.so.2
#5  0x00007ffff2a51ebc in _dl_catch_exception () at /usr/lib/x86_64-linux-gnu/libc.so.6
#6  0x00007ffff7fd3764 in _dl_open () at /lib64/ld-linux-x86-64.so.2
#7  0x00007ffff297c094 in dlopen_doit () at /usr/lib/x86_64-linux-gnu/libc.so.6
#8  0x00007ffff2a51ebc in _dl_catch_exception () at /usr/lib/x86_64-linux-gnu/libc.so.6
#9  0x00007ffff2a51f88 in _dl_catch_error () at /usr/lib/x86_64-linux-gnu/libc.so.6
#10 0x00007ffff297bb63 in _dlerror_run () at /usr/lib/x86_64-linux-gnu/libc.so.6
#11 0x00007ffff297c14f in dlopen () at /usr/lib/x86_64-linux-gnu/libc.so.6
#12 0x00005555556145e5 in  ()
#13 0x0000555555614d1b in  ()
#14 0x0000555555600568 in  ()
#15 0x00005555556015ed in  ()
#16 0x00005555555cc208 in  ()
#17 0x00005555555bc302 in  ()
#18 0x00007ffff3f6fe83 in  () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#19 0x00007ffff3e467fd in  () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#20 0x00007ffff3e49b6b in QV4::QObjectMethod::callInternal(QV4::Value const*, QV4::Value const*, int) const () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#21 0x00007ffff3e66416 in  () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#22 0x00007ffff3e69b3f in  () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#23 0x00007ffff3dfc54e in QV4::Function::call(QV4::Value const*, QV4::Value const*, int, QV4::ExecutionContext const*) () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#24 0x00007ffff3f8a6fd in QQmlJavaScriptExpression::evaluate(QV4::CallData*, bool*) () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#25 0x00007ffff3f3b98f in QQmlBoundSignalExpression::evaluate(void**) () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#26 0x00007ffff3f3d0d8 in  () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#27 0x00007ffff3f6f96f in QQmlNotifier::emitNotify(QQmlNotifierEndpoint*, void**) () at /usr/lib/x86_64-linux-gnu/libQt5Qml.so.5
#28 0x00007ffff3148d4d in  () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#29 0x00007fffdeb7b341 in QQuickAbstractButtonPrivate::handleRelease(QPointF const&) () at /usr/lib/x86_64-linux-gnu/libQt5QuickTemplates2.so.5
#30 0x00007fffdeb9a191 in QQuickControl::mouseReleaseEvent(QMouseEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5QuickTemplates2.so.5
#31 0x00007fffdf03c768 in QQuickItem::event(QEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Quick.so.5
#32 0x00007ffff3111d48 in QCoreApplication::notifyInternal2(QObject*, QEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#33 0x00007fffdf05752a in QQuickWindowPrivate::deliverMouseEvent(QQuickPointerMouseEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Quick.so.5
#34 0x00007fffdf058c0d in QQuickWindowPrivate::deliverPointerEvent(QQuickPointerEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Quick.so.5
#35 0x00007fffdf059d99 in QQuickWindowPrivate::handleMouseEvent(QMouseEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Quick.so.5
#36 0x00007ffff355a0a5 in QWindow::event(QEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Gui.so.5
#37 0x00007ffff3111d48 in QCoreApplication::notifyInternal2(QObject*, QEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#38 0x00007ffff354e1bd in QGuiApplicationPrivate::processMouseEvent(QWindowSystemInterfacePrivate::MouseEvent*) () at /usr/lib/x86_64-linux-gnu/libQt5Gui.so.5
#39 0x00007ffff35210bc in QWindowSystemInterface::sendWindowSystemEvents(QFlags<QEventLoop::ProcessEventsFlag>) () at /usr/lib/x86_64-linux-gnu/libQt5Gui.so.5
#40 0x00007fffed58b92a in  () at /usr/lib/x86_64-linux-gnu/libQt5XcbQpa.so.5
#41 0x00007ffff277dd71 in g_main_context_dispatch () at /usr/lib/x86_64-linux-gnu/libglib-2.0.so.0
#42 0x00007ffff27d4028 in g_main_context_iterate.constprop () at /usr/lib/x86_64-linux-gnu/libglib-2.0.so.0
#43 0x00007ffff277b4a3 in g_main_context_iteration () at /usr/lib/x86_64-linux-gnu/libglib-2.0.so.0
#44 0x00007ffff3169a96 in QEventDispatcherGlib::processEvents(QFlags<QEventLoop::ProcessEventsFlag>) () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#45 0x00007ffff31107bb in QEventLoop::exec(QFlags<QEventLoop::ProcessEventsFlag>) () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#46 0x00007ffff3118926 in QCoreApplication::exec() () at /usr/lib/x86_64-linux-gnu/libQt5Core.so.5
#47 0x000055555559765f in main ()
A debugging session is active.

    Inferior 1 [process 9] will be killed.

Quit anyway? (y or n)

mkiol commented 1 year ago

It is perfect. I have what I need. Big thank you!
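
For anyone reading the backtrace above: frame #0 is inside /app/lib/libkenlm.so, and the frames just above it (call_init, _dl_init, dlopen) show that the illegal instruction was hit while the library was still being loaded. The following is a minimal sketch of pulling that information out of a pasted gdb backtrace. It is not part of dsnote; the helper names are hypothetical, and it assumes frames are printed in the "#N 0xADDR in func () at/from path" form seen in this thread.

```python
import re

# Matches gdb frame lines such as:
#   #1  0x00007ffff7fcbfff in call_init () at /lib64/ld-linux-x86-64.so.2
# The trailing "at/from <path>" part is optional (e.g. "#47 ... in main ()").
# With debug info, gdb prints a source location after "at" instead of a library.
FRAME_RE = re.compile(
    r"^#(?P<index>\d+)\s+0x[0-9a-fA-F]+ in\s+(?P<func>.*?)\s*\(.*\)"
    r"(?:\s+(?:at|from)\s+(?P<module>\S+))?\s*$"
)

# Dynamic-loader functions that appear in the trace when a crash
# happens during dlopen() of a shared library.
LOADER_FUNCS = {"call_init", "_dl_init", "dl_open_worker", "_dl_open", "dlopen"}


def parse_frames(backtrace):
    """Return a list of (index, function, module) tuples, one per frame line."""
    frames = []
    for line in backtrace.splitlines():
        match = FRAME_RE.match(line.strip())
        if match:
            # Frames without symbol names get "??" as a placeholder.
            frames.append((int(match["index"]), match["func"] or "??", match["module"]))
    return frames


def summarize(backtrace):
    """Report the innermost frame and whether the crash happened inside dlopen."""
    frames = parse_frames(backtrace)
    if not frames:
        return None
    _, func, module = min(frames, key=lambda frame: frame[0])
    return {
        "function": func,
        "module": module,
        "inside_dlopen": any(name in LOADER_FUNCS for _, name, _ in frames),
    }
```

Applied to the trace pasted above, summarize should report /app/lib/libkenlm.so as the module of the innermost frame and inside_dlopen as true, which matches reading the trace by hand.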

mkiol commented 1 year ago

I will try to fix one or both problems and will then kindly ask you to re-test with a beta version.

I'll get back to you when it's ready.

snden commented 1 year ago

Of course I will test the new version, I am ready. Thank you.

mkiol commented 1 year ago

Here is a 'beta' version. The crash when using Whisper models should be fixed, but the problem with DeepSpeech still exists.

You can install this test version with:

flatpak install --user https://dl.flathub.org/build-repo/33447/net.mkiol.SpeechNote.flatpakref

To run the test version:

flatpak run --branch=test net.mkiol.SpeechNote

To uninstall the test version:

flatpak uninstall app/net.mkiol.SpeechNote//test

Could you please test whether speech-to-text works with a Whisper model?

mkiol commented 1 year ago

Fix for the problem with the Whisper model: https://github.com/mkiol/dsnote/commit/afc898af2f241e87c16c348e21b54801ad79a625. The issue should be resolved in version 3.1.4.

The problem with the DeepSpeech model will be tracked in https://github.com/mkiol/dsnote/issues/10.

snden commented 1 year ago

Thank you, SpeechNote now works with the Whisper model. Compared to the Vosk model, however, it does not run in real time, and it needs an extremely long time to recognize even a few words.

mkiol commented 1 year ago

Thanks for the test.

Unfortunately, I'm not surprised. Whisper will be slow on an old CPU. It needs a lot of processing power and works well only if the processor supports modern extensions such as AVX/AVX2 or FMA.

nPrevail commented 3 months ago

Sorry to reopen this conversation, but I'm having a very similar problem on two of my devices, both running NixOS with KDE Plasma.

Neither device can keep Speech Note open for more than 10 seconds under Plasma; it crashes a few seconds after opening. However, on the same desktop, if I switch the DE from KDE Plasma to GNOME, Speech Note works fine.

I ran net.mkiol.SpeechNote --verbose. Here is the output:

net.mkiol.SpeechNote --verbose
[I] 21:57:58.895 0x7f6ffea72d00 init:49 - logging to stderr enabled
[D] 21:57:58.895 0x7f6ffea72d00 () - version: 4.4.0
[D] 21:57:58.895 0x7f6ffea72d00 parse_cpuinfo:117 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb pti ssbd ibrs ibpb stibp tpr_shadow flexpriority ept vpid ept_ad fsgsbase tsc_adjust sgx bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp vnmi md_clear flush_l1d arch_capabilities
[D] 21:57:58.895 0x7f6ffea72d00 parse_cpuinfo:125 - cpuinfo: processor-count=8, flags=[avx, avx2, fma, f16c, ]
[D] 21:57:58.895 0x7f6ffea72d00 () - translation: "en_US"
[W] 21:57:58.895 0x7f6ffea72d00 () - failed to install translation
[D] 21:57:58.895 0x7f6ffea72d00 () - starting standalone app
[D] 21:57:58.896 0x7f6ffea72d00 () - app: net.mkiol dsnote
[D] 21:57:58.896 0x7f6ffea72d00 () - config location: "/home/nprevail/.var/app/net.mkiol.SpeechNote/config"
[D] 21:57:58.896 0x7f6ffea72d00 () - data location: "/home/nprevail/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 21:57:58.896 0x7f6ffea72d00 () - cache location: "/home/nprevail/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 21:57:58.896 0x7f6ffea72d00 () - settings file: "/home/nprevail/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 21:57:58.896 0x7f6ffea72d00 () - platform: "wayland"
[D] 21:57:58.896 0x7f6ffea72d00 () - nvidia addon exists
[D] 21:57:58.896 0x7f6ffea72d00 () - enforcing num threads: 0
[D] 21:57:59.45 0x7f6ffea72d00 () - supported audio input devices:
ALSA lib ../../oss/pcm_oss.c:397:(_snd_pcm_oss_open) Cannot open device /dev/dsp
[D] 21:57:59.65 0x7f6ffea72d00 () - "pulse"
[D] 21:57:59.84 0x7f6ffea72d00 () - "default"
ALSA lib ../../../src/pcm/pcm_direct.c:2045:(snd1_pcm_direct_parse_open_conf) The field ipc_gid must be a valid group (create group audio)
[D] 21:57:59.89 0x7f6ffea72d00 () - "alsa_input.pci-0000_00_1f.3.analog-stereo"
[D] 21:57:59.89 0x7f6ffea72d00 () - "alsa_output.usb-Bose_Corporation_Bose_Revolve_SoundLink_Q3090008902742956K40710-00.analog-stereo.monitor"
[D] 21:57:59.89 0x7f6ffea72d00 () - "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
[D] 21:57:59.89 0x7f6ffea72d00 () - "bluez_output.AC_BF_71_3A_23_B6.1.monitor"
[D] 21:57:59.110 0x7f6ffea72d00 () - starting service: app-standalone
[D] 21:57:59.112 0x7f6ffea72d00 () - mbrola dir: "/app/bin"
[D] 21:57:59.112 0x7f6ffea72d00 () - espeak dir: "/app/bin"
[D] 21:57:59.112 0x7f6fe1400600 loop:75 - py executor loop started
[D] 21:57:59.116 0x7f6ffea72d00 () - module already unpacked: "rhvoicedata"
[D] 21:57:59.116 0x7f6ffea72d00 () - module already unpacked: "rhvoiceconfig"
[D] 21:57:59.121 0x7f6ffea72d00 () - module already unpacked: "espeakdata"
[D] 21:57:59.121 0x7f6ffea72d00 () - default stt model not found: "en_vosk_large"
[D] 21:57:59.121 0x7f6ffea72d00 () - default tts model not found: "en"
[D] 21:57:59.121 0x7f6ffea72d00 () - default mnt lang not found: "en"
[D] 21:57:59.121 0x7f6ffea72d00 () - new default mnt lang: "en"
[D] 21:57:59.121 0x7f6ffea72d00 () - service refresh status, new state: busy
[D] 21:57:59.121 0x7f6ffea72d00 () - service state changed: unknown => busy
[D] 21:57:59.121 0x7f6ffea72d00 () - delaying features availability
[D] 21:57:59.123 0x7f6ffea72d00 () - runtime prefix: "/app"
[D] 21:57:59.124 0x7f6ffea72d00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 21:57:59.124 0x7f6ffea72d00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 21:57:59.124 0x7f6ffea72d00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 21:57:59.124 0x7f6ffea72d00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 21:57:59.124 0x7f6ffea72d00 () - using auto qt style
[D] 21:57:59.124 0x7f6ffea72d00 () - XDG_CURRENT_DESKTOP: KDE
[D] 21:57:59.124 0x7f6ffea72d00 () - switching to style: "org.kde.desktop"
[D] 21:57:59.128 0x7f6fe1e00600 () - config version: 65 65
[D] 21:57:59.128 0x7f6fe1400600 libs_availability:61 - checking: torch cuda
[D] 21:57:59.184 0x7f6fe1e00600 () - models changed
[D] 21:58:00.54 0x7f6ffea72d00 () - starting app: app-standalone
[D] 21:58:00.56 0x7f6ffea72d00 () - app service state: unknown => busy
[W] 21:58:00.57 0x7f6ffea72d00 () - hot keys are supported only under x11
logger error: invalid format string
qrc:/qml/main.qml:340:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/main.qml:331:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/Translator.qml:29:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
logger error: invalid format string
qrc:/qml/MainToolBar.qml:282:13: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[D] 21:58:00.197 0x7f6ffea72d00 onCompleted:180 - default font pixel size: 13
[D] 21:58:00.213 0x7f6ffea72d00 () - default tts model not found: "en"
[D] 21:58:00.214 0x7f6ffea72d00 () - default mnt lang not found: "en"
[D] 21:58:00.214 0x7f6ffea72d00 () - new default mnt lang: "en"
[D] 21:58:00.214 0x7f6ffea72d00 () - service refresh status, new state: busy
[D] 21:58:00.214 0x7f6ffea72d00 () - service refresh status, new state: busy
[D] 21:58:00.229 0x7f6ffea72d00 () - trying features availability update: false
[D] 21:58:00.229 0x7f6ffea72d00 () - stt models changed
[D] 21:58:00.231 0x7f6ffea72d00 () - update listen
[D] 21:58:00.231 0x7f6ffea72d00 () - app stt configured: false => true
[D] 21:58:00.231 0x7f6ffea72d00 () - app active stt model: "" => "en_vosk_large"
[D] 21:58:00.231 0x7f6ffea72d00 () - update listen
[D] 21:58:00.231 0x7f6ffea72d00 () - tts models changed
[D] 21:58:00.231 0x7f6ffea72d00 () - update listen
[W] 21:58:00.231 0x7f6ffea72d00 () - no available tts models for in mnt
[W] 21:58:00.231 0x7f6ffea72d00 () - no available tts models for out mnt
[D] 21:58:00.231 0x7f6ffea72d00 () - ttt models changed
[D] 21:58:00.236 0x7f6ffea72d00 () - mnt langs changed
[D] 21:58:00.236 0x7f6ffea72d00 () - update listen
[W] 21:58:00.236 0x7f6ffea72d00 () - no available mnt langs
[W] 21:58:00.236 0x7f6ffea72d00 () - no available mnt out langs
[D] 21:58:00.835 0x7f6ffea72d00 () - trying features availability update: false
[D] 21:58:01.357 0x7f6fe1400600 libs_availability:69 - checking: coqui tts
[D] 21:58:01.358 0x7f6fe1400600 libs_availability:77 - checking: faster-whisper
[D] 21:58:01.851 0x7f6ffea72d00 () - trying features availability update: false
[D] 21:58:01.978 0x7f6fe1400600 libs_availability:85 - checking: transformers
[D] 21:58:01.978 0x7f6fe1400600 libs_availability:87 - checking: accelerate
[D] 21:58:02.647 0x7f6fe1400600 libs_availability:95 - checking: unikud
[D] 21:58:02.647 0x7f6fe1400600 libs_availability:106 - checking: mimic3 tts
[D] 21:58:02.851 0x7f6ffea72d00 () - trying features availability update: false
[D] 21:58:03.599 0x7f6fe1400600 libs_availability:114 - checking: gruut
[D] 21:58:03.599 0x7f6fe1400600 libs_availability:118 - checking: gruut-de
[D] 21:58:03.599 0x7f6fe1400600 libs_availability:126 - checking: gruut-es
[D] 21:58:03.599 0x7f6fe1400600 libs_availability:134 - checking: gruut-fr
[D] 21:58:03.599 0x7f6fe1400600 libs_availability:142 - checking: gruut-it
[D] 21:58:03.600 0x7f6fe1400600 libs_availability:150 - checking: gruut-ru
[D] 21:58:03.600 0x7f6fe1400600 libs_availability:158 - checking: gruut-fa
[D] 21:58:03.600 0x7f6fe1400600 libs_availability:166 - checking: gruut-sw
[D] 21:58:03.600 0x7f6fe1400600 libs_availability:174 - checking: gruut-nl
[D] 21:58:03.601 0x7f6fe1400600 libs_availability:185 - checking: mecab
[D] 21:58:03.608 0x7f6fe1400600 libs_availability:187 - checking: unidic-lite
[D] 21:58:03.608 0x7f6fe1400600 libs_availability:194 - py libs availability: [coqui-tts=true, faster-whisper=true, mimic3-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=false]
[D] 21:58:03.851 0x7f6ffea72d00 () - trying features availability update: true
[D] 21:58:03.851 0x7f6ffea72d00 () - features availability ready
[W] 21:58:03.863 0x7f6ffea72d00 has_cuda:56 - failed to open whisper-cublas lib: libcuda.so.1: cannot open shared object file: No such file or directory
[W] 21:58:03.864 0x7f6ffea72d00 has_hip:80 - failed to open whisper-hipblas lib: libwhisper-hipblas.so: cannot open shared object file: No such file or directory
[D] 21:58:03.876 0x7f6ffea72d00 () - updating model using availability
[D] 21:58:03.876 0x7f6ffea72d00 () - updating model using availability internal
[D] 21:58:03.878 0x7f6ffea72d00 () - default tts model not found: "en"
[D] 21:58:03.878 0x7f6ffea72d00 () - default mnt lang not found: "en"
[D] 21:58:03.878 0x7f6ffea72d00 () - new default mnt lang: "en"
[D] 21:58:03.878 0x7f6ffea72d00 () - service refresh status, new state: idle
[D] 21:58:03.878 0x7f6ffea72d00 () - service state changed: busy => idle
[D] 21:58:03.878 0x7f6ffea72d00 () - scan cuda: true
[D] 21:58:03.878 0x7f6ffea72d00 () - scan hip: true
[D] 21:58:03.878 0x7f6ffea72d00 () - scan opencl: true false
[D] 21:58:03.878 0x7f6ffea72d00 add_cuda_devices:281 - scanning for cuda devices
[D] 21:58:03.878 0x7f6ffea72d00 add_cuda_devices:290 - cuda version: driver=0, runtime=12020
[W] 21:58:03.878 0x7f6ffea72d00 add_cuda_devices:295 - cudaGetDeviceCount returned: 35
[D] 21:58:03.879 0x7f6ffea72d00 add_hip_devices:318 - scanning for hip devices
[W] 21:58:03.879 0x7f6ffea72d00 hip_api:223 - failed to open hip lib: libamdhip64.so: cannot open shared object file: No such file or directory
[D] 21:58:03.879 0x7f6ffea72d00 add_opencl_devices:357 - scanning for opencl devices
[D] 21:58:03.917 0x7f6ffea72d00 add_opencl_devices:374 - opencl number of platforms: 1
[D] 21:58:03.917 0x7f6ffea72d00 add_opencl_devices:399 - opencl platform: 0, name=Clover, vendor=Mesa
[D] 21:58:03.918 0x7f6ffea72d00 add_opencl_devices:413 - opencl number of devices: 0
[D] 21:58:03.918 0x7f6ffea72d00 () - service refresh status, new state: idle
[D] 21:58:03.934 0x7f6ffea72d00 () - app service state: busy => idle
[W] 21:58:03.946 0x7f6ffea72d00 () - no available mnt langs
[W] 21:58:03.946 0x7f6ffea72d00 () - no available mnt out langs
[W] 21:58:03.946 0x7f6ffea72d00 () - no available tts models for in mnt
[W] 21:58:03.946 0x7f6ffea72d00 () - no available tts models for out mnt
[W] 21:58:03.946 0x7f6ffea72d00 () - invalid task, reseting task state
[D] 21:58:03.946 0x7f6ffea72d00 () - app busy: true => false
[D] 21:58:03.947 0x7f6ffea72d00 () - stt models changed
[D] 21:58:03.947 0x7f6ffea72d00 () - update listen
[D] 21:58:03.947 0x7f6ffea72d00 () - tts models changed
[D] 21:58:03.947 0x7f6ffea72d00 () - update listen
[W] 21:58:03.947 0x7f6ffea72d00 () - no available tts models for in mnt
[W] 21:58:03.947 0x7f6ffea72d00 () - no available tts models for out mnt
[D] 21:58:03.947 0x7f6ffea72d00 () - ttt models changed
[D] 21:58:03.952 0x7f6ffea72d00 () - mnt langs changed
[D] 21:58:03.952 0x7f6ffea72d00 () - update listen
[W] 21:58:03.952 0x7f6ffea72d00 () - no available mnt langs
[W] 21:58:03.953 0x7f6ffea72d00 () - no available mnt out langs

The crash occurs at this point.