Open dreimer1986 opened 2 weeks ago
The issue is that the external/experimental esphome_audio component needed to get esp-idf working with a media player (rather than just a speaker) has some duplex limitations. Specifically, input and output get the same sample rate. There is a resampler, but for whatever reason it doesn't seem to work with the input (mic). So that means the sample rate has to be defined as exactly what the mic can handle. Which is 16kHz.
You can try it yourself! Set the rate to 48000 (lines 147, 160) and the mic will stop working. Different values for input and output won't work. Add the resampler to the input (see lines 181, 171) and the mic will still not work. This is the only setting that currently works.
Hopefully they get the resampler working with input, then you'll be able to set 48000 or whatever.
IMO, the microwakeword version is not ready for use just yet.
In my case I don't even use the microwakeword one as it does not allow me to use my own wakeword. The default one has the same audio "problem" even though there is nothing set up for 16 kHz inside the config. The wakeword one is not ready for use maybe, yes. When I try to use it I get funny errors, like:
adf_pipeline.i2s_audio: [source /data/packages/2ab3d759/esphome/onju-voice-microwakeword.yaml:153]
Unknown value 'audioin', did you mean 'audio_in', 'audio_out'?.
@jherby2k do you know if there's an issue we can track regarding this limitation on gnumpi/esphome_audio? I can't seem to find one at first glance.
There isn't - i added a comment to the existing thread you had there, but it was ignored. I've opened one now: https://github.com/gnumpi/esphome_audio/issues/42
Thank you!
Did someone already try it out with the new branch? I am not at home right now and can't test before tomorrow. I am not sure if my audioin renaming to audio_in is really correct, too. (As I said, I use the config without microwakeword right now as my personalized sarcastic catgirl OpenAI is called Luna ^^) I really wonder if I can create a wake word file for microwakeword too...
I won’t have time until tomorrow.Sent from my iPhoneOn Jun 15, 2024, at 3:18 PM, Daniel Reimer @.***> wrote: Did someone already try it out with the new branch? I am not at home right now and can't test before tomorrow. I am not sure if my audioin renaming to audio_in is really correct, too. (As I said, I use the config without microwakeword right now as my personalized sarcastic catgirl OpenAI is called Luna ^^) I really wonder if I can create a wake word file for microwakeword too...
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you were mentioned.Message ID: @.***>
I tried my luck and if there is no fault on my side it does NOT work. I used this config:
substitutions:
name: onju-voice-e27688
friendly_name: Onju Voice Satellite e27688
packages:
tetele.onju_voice_satellite:
url: https://github.com/dreimer1986/onju-voice-satellite
file: esphome/onju-voice-microwakeword_1.yaml
ref: main
refresh: 0d
esphome:
name: ${name}
name_add_mac_suffix: false
friendly_name: ${friendly_name}
api:
encryption:
key: jkgjhkulh
wifi:
ssid: xxx
password: xxx
ota:
platform: esphome
and this package https://github.com/dreimer1986/onju-voice-satellite/blob/main/esphome/onju-voice-microwakeword_1.yaml
and here the log: logs_onju-voice-e27688_run.txt
P.S. Github once more casues MADNESS in line 154 where by magic my "audio-in" written there suddently becomes "audio"+"In" after a bit. Even in here I am UNABLE to type this line as one word in without it being replaced by "audio". O_o I can build a few times and then Github magically changed it back. So take this one with a bit of salt.
Somehow I am too stupid to make things work. According to the report over there it should work now, but something I do is wrong I guess?
i was testing with gnumpi. please test the following config:
https://github.com/dwyschka/esphome-configs/blob/main/onjou-voice.yml
Adjust it to your needs pls
@dwyschka thx for the help. It was me setting input and output to 16 Bits... Now all is fine again. Music Assistant still is a mess and most of the time does not play as MP3 is not my main source and the only one it sems to like, but the normal media play feature does fine where I can select the right files easily. So... Seems to be way better :D
enforce lossy mp3 in music assistant on your device, it works better.
Tried that, but nope. Even then the stream seems to be sort of flac alike: http://192.168.181.42:8098/flow/media_player.onju_voice_e27688_onju_voice_satellite_e27688/87bd58754b47491b800608f96ed48eef.flac?ts=1718660677 it works in browser though, so this is fine, but Onju stays silent. I think that my beta fetish is in the way again ^^ I try to use 2.1.0b6 here.
Did anyone try to correct the non microwakeword version, too? I tried to (see my fork) but it seems like my copy together tinkering is not working at all.
Tried that, but nope. Even then the stream seems to be sort of flac alike: http://192.168.181.42:8098/flow/media_player.onju_voice_e27688_onju_voice_satellite_e27688/87bd58754b47491b800608f96ed48eef.flac?ts=1718660677 it works in browser though, so this is fine, but Onju stays silent. I think that my beta fetish is in the way again ^^ I try to use 2.1.0b6 here.
Which audio provider are you using? i tested with spotify and yt. but, i set the audio quality on the onju to "44100 mhz 24bit". maybe thats the fix for you?
I tried changing the supported audio output in Music Assistant to 44100 @24Bit, but still have a flac stream inside onju's logs. Do you see a mp3 there?
Subscribe cause I'm interested to make also my 7 ghome mini around my home. Noroc :-)
Subscribe cause I'm interested to make also my 7 ghome mini around my home. Noroc :-)
Same
Checklist
Is your feature request related to a problem? Please describe.
Not really a problem, just muffled audio because of the limit to 16000 Hz. I am very sure I read somewhere about a few reasons for using this limit. I just don't find it anymore.
Describe the solution you'd like
44100 Hz sounds like a nice solution, especially if the media player features are used more often.
Describe alternatives you've considered
Of course I just can use another device as media player, but I want less than more and Onju voice should make my Echo obsolete which is not much left todo for.
Additional context
-