planning: Ichigo Speech Enhancements

janhq / ichigo

Local realtime voice AI

Apache License 2.0

planning: Ichigo Speech Enhancements #93

Open dan-homebrew opened 1 month ago

dan-homebrew commented 1 month ago

Goal

Baseline enhancements, e.g. noise reduction
Processing step: pre-model, applied to the audio before it reaches Ichigo
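
To make the "pre-model" placement concrete, here is a minimal sketch of an enhancement step that runs on raw samples before they are handed to a model. Everything in it is an assumption for illustration: `noise_gate`, `transcribe_with_enhancement`, the frame size, and the thresholds are hypothetical names and values, not part of Ichigo, and the frame-energy noise gate is only a toy stand-in for a real speech enhancement method.

```python
import math
from typing import Callable, List, Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def noise_gate(samples: Sequence[float], frame_size: int = 160,
               ratio: float = 2.0, attenuation: float = 0.1) -> List[float]:
    """Toy enhancement: attenuate frames whose energy is near the noise floor.

    Assumption: the quietest frame in the input contains only background
    noise. Input with uniform loudness is therefore attenuated entirely,
    which is one reason this is a sketch and not a usable enhancer.
    """
    frames = [samples[i:i + frame_size]
              for i in range(0, len(samples), frame_size)]
    if not frames:
        return []
    noise_floor = min(frame_rms(f) for f in frames)
    out: List[float] = []
    for f in frames:
        # Keep frames clearly above the noise floor, turn the rest down.
        gain = 1.0 if frame_rms(f) > ratio * noise_floor else attenuation
        out.extend(x * gain for x in f)
    return out


def transcribe_with_enhancement(
    samples: Sequence[float],
    model: Callable[[List[float]], str],
    enhance: Callable[[Sequence[float]], List[float]] = noise_gate,
) -> str:
    """Hypothetical pipeline: enhancement runs first, then the model."""
    return model(enhance(samples))
```

Because the enhancer is passed in as a plain callable, an existing off-the-shelf enhancement library could be dropped in place of `noise_gate` without touching the model side.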

tikikun commented 1 month ago

Should we use existing speech enhancement tech to tackle this instead of developing our own?

PodsAreAllYouNeed commented 3 weeks ago

We should very much use out-of-the-box speech enhancement tech. The problem is general enough that the Ichigo use case is not differentiated enough to warrant domain-specific fine-tuning.

tikikun commented 2 weeks ago

@PodsAreAllYouNeed we can icebox this if it is not a priority

tikikun commented 2 weeks ago

also cc @dan-homebrew

PodsAreAllYouNeed commented 1 week ago

Speech Enhancement is also included in the VADHandler proposed in #91