espressif / esp-skainet

Espressif intelligent voice assistant

Difference between noise suppression methods (AIS-1517) #128

Closed PrathamG closed 4 months ago

PrathamG commented 4 months ago

Hi! I wanted to know the difference between these two noise suppression methods in the AFE configuration:

typedef enum {
    NS_MODE_SSP = 0,  // speech signal process method
    NS_MODE_NET = 1,  // deep noise suppression net method
} afe_ns_mode_t;

I could not find anything about it in the documentation. Thanks!

PrathamG commented 4 months ago

From ESP-SR v1.6 release notes:

  1. nsnet1 - The first deep noise suppression model

We are also introducing nsnet1, our first deep noise suppression model. This model is designed to enhance speech quality in noisy environments, making it perfect for real-world applications like telephony systems.

nsnet1 uses a deep learning approach to suppress background noise while preserving the original speech signal. It is trained on a large dataset to learn the patterns of noise and effectively cancel them out without distorting the speech.

This model is available for the ESP32-S3 chip. You can enable it by setting afe_config.afe_ns_mode = NS_MODE_NET;. Please refer to esp-skainet/examples/voice_communication for more details.

Note: currently only AFE_VC supports nsnet1. AFE_SR does not support nsnet1.
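
To make the constraints above concrete, here is a minimal, self-contained C sketch of the selection logic. It is not ESP-SR code: the enum is copied from the question, but `demo_afe_type_t`, `demo_afe_config_t`, `demo_pick_ns_mode`, and `demo_set_ns_mode` are hypothetical stand-ins invented for illustration. Only the field name `afe_ns_mode` and the two mode constants come from the thread. The sketch assumes the only conditions for nsnet1 are the ones the release notes state (ESP32-S3 chip, AFE_VC pipeline).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Copied from the enum quoted in the question. */
typedef enum {
    NS_MODE_SSP = 0,  /* speech signal process method */
    NS_MODE_NET = 1,  /* deep noise suppression net method (nsnet1) */
} afe_ns_mode_t;

/* Hypothetical stand-in for the two AFE pipelines named in the release notes. */
typedef enum {
    DEMO_AFE_SR = 0,  /* speech recognition pipeline */
    DEMO_AFE_VC = 1,  /* voice communication pipeline */
} demo_afe_type_t;

/* Hypothetical stand-in for the real config struct; only this field is modelled. */
typedef struct {
    afe_ns_mode_t afe_ns_mode;
} demo_afe_config_t;

/* Pick NS_MODE_NET only when both stated conditions hold; otherwise
 * fall back to the signal-processing method. */
afe_ns_mode_t demo_pick_ns_mode(bool is_esp32s3, demo_afe_type_t type)
{
    if (is_esp32s3 && type == DEMO_AFE_VC) {
        return NS_MODE_NET;
    }
    return NS_MODE_SSP;
}

/* Write the chosen mode into the (stand-in) config. */
void demo_set_ns_mode(demo_afe_config_t *cfg, bool is_esp32s3, demo_afe_type_t type)
{
    assert(cfg != NULL);
    cfg->afe_ns_mode = demo_pick_ns_mode(is_esp32s3, type);
}
```

In a real project the assignment would go on the actual AFE config, as in the release notes (afe_config.afe_ns_mode = NS_MODE_NET;), with esp-skainet/examples/voice_communication as the reference for the surrounding setup.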