Dear author, when I tried to optimize the model, I found a phenomenon that when the model processing includes a small volume of human voice segment, the segment will be suppressed or even completely removed. I'm not sure if this is related to the model or target_dB_FS, or do you have a suggested solution? . I look forward to your reply. Thank you
Dear author, when I tried to optimize the model, I found a phenomenon that when the model processing includes a small volume of human voice segment, the segment will be suppressed or even completely removed. I'm not sure if this is related to the model or target_dB_FS, or do you have a suggested solution? . I look forward to your reply. Thank you