NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0
11.72k stars 2.45k forks source link

CONF-TSASR #8709

Closed valentin7121 closed 2 months ago

valentin7121 commented 6 months ago

The article "Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio" states that the model CONF-TSASR will be open-sourced through NVIDIA NeMo toolkit. Is it already open-sourced? Will pretrained weights be available? If yes, where can I found it?

AntoineBlanot commented 6 months ago

Up! I wish to try it as well to check if reported results are as good as they say. Can we already find it and if no, when will it be available?

github-actions[bot] commented 5 months ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

valentin7121 commented 5 months ago

Still waiting for the answer, thank you.

github-actions[bot] commented 4 months ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

valentin7121 commented 3 months ago

Up

github-actions[bot] commented 2 months ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions[bot] commented 2 months ago

This issue was closed because it has been inactive for 7 days since being marked as stale.

lotuscarvedlife commented 2 months ago

wait for it