open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
https://openhlt.github.io/amphion/
MIT License
4.45k stars 379 forks source link

evaluation refactor #117

Closed VocodexElysium closed 6 months ago

VocodexElysium commented 7 months ago

✨ Description

This pull request includes an overall refactor to the Amphion Evaluation module for faster inference speed and more computational modes for compatibility, which includes:

🚧 Related Issues

110

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

@lmxue @HeCheng0625 @zhizhengwu

✅ Checklist

VocodexElysium commented 6 months ago

Please refine for WavLM features extraction. Support for both huggingface pretrained model and offline model.

support added