X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model
MIT License
579 stars 52 forks source link

merge raw dataset and mel dataset; support wavlm #38

Closed ddlBoJack closed 9 months ago

ddlBoJack commented 9 months ago

What does this PR do?

  1. merge raw dataset and mel dataset, with "dataset_config.input_type" flag
  2. support wavlm

Feature/Issue validation/testing

Please describe the tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced. Please also list any relevant details for your test configuration.

Before submitting

Thanks for contributing 🎉!