Open luobendewugong opened 2 months ago
you should change model from HF
Using the code from this PR: https://github.com/huggingface/speech-to-speech/pull/60
You can call the system with:
python s2s_pipeline.py --recv_host 0.0.0.0 --send_host 0.0.0.0 --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh
There, whisper is larger than the distil version (but it works for Chinese). The llm is larger (but it works for Chinese, you can change it for another one). The TTS is smaller than the default (and it works for chinese)
Let me know if it works :)
I merged the PR for multiple language, so you should be able to run with the code in main
Hi, thanks for your work,I encountered an issue while running the code from your repository on my Mac. The error I received is as follows:【ValueError: Please select a valid model】,
The error occurs when initializing the LightningWhisperMLX model with the following command: python s2s_pipeline.py --local_mac_optimal_settings --device mps --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh
could you please provide some guidance on what might be causing this issue or suggest any potential solutions?
Thank you very much for your help!
You can run it on mac with:
python s2s_pipeline.py --device mps --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh --mode local
But we still didn't make the changes to the MLX classes to support Chinese, so the generation will be quite slow.
If you want to make the changes, we welcome PRs! otherwise I'll adapt it in the coming days
Thank you very much, I can use Chinese now, but I feel that there are three more things I want to ask:
Using the code from this PR: #60
You can call the system with:
python s2s_pipeline.py --recv_host 0.0.0.0 --send_host 0.0.0.0 --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh
Hello, thanks for your work, I have two problems with the default model and would like to consult you:
Thank you so much!