PaddlePaddle / PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
https://www.paddlepaddle.org.cn/hub
Apache License 2.0
12.74k stars 2.07k forks source link

ge2e_fastspeech2_pwgan UnicodeDecodeError: 'gbk' codec can't decode byte 0x8c in position 2448: illegal multibyte sequence #2215

Open monkeycc opened 1 year ago

monkeycc commented 1 year ago
import paddlehub as hub
import os

model = hub.Module(name='ge2e_fastspeech2_pwgan', output_dir='D:/HUB/', speaker_audio='D:/334292423-1-208.wav')  # 指定目标音色音频文件
texts = [
    '语音的表现形式在未来将变得越来越重要']
wavs = model.generate(texts, use_gpu=True)

for text, wav in zip(texts, wavs):
    print('='*30)
    print(f'Text: {text}')
    print(f'Wav: {wav}')

D:\anaconda31114\envs\paddHUB\lib\site-packages\_distutils_hack\__init__.py:33: UserWarning: Setuptools is replacing distutils.
  warnings.warn("Setuptools is replacing distutils.")
[nltk_data] Error loading averaged_perceptron_tagger: <urlopen error
[nltk_data]     [Errno 11004] getaddrinfo failed>
[nltk_data] Error loading cmudict: <urlopen error [Errno 11004]
[nltk_data]     getaddrinfo failed>
W0228 17:21:57.891395 22916 gpu_resources.cc:61] Please NOTE: device: 0, GPU Compute Capability: 8.6, Driver API Version: 11.7, Runtime API Version: 11.6
W0228 17:21:57.895401 22916 gpu_resources.cc:91] device: 0, cuDNN Version: 8.4.
Traceback (most recent call last):
  File "f:/qian_HUB/sy.py", line 6, in <module>
    model = hub.Module(name='ge2e_fastspeech2_pwgan', output_dir='D:/HUB/', speaker_audio='D:/334292423-1-208.wav')  # 指定目标音色音频文件
  File "D:\anaconda31114\envs\paddHUB\lib\site-packages\paddlehub\module\module.py", line 393, in __new__
    module = cls.init_with_name(
  File "D:\anaconda31114\envs\paddHUB\lib\site-packages\paddlehub\module\module.py", line 516, in init_with_name
    return user_module_cls(**kwargs)
  File "C:\Users\monke\.paddlehub\modules\ge2e_fastspeech2_pwgan\module.py", line 77, in __init__
    phn_id = [line.strip().split() for line in f.readlines()]
UnicodeDecodeError: 'gbk' codec can't decode byte 0x8c in position 2448: illegal multibyte sequence

-------------------- -------------
aiofiles             23.1.0
aiohttp              3.8.4
aiosignal            1.3.1
altair               4.2.2
anyio                3.6.2
astor                0.8.1
async-timeout        4.0.2
attrs                22.2.0
audioread            3.0.0
Babel                2.11.0
bce-python-sdk       0.8.79
Bottleneck           1.3.6
certifi              2022.12.7
cffi                 1.15.1
charset-normalizer   3.0.1
click                8.1.3
colorama             0.4.6
colorlog             6.7.0
contourpy            1.0.7
cycler               0.11.0
Cython               0.29.33
datasets             2.10.0
decorator            4.4.2
dill                 0.3.4
Distance             0.1.3
easydict             1.10
editdistance         0.6.2
entrypoints          0.4
fastapi              0.92.0
ffmpeg               1.4
ffmpy                0.3.0
filelock             3.9.0
Flask                2.2.3
Flask-Babel          2.0.0
fonttools            4.38.0
frozenlist           1.3.3
fsspec               2023.1.0
future               0.18.3
g2p-en               2.1.0
g2pM                 0.1.2.5
gradio               3.19.1
h11                  0.14.0
h5py                 3.8.0
httpcore             0.16.3
httpx                0.23.3
huggingface-hub      0.12.1
idna                 3.4
imageio              2.26.0
imageio-ffmpeg       0.4.8
importlib-metadata   6.0.0
importlib-resources  5.12.0
inflect              6.0.2
itsdangerous         2.1.2
jieba                0.42.1
Jinja2               3.1.2
joblib               1.2.0
jsonlines            3.1.0
jsonschema           4.17.3
kaldiio              2.17.2
kiwisolver           1.4.4
librosa              0.8.1
linkify-it-py        2.0.0
llvmlite             0.39.1
loguru               0.6.0
lxml                 4.9.2
markdown-it-py       2.2.0
MarkupSafe           2.1.2
matplotlib           3.7.0
mdit-py-plugins      0.3.3
mdurl                0.1.1
mock                 5.0.1
moviepy              1.0.3
multidict            6.0.4
multiprocess         0.70.12.2
nara-wpe             0.0.9
nltk                 3.8.1
numba                0.56.4
numpy                1.23.5
opencv-python        4.7.0.72
opt-einsum           3.3.0
orjson               3.8.6
packaging            23.0
paddle-bfloat        0.1.7
paddle2onnx          1.0.5
paddleaudio          1.1.0
paddlefsl            1.1.0
paddlehub            2.3.1
paddlenlp            2.5.1
paddlepaddle-gpu     2.4.1.post116
paddlespeech         0.1.0a13
paddlespeech-feat    0.1.0
pandas               1.5.3
parameterized        0.8.1
pathos               0.2.8
Pillow               9.4.0
pip                  22.3.1
pkgutil_resolve_name 1.3.10
platformdirs         3.0.0
pooch                1.7.0
portalocker          2.7.0
pox                  0.3.2
ppft                 1.7.6.6
praatio              4.4.0
proglog              0.1.10
protobuf             3.20.0
pyarrow              11.0.0
pybind11             2.10.3
pycparser            2.21
pycryptodome         3.17
pydantic             1.10.5
pydub                0.25.1
pyglet               2.0.3
Pygments             2.14.0
pyparsing            3.0.9
pypinyin             0.48.0
pyrsistent           0.19.3
python-dateutil      2.8.2
python-multipart     0.0.5
pytz                 2022.7.1
pywin32              305
pyworld              0.3.2
PyYAML               6.0
pyzmq                25.0.0
rarfile              4.0
regex                2022.10.31
requests             2.28.2
resampy              0.2.2
responses            0.18.0
rfc3986              1.5.0
rich                 13.3.1
sacrebleu            2.3.1
scikit-learn         1.2.1
scipy                1.10.1
sentencepiece        0.1.96
seqeval              1.2.2
setuptools           65.6.3
six                  1.16.0
sniffio              1.3.0
soundfile            0.12.1
starlette            0.25.0
tabulate             0.9.0
TextGrid             1.5
threadpoolctl        3.1.0
timer                0.2.2
toolz                0.12.0
tqdm                 4.64.1
typeguard            2.13.3
typer                0.7.0
typing_extensions    4.5.0
uc-micro-py          1.0.1
urllib3              1.26.14
uvicorn              0.20.0
visualdl             2.4.2
webrtcvad            2.0.10
websockets           10.4
Werkzeug             2.2.3
wheel                0.38.4
win32-setctime       1.1.0
wincertstore         0.2
xxhash               3.2.0
yacs                 0.1.8
yarl                 1.8.2
zipp                 3.15.0
pyqf66 commented 1 year ago

Has the same problem with model "disco_diffusion_ernievil_base"