lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
https://lifeiteng.github.io/valle/index.html
Apache License 2.0
1.99k stars 320 forks source link

Training on wenetspeech dataset #182

Open codeking233 opened 5 months ago

codeking233 commented 5 months ago

File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl return forward_call(*args, kwargs) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/encodec/modules/seanet.py", line 63, in forward return self.shortcut(x) + self.block(x) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl return self._call_impl(*args, *kwargs) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl return forward_call(args, kwargs) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/encodec/modules/conv.py", line 204, in forward x = pad1d(x, (padding_total, extra_padding), mode=self.pad_mode) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/encodec/modules/conv.py", line 92, in pad1d padded = F.pad(x, paddings, mode, value) File "/home/twlan/anaconda3/envs/valle/lib/python3.8/site-packages/torch/nn/functional.py", line 4495, in pad return torch._C._nn.pad(input, pad, mode, value) RuntimeError: input tensor must fit into 32-bit index math

how can i solve this problem?