An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Apache License 2.0
2.73k
stars
288
forks
source link
Mode 'parallel' for EncSALayer to speed up infer on ONNX #191
'transformer-parallel' is widely used in GPT-J-6B and has been proven to have the same effect as traditional transformer.
It can be simplified as:
This saves a skip link and a LayerNorm.This can bring a slight improvement in training speed on Diffsinger.
After experimentation, this modification has shown a more significant improvement on ONNX.
The following are the experimental parameters and results. The benchmark was performed using infer_acoustic.py, and the backbone of the model used lynxnet, without using shallow diffusion.
On average, the inference speed has increased by 8%.
This change has been applied to yousaV1.42ReFlow and there have been no reports of any issues yet.
'transformer-parallel' is widely used in GPT-J-6B and has been proven to have the same effect as traditional transformer. It can be simplified as:
This saves a skip link and a LayerNorm.This can bring a slight improvement in training speed on Diffsinger.
After experimentation, this modification has shown a more significant improvement on ONNX. The following are the experimental parameters and results. The benchmark was performed using
infer_acoustic.py
, and the backbone of the model used lynxnet, without using shallow diffusion.On average, the inference speed has increased by 8%. This change has been applied to yousaV1.42ReFlow and there have been no reports of any issues yet.