Closed laiyoi closed 1 year ago
翻了好久源码解决了,在preparation/acoustic_preparation.ipynb中4.2部分的代码块中把设置augmentation_args的一段改为
augmentation_args = {}
if random_pitch_shifting['enabled']:
augmentation_args['random_pitch_shifting'] = {
'enabled': True,
'range': random_pitch_shifting['range'],
'scale': random_pitch_shifting['scale']
}
configs['use_key_shift_embed'] = True
if fixed_pitch_shifting['enabled']:
augmentation_args['fixed_pitch_shifting'] = {
'enabled': True,
'targets': fixed_pitch_shifting['targets'],
'scale': fixed_pitch_shifting['scale']
}
configs['use_spk_id'] = True
configs['num_spk'] = 1 + len(fixed_pitch_shifting['targets'])
if random_time_stretching['enabled']:
augmentation_args['random_time_stretching'] = {
'enabled': True,
'range': random_time_stretching['range'],
'domain': random_time_stretching['domain'],
'scale': random_time_stretching['scale']
}
configs['use_speed_embed'] = True
configs['augmentation_args'] = augmentation_args
即给每项加上 'enabled': True,
或者可以在config.yaml中给每项加上enabled: true
v2分支暂时没有正式发布,想使用的话,最好仔细阅读并subscribe这个issue:https://github.com/openvpi/DiffSinger/issues/74
refactor-v2分支 我在data文件夹下的数据集文件夹中启用了fixed_pitch_shifting和random_time_stretching,但是python data_gen/binarize.py --config data/expname/config.yaml时控制台显示
其中fixed_pitch_shifting': {'enabled': False还有fixed_pitch_shifting': {'enabled': False,都与我之前在acoustic_preparation.ipynb中选的不符 训练时也是这样,我认为可能与base_config: ['configs/acoustic.yaml'],有关,并且我在acoustic_preparation.ipynb中填写的训练集在tensorboard中只出现了一条