Closed by Victor-Chow 1 year ago
This model must be run with DiffSingerCascadeInfer instead of DiffSingerE2eInfer. In addition, MIDI-less mode models cannot run without explicit phone durations and an f0 sequence as inputs. Please refer to main.py and samples/*.ds to run inference from a file. Your input format is also deprecated in this fork.
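To illustrate the requirement above, here is a minimal sketch of a pre-flight check for a MIDI-less input segment. It assumes a .ds file is a JSON list of segments and that each segment carries space-separated string fields named `ph_seq`, `ph_dur`, `f0_seq` and `f0_timestep`. Those field names and the helpers `check_segment` and `check_ds_file` are assumptions for illustration, not part of the repository; compare against samples/*.ds before relying on them.

```python
import json

# Fields assumed to be required for MIDI-less inference (assumption; verify
# against the files in samples/*.ds).
REQUIRED_FIELDS = ("ph_seq", "ph_dur", "f0_seq", "f0_timestep")


def check_segment(seg: dict) -> list:
    """Return a list of human-readable problems found in one segment."""
    problems = [f"missing field: {key}" for key in REQUIRED_FIELDS if key not in seg]
    if problems:
        return problems

    phones = seg["ph_seq"].split()
    durations = [float(x) for x in seg["ph_dur"].split()]
    f0 = [float(x) for x in seg["f0_seq"].split()]
    step = float(seg["f0_timestep"])

    # Every phone needs exactly one explicit duration.
    if len(phones) != len(durations):
        problems.append(
            f"{len(phones)} phones but {len(durations)} durations"
        )
    if any(d <= 0 for d in durations):
        problems.append("phone durations must be positive")

    # The f0 curve should span the whole segment (one step of tolerance).
    if step <= 0:
        problems.append("f0_timestep must be positive")
    elif len(f0) * step < sum(durations) - step:
        problems.append("f0 sequence is shorter than the total phone duration")

    return problems


def check_ds_file(path: str) -> dict:
    """Map segment index to its problems for every faulty segment in a file."""
    with open(path, encoding="utf-8") as fh:
        segments = json.load(fh)
    report = {}
    for index, seg in enumerate(segments):
        problems = check_segment(seg)
        if problems:
            report[index] = problems
    return report
```

Calling `check_ds_file` on an input file before passing it to main.py would return an empty dict when every segment has durations and an f0 curve, and otherwise a per-segment list of what is missing.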
Output: