NetEase-GameAI / ChoreoMaster

This repository contains the dataset used in paper "ChoreoMaster: Choreography -Oriented Music Driven Dance Synthesis".
112 stars 10 forks source link

the input length #5

Open zhuyingSeu opened 2 years ago

zhuyingSeu commented 2 years ago

Thanks for your sharing.I have some questions.

  1. The input length for the style embedding network is 8s, the input length for the rhythm embedding network is 2s.Does it mean that four 2s in 8s share the same style emdedding?
  2. The duration of the paired music and motion ranges from 1s to 3s, how to adjust them to the input length 8s and 2s?
dianxin556 commented 2 years ago

+1

chuangchuang-wang commented 1 year ago

+1

moeKedama commented 11 months ago

I think just use some upsample or downsample or any method that can transform every clip unit to 2s. easy to process with .wav but hard with .fbx.

MY PROPOSAL IS IF music clips and dance clips are both 2s, Use a window of 8s (include 4 clips, stride 2s) to capture the inputs. And the corresponding 2s clip c[1] should be the second clip of the 8s window [c[0], c[1], c[2], c[3]]. So the inputs capture not only the features of the current state c[1], but also the features of the previous c[0] and subsequent ones c[2] c[3] At the begining and ending, Just padding 0 and believe the MODEL can understand it😇. only have 3 situations.

  1. c[0]=0
  2. c[2]!=0,c[3]=0
  3. c[2]=0,c[3]=0